BaBE: Enhancing Fairness via Estimation of Latent Explaining Variables
Abstract: We consider the problem of unfair discrimination between two groups and propose a pre-processing method to achieve fairness. Corrective methods like statistical parity usually lead to bad accuracy and do not really achieve fairness in situations where there is a correlation between the sensitive attribute S and the legitimate attribute E (explanatory variable) that should determine the decision. To overcome these drawbacks, other notions of fairness have been proposed, in particular, conditional statistical parity and equal opportunity. However, E is often not directly observable in the data, i.e., it is a latent variable. We may observe some other variable Z representing E, but the problem is that Z may also be affected by S, hence Z itself can be biased. To deal with this problem, we propose BaBE (Bayesian Bias Elimination), an approach based on a combination of Bayes inference and the Expectation-Maximization method, to estimate the most likely value of E for a given Z for each group. The decision can then be based directly on the estimated E. We show, by experiments on synthetic and real data sets, that our approach provides a good level of fairness as well as high accuracy.
- Dakshi Agrawal and Charu C. Aggarwal. 2001. On the Design and Quantification of Privacy Preserving Data Mining Algorithms. In Proceedings of the Twentieth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (Santa Barbara, California, USA) (PODS ’01). Association for Computing Machinery, New York, NY, USA, 247–255. https://doi.org/10.1145/375551.375602
- E. Bareinboim and J. Pearl. 2013a. Causal Transportability with Limited Experiments. In AAAI.
- Elias Bareinboim and Judea Pearl. 2013b. A General Algorithm for Deciding Transportability of Experimental Results. Journal of Causal Inference 1, 1 (may 2013), 107–134. https://doi.org/10.1515/jci-2012-0004
- Elias Bareinboim and Judea Pearl. 2014. Transportability from multiple environments with limited experiments: Completeness results. Advances in neural information processing systems 27 (2014).
- AI Fairness 360: An extensible toolkit for detecting and mitigating algorithmic bias. IBM Journal of Research and Development 63, 4/5 (2019), 4–1.
- BaBE: Enhancing Fairness via Estimation of Latent Explaining Variables. arXiv preprint arXiv:2307.02891 (2023).
- Toon Calders and Sicco Verwer. 2010. Three naive Bayes approaches for discrimination-free classification. Data Min. Knowl. Discov. 21 (09 2010), 277–292. https://doi.org/10.1007/s10618-010-0190-x
- Silvia Chiappa. 2019. Path-specific counterfactual fairness. In Proceedings of the AAAI conference on artificial intelligence, Vol. 33. 7801–7808.
- Group Fairness by Probabilistic Modeling with Latent Fair Decisions. CoRR abs/2009.09031 (2020).
- Algorithmic decision making and the cost of fairness. In Proceedings of the 23rd acm sigkdd international conference on knowledge discovery and data mining. 797–806.
- Maximum likelihood from incomplete data via the EM algorithm. Proceedings of the Royal Statistical Society B-39 (1977), 1–38.
- Fairness through awareness. In Proceedings of the 3rd innovations in theoretical computer science conference. 214–226.
- Ehab ElSalamouny and Catuscia Palamidessi. 2020. Generalized Iterative Bayesian Update and Applications to Mechanisms for Privacy Protection. In 2020 IEEE European Symposium on Security and Privacy (EuroS&P). IEEE, 490–507.
- Certifying and Removing Disparate Impact. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Sydney, NSW, Australia) (KDD ’15). Association for Computing Machinery, New York, NY, USA, 259–268. https://doi.org/10.1145/2783258.2783311
- National Center for Health Statistics. NHANES. https://www.cdc.gov/nchs/nhanes/about_nhanes.htmhttps://www.cdc.gov/nchs/nhanes/about\_nhanes.htmitalic_h italic_t italic_t italic_p italic_s : / / italic_w italic_w italic_w . italic_c italic_d italic_c . italic_g italic_o italic_v / italic_n italic_c italic_h italic_s / italic_n italic_h italic_a italic_n italic_e italic_s / italic_a italic_b italic_o italic_u italic_t _ italic_n italic_h italic_a italic_n italic_e italic_s . italic_h italic_t italic_m.
- The (Im)possibility of fairness: different value systems require different mechanisms for fair decision making. Commun. ACM 64, 4 (2021), 136–143.
- Causal inference in statistics: A primer. John Wiley & Sons.
- Take Two! SAT Retaking and College Enrollment Gaps. American Economic Journal: Economic Policy 12, 2 (May 2020), 115–58. https://doi.org/10.1257/pol.20170503
- Brenda Hannon. 2012. Test Anxiety and Performance-Avoidance Goals Explain Gender Differences in SAT-V, SAT-M, and Overall SAT Scores. Personality and individual differences 53 (11 2012), 816–820. https://doi.org/10.1016/j.paid.2012.06.003
- Equality of Opportunity in Supervised Learning. In Advances in Neural Information Processing Systems, D. Lee, M. Sugiyama, U. Luxburg, I. Guyon, and R. Garnett (Eds.), Vol. 29. Curran Associates, Inc. https://proceedings.neurips.cc/paper/2016/file/9d2682367c3935defcb1f9e247a97c0d-Paper.pdf
- Dirk Heerwegh. 2014. Small sample Bayesian factor analysis. Phuse. Retrieved from http://www. lexjansen. com/phuse/2014/sp/SP03. pdf (2014).
- Fair Inference for Discrete Latent Variable Models. arXiv preprint arXiv:2209.07044 (2022).
- Quantifying explainable discrimination and removing illegal discrimination in automated decision making. Knowledge and Information Systems 35, 3 (June 2013), 613–644. https://doi.org/10.1007/s10115-012-0584-8
- Counterfactual Fairness. CoRR abs/1703.06856 (2017). http://arxiv.org/abs/1703.06856
- Dayoon Kwon and Daniel W Belsky. 2021. A toolkit for quantification of biological age from blood chemistry and organ function test data: BioAge. GeroScience 43 (2021), 2795–2808.
- Oxidative Stress Factors Mediate the Association Between Life’s Essential 8 and Accelerated Phenotypic Aging: NHANES 2005-2018. The Journals of Gerontology: Series A (2023), glad240.
- Causal effect inference with deep latent-variable models. Advances in neural information processing systems 30 (2017).
- The Variational Fair Autoencoder. In 4th International Conference on Learning Representations, ICLR 2016, San Juan, Puerto Rico, May 2-4, 2016, Conference Track Proceedings, Yoshua Bengio and Yann LeCun (Eds.). http://arxiv.org/abs/1511.00830
- Fairness through causal awareness: Learning causal latent-variable models for biased data. In Proceedings of the conference on fairness, accountability, and transparency. 349–358.
- Geoffrey J McLachlan and Thriyambakam Krishnan. 2007. The EM algorithm and extensions. John Wiley & Sons.
- Daniel McNeish. 2016. On using Bayesian methods to address small sample problems. Structural Equation Modeling: A Multidisciplinary Journal 23, 5 (2016), 750–773.
- Biological aging and periodontal disease: analysis of NHANES (2001–2002). JDR Clinical & Translational Research 7, 2 (2022), 145–153.
- Dissecting racial bias in an algorithm used to manage the health of populations. Science 366, 6464 (2019), 447–453.
- Can you trust your model’s uncertainty? evaluating predictive uncertainty under dataset shift. Advances in neural information processing systems 32 (2019).
- Judea Pearl and Elias Bareinboim. 2014. External Validity: From Do-Calculus to Transportability Across Populations. Statist. Sci. 29, 4 (nov 2014). https://doi.org/10.1214/14-sts486
- Scikit-learn: Machine learning in Python. the Journal of machine Learning research 12 (2011), 2825–2830.
- Dataset shift in machine learning. Mit Press.
- Towards Causal Representation Learning. CoRR abs/2102.11107 (2021).
- C. F. Jeff Wu. 1983. On the Convergence Properties of the EM Algorithm. The Annals of Statistics 11, 1 (1983), 95–103.
- Blunted rest–activity circadian rhythm is associated with increased rate of biological aging: an analysis of NHANES 2011–2014. The Journals of Gerontology: Series A 78, 3 (2023), 407–413.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.