Reconciling Predictive and Statistical Parity: A Causal Approach (2306.05059v2)
Abstract: Since the rise of fair machine learning as a critical field of inquiry, many different notions on how to quantify and measure discrimination have been proposed in the literature. Some of these notions, however, were shown to be mutually incompatible. Such findings make it appear that numerous different kinds of fairness exist, thereby making a consensus on the appropriate measure of fairness harder to reach, hindering the applications of these tools in practice. In this paper, we investigate one of these key impossibility results that relates the notions of statistical and predictive parity. Specifically, we derive a new causal decomposition formula for the fairness measures associated with predictive parity, and obtain a novel insight into how this criterion is related to statistical parity through the legal doctrines of disparate treatment, disparate impact, and the notion of business necessity. Our results show that through a more careful causal analysis, the notions of statistical and predictive parity are not really mutually exclusive, but complementary and spanning a spectrum of fairness notions through the concept of business necessity. Finally, we demonstrate the importance of our findings on a real-world example.
- Machine bias: There’s software used across the country to predict future criminals. And it’s biased against blacks. ProPublica.
- On Pearl’s Hierarchy and the Foundations of Causal Inference. In Probabilistic and Causal Inference: The Works of Judea Pearl, 507–556. New York, NY, USA: Association for Computing Machinery, 1st edition.
- Fairness in machine learning. Nips tutorial, 1: 2017.
- Big data’s disparate impact. Calif. L. Rev., 104: 671.
- Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification. In Friedler, S. A.; and Wilson, C., eds., Proceedings of the 1st Conference on Fairness, Accountability and Transparency, volume 81 of Proceedings of Machine Learning Research, 77–91. NY, USA.
- Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 785–794.
- Chiappa, S. 2019. Path-specific counterfactual fairness. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, 7801–7808.
- Chouldechova, A. 2017. Fair prediction with disparate impact: A study of bias in recidivism prediction instruments. Technical Report arXiv:1703.00056, arXiv.org.
- The measure and mismeasure of fairness: A critical review of fair machine learning. arXiv preprint arXiv:1808.00023.
- Counterfactual risk assessments, evaluation, and fairness. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, 582–593.
- Darlington, R. B. 1971. Another Look At “Cultural Fairness”. Journal of Educational Measurement, 8(2): 71–82.
- The fight against financial advertisers using Facebook for digital redlining.
- Harwell, D. 2019. Federal study confirms racial bias of many facial-recognition systems, casts doubt on their expanding use. https://www.washingtonpost.com/technology/2019/12/19/federal-study-confirms-racial-bias-many-facial-recognition-systems-casts-doubt-their-expanding-use/.
- Avoiding discrimination through causal reasoning. arXiv preprint arXiv:1706.02744.
- Fairness and robustness in anti-causal prediction. arXiv preprint arXiv:2209.09423.
- Fair inference on outcomes. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 32.
- Pearl, J. 2000. Causality: Models, Reasoning, and Inference. New York: Cambridge University Press. 2nd edition, 2009.
- Causal Fairness Analysis. arXiv preprint arXiv:2207.11385. (To appear in Foundations and Trends in Machine Learning).
- fairadapt: Causal Reasoning for Fair Data Pre-processing. arXiv preprint arXiv:2110.10200.
- Fair data adaptation with quantile preservation. Journal of Machine Learning Research, 21: 242.
- Ensuring fairness in machine learning to advance health equity. Annals of internal medicine, 169(12): 866–872.
- Probabilities of causation: Bounds and identification. Annals of Mathematics and Artificial Intelligence, 28(1): 287–313.
- U.S. Census Bureau. 2018. Public Use Microdata Sample (PUMS), 2018 American Community Survey. https://www.census.gov/programs-surveys/acs/microdata/documentation.html. [Online; accessed 18-07-2021].
- Wright, S. 1934. The method of path coefficients. The annals of mathematical statistics, 5(3): 161–215.
- Pc-fairness: A unified framework for measuring causality-based fairness. Advances in neural information processing systems, 32.
- Equality of Opportunity in Classification: A Causal Approach. In Bengio, S.; Wallach, H.; Larochelle, H.; Grauman, K.; Cesa-Bianchi, N.; and Garnett, R., eds., Advances in Neural Information Processing Systems 31, 3671–3681. Montreal, Canada: Curran Associates, Inc.
- Fairness in decision-making—the causal explanation formula. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 32.
- Partial Counterfactual Identification from Observational and Experimental Data. In Proceedings of the 39th International Conference on Machine Learning.
- Drago Plecko (12 papers)
- Elias Bareinboim (34 papers)