Optimal Group Fair Classifiers from Linear Post-Processing (2405.04025v1)
Abstract: We propose a post-processing algorithm for fair classification that mitigates model bias under a unified family of group fairness criteria covering statistical parity, equal opportunity, and equalized odds, applicable to multi-class problems and both attribute-aware and attribute-blind settings. It achieves fairness by re-calibrating the output score of the given base model with a "fairness cost" -- a linear combination of the (predicted) group memberships. Our algorithm is based on a representation result showing that the optimal fair classifier can be expressed as a linear post-processing of the loss function and the group predictor, derived via using these as sufficient statistics to reformulate the fair classification problem as a linear program. The parameters of the post-processor are estimated by solving the empirical LP. Experiments on benchmark datasets show the efficiency and effectiveness of our algorithm at reducing disparity compared to existing algorithms, including in-processing, especially on larger problems.
- A Reductions Approach to Fair Classification. In Proceedings of the 35th International Conference on Machine Learning, 2018.
- Beyond Adult and COMPAS: Fair Multi-Class Prediction via Information Projection. In Advances in Neural Information Processing Systems, volume 35, 2022.
- Machine Bias. ProPublica, 2016. URL https://www.propublica.org/article/machine-bias-risk-assessments-in-criminal-sentencing.
- Neural Network Learning: Theoretical Foundations. Cambridge University Press, 1999.
- Big Data’s Disparate Impact. California Law Review, 104(3), 2016.
- Fairness and Machine Learning: Limitations and Opportunities. The MIT Press, 2023.
- AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias, 2018. arXiv:1810.01943 [cs.AI].
- Fairness in Criminal Justice Risk Assessments: The State of the Art. Sociological Methods & Research, 50(1), 2021.
- Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings. In Advances in Neural Information Processing Systems, volume 29, 2016.
- Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification. In Proceedings of the 2018 Conference on Fairness, Accountability, and Transparency, 2018.
- Building Classifiers with Independency Constraints. In 2009 IEEE International Conference on Data Mining Workshops, 2009.
- Optimized Pre-Processing for Discrimination Prevention. In Advances in Neural Information Processing Systems, volume 30, 2017.
- Towards Threshold Invariant Fair Classification. In Proceedings of the 36th Uncertainty in Artificial Intelligence Conference, 2020.
- Post-Hoc Bias Scoring is Optimal for Fair Classification. In The Twelfth International Conference on Learning Representations, 2024.
- Alexandra Chouldechova. Fair Prediction with Disparate Impact: A Study of Bias in Recidivism Prediction Instruments. Big Data, 5(2), 2017.
- Leveraging Labeled and Unlabeled Data for Consistent Fair Binary Classification. In Advances in Neural Information Processing Systems, volume 32, 2019.
- Fair Regression with Wasserstein Barycenters. In Advances in Neural Information Processing Systems, volume 33, 2020.
- Unprocessing Seven Years of Algorithmic Fairness. In The Twelfth International Conference on Learning Representations, 2024.
- Bias in Bios: A Case Study of Semantic Representation Bias in a High-Stakes Setting. In Proceedings of the 2019 Conference on Fairness, Accountability, and Transparency, 2019.
- Fairness guarantee in multi-class classification, 2023. arXiv:2109.13642 [math.ST].
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, volume 1, 2019.
- Retiring Adult: New Datasets for Fair Machine Learning. In Advances in Neural Information Processing Systems, volume 34, 2021.
- Fairness Through Awareness. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference, 2012.
- Model-as-a-Service (MaaS): A Survey, 2023. arxiv:2311.05804 [cs.AI].
- Fair learning with Wasserstein barycenters for non-decomposable performance measures. In Proceedings of the 26th International Conference on Artificial Intelligence and Statistics, 2023.
- Equality of Opportunity in Supervised Learning. In Advances in Neural Information Processing Systems, volume 29, 2016.
- Multicalibration: Calibration for the (Computationally-Identifiable) Masses. In Proceedings of the 35th International Conference on Machine Learning, 2018.
- Wasserstein Fair Classification. In Proceedings of the 36th Uncertainty in Artificial Intelligence Conference, 2020.
- Data preprocessing techniques for classification without discrimination. Knowledge and Information Systems, 33(1), 2012.
- Inherent Trade-Offs in the Fair Determination of Risk Scores. In 8th Innovations in Theoretical Computer Science Conference, volume 67, 2017.
- Ron Kohavi. Scaling Up the Accuracy of Naive-Bayes Classifiers: A Decision-Tree Hybrid. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, 1996.
- Projection to Fairness in Statistical Learning, 2020. arxiv:2005.11720 [cs.LG].
- Detecting and Correcting for Label Shift with Black Box Predictors. In Proceedings of the 35th International Conference on Machine Learning, 2018.
- Conditional Adversarial Domain Adaptation. In Advances in Neural Information Processing Systems, volume 31, 2018.
- The Cost of Fairness in Binary Classification. In Proceedings of the 2018 Conference on Fairness, Accountability, and Transparency, 2018.
- Foundations of Machine Learning. Adaptive Computation and Machine Learning Series. The MIT Press, second edition, 2018.
- Combinatorial Optimization: Algorithms and Complexity. Dover Publications, 1998.
- Null It Out: Guarding Protected Attributes by Iterative Nullspace Projection. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020.
- Understanding Machine Learning: From Theory to Algorithms. Cambridge University Press, 2014.
- Distribution Calibration for Regression. In Proceedings of the 36th International Conference on Machine Learning, 2019.
- Black-Box Tuning for Language-Model-as-a-Service. In Proceedings of the 39th International Conference on Machine Learning, 2022.
- FRAPPÉ: A Group Fairness Framework for Post-Processing Everything, 2024. arXiv:2312.02592 [cs.LG].
- Learning Non-Discriminatory Predictors. In Proceedings of the 30th Conference on Learning Theory, 2017.
- Efficient Post-Processing for Equal Opportunity in Fair Multi-Class Classification, 2023. URL https://openreview.net/forum?id=zKjSmbYFZe.
- Fair and Optimal Classification via Post-Processing. In Proceedings of the 40th International Conference on Machine Learning, 2023.
- Fairness with Overlapping Groups. In Advances in Neural Information Processing Systems, volume 33, 2020.
- Fairness Constraints: Mechanisms for Fair Classification. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, 2017.
- Learning Fair Representations. In Proceedings of the 30th International Conference on Machine Learning, 2013.
- Bayes-Optimal Classifiers under Group Fairness, 2022. arXiv:2202.09724 [stat.ML].
- Minimax Optimal Fair Classification with Bounded Demographic Disparity, 2024. arXiv:2403.18216 [stat.ML].
- Mitigating Unwanted Biases with Adversarial Learning. In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, 2018.
- Inherent Tradeoffs in Learning Fair Representations. In Advances in Neural Information Processing Systems, volume 32, 2019.
- Conditional Learning of Fair Representations. In The Eighth International Conference on Learning Representations, 2020.
- Ruicheng Xian (9 papers)
- Han Zhao (159 papers)