Algorithmic decision making methods for fair credit scoring (2209.07912v3)
Abstract: The effectiveness of machine learning in evaluating the creditworthiness of loan applicants has long been demonstrated. However, there is concern that automated decision-making processes may treat groups or individuals unequally, potentially leading to discriminatory outcomes. This paper addresses this issue by evaluating 12 leading bias mitigation methods across 5 fairness metrics, and by assessing their accuracy and potential profitability for financial institutions. Our analysis identifies the challenges of achieving fairness while maintaining accuracy and profitability, and highlights both the most and the least successful mitigation methods. Ultimately, our research serves to bridge the gap between experimental machine learning and its practical applications in the finance industry.