Striking a Balance in Fairness for Dynamic Systems Through Reinforcement Learning (2401.06318v1)
Abstract: While significant advancements have been made in the field of fair machine learning, the majority of studies focus on scenarios where the decision model operates on a static population. In this paper, we study fairness in dynamic systems where sequential decisions are made. Each decision may shift the underlying distribution of features or user behavior. We model the dynamic system through a Markov Decision Process (MDP). By acknowledging that traditional fairness notions and long-term fairness are distinct requirements that may not necessarily align with one another, we propose an algorithmic framework to integrate various fairness considerations with reinforcement learning using both pre-processing and in-processing approaches. Three case studies show that our method can strike a balance between traditional fairness notions, long-term fairness, and utility.
- K. D. Johnson, D. P. Foster, and R. A. Stine, “Impartial predictive modeling: Ensuring fairness in arbitrary models,” Statistical Science, p. 1, 2016.
- R. S. Baker and A. Hawn, “Algorithmic bias in education,” International Journal of Artificial Intelligence in Education, pp. 1–41, 2021.
- L. Zhang, Y. Wu, and X. Wu, “Achieving non-discrimination in data release,” Proceedings of ACM SIGKDD’17, 2017.
- M. S. A. Lee and L. Floridi, “Algorithmic fairness in mortgage lending: from absolute conditions to relational trade-offs,” Minds and Machines, vol. 31, no. 1, pp. 165–191, 2021.
- C. Schumann, J. Foster, N. Mattei, and J. Dickerson, “We need fairness and explainability in algorithmic hiring,” in International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2020.
- R. Berk, H. Heidari, S. Jabbari, M. Kearns, and A. Roth, “Fairness in criminal justice risk assessments: The state of the art,” Sociological Methods & Research, vol. 50, no. 1, pp. 3–44, 2021.
- Y. Li, H. Chen, S. Xu, Y. Ge, J. Tan, S. Liu, and Y. Zhang, “Fairness in recommendation: A survey,” arXiv preprint arXiv:2205.13619, 2022.
- Z. Tang, J. Zhang, and K. Zhang, “What-is and how-to for fairness in machine learning: A survey, reflection, and perspective,” arXiv preprint arXiv:2206.04101, 2022.
- D. Pessach and E. Shmueli, “A review on fairness in machine learning,” ACM Computing Surveys (CSUR), vol. 55, no. 3, pp. 1–44, 2022.
- G. Alves, F. Bernier, M. Couceiro, K. Makhlouf, C. Palamidessi, and S. Zhioua, “Survey on fairness notions and related tensions,” EURO Journal on Decision Processes, p. 100033, 2023.
- L. T. Liu, S. Dean, E. Rolf, M. Simchowitz, and M. Hardt, “Delayed impact of fair machine learning,” in Proceedings of ICML’18, ser. Proceedings of Machine Learning Research, J. G. Dy and A. Krause, Eds., vol. 80. PMLR, 2018, pp. 3156–3164.
- D. Ensign, S. A. Friedler, S. Neville, C. Scheidegger, and S. Venkatasubramanian, “Runaway feedback loops in predictive policing,” in Conference on Fairness, Accountability and Transparency. PMLR, 2018, pp. 160–171.
- L. T. Liu, S. Dean, E. Rolf, M. Simchowitz, and M. Hardt, “Delayed impact of fair machine learning,” in International Conference on Machine Learning. PMLR, 2018, pp. 3150–3158.
- X. Zhang, R. Tu, Y. Liu, M. Liu, H. Kjellstrom, K. Zhang, and C. Zhang, “How do fair decisions fare in long-term qualification?” Advances in Neural Information Processing Systems, vol. 33, pp. 18 457–18 469, 2020.
- S. Jabbari, M. Joseph, M. Kearns, J. Morgenstern, and A. Roth, “Fairness in reinforcement learning,” in International Conference on Machine Learning. PMLR, 2017, pp. 1617–1626.
- A. D’Amour, H. Srinivasan, J. Atwood, P. Baljekar, D. Sculley, and Y. Halpern, “Fairness is not static: deeper understanding of long term fairness via simulation studies,” in Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, 2020, pp. 525–534.
- J. Atwood, H. Srinivasan, Y. Halpern, and D. Sculley, “Fair treatment allocations in social networks,” arXiv preprint arXiv:1911.05489, 2019.
- E. Y. Yu, Z. Qin, M. K. Lee, and S. Gao, “Policy optimization with advantage regularization for long-term fairness in decision systems,” NeurIPS, 2022.
- M. Wen, O. Bastani, and U. Topcu, “Algorithms for fairness in sequential decision making,” in International Conference on Artificial Intelligence and Statistics. PMLR, 2021, pp. 1144–1152.
- L. Hu and Y. Chen, “A short-term intervention for long-term fairness in the labor market,” in Proceedings of the 2018 World Wide Web Conference, 2018, pp. 1389–1398.
- F. Kamiran and T. Calders, “Classifying without discriminating,” in 2009 2nd international conference on computer, control and communication. IEEE, 2009, pp. 1–6.
- A. Bower, S. N. Kitchen, L. Niss, M. J. Strauss, A. Vargas, and S. Venkatasubramanian, “Fair pipelines,” arXiv preprint arXiv:1707.00391, 2017.
- C. Dwork and C. Ilvento, “Fairness under composition,” arXiv preprint arXiv:1806.06122, 2018.
- E. Creager, D. Madras, T. Pitassi, and R. Zemel, “Causal modeling for fairness in dynamical systems,” in International Conference on Machine Learning. PMLR, 2020, pp. 2185–2195.
- Y. Hu, Y. Wu, L. Zhang, and X. Wu, “Fair multiple decision making through soft interventions,” Advances in Neural Information Processing Systems, vol. 33, pp. 17 965–17 975, 2020.
- Y. Hu and L. Zhang, “Achieving long-term fairness in sequential decision making,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 36, 2022, pp. 9549–9557.
- Y. Ge, S. Liu, R. Gao, Y. Xian, Y. Li, X. Zhao, C. Pei, F. Sun, J. Ge, W. Ou et al., “Towards long-term fairness in recommendation,” in Proceedings of the 14th ACM International Conference on Web Search and Data Mining, 2021, pp. 445–453.
- F. Kamiran and T. Calders, “Data preprocessing techniques for classification without discrimination,” Knowledge and Information Systems, vol. 33, no. 1, pp. 1–33, 2012.
- M. Feldman, S. A. Friedler, J. Moeller, C. Scheidegger, and S. Venkatasubramanian, “Certifying and removing disparate impact,” in proceedings of ACM SIGKDD’15, 2015, pp. 259–268.
- D. Xu, S. Yuan, L. Zhang, and X. Wu, “Fairgan: Fairness-aware generative adversarial networks,” in 2018 IEEE International Conference on Big Data (Big Data). IEEE, 2018, pp. 570–575.
- N. Quadrianto and V. Sharmanska, “Recycling privileged learning and distribution matching for fairness,” Advances in neural information processing systems, vol. 30, 2017.
- M. P. Kim, A. Korolova, G. N. Rothblum, and G. Yona, “Preference-informed fairness,” arXiv preprint arXiv:1904.01793, 2019.
- M. B. Zafar, I. Valera, M. Rodriguez, K. Gummadi, and A. Weller, “From parity to preference-based notions of fairness in classification,” Advances in neural information processing systems, vol. 30, 2017.
- M. Hardt, E. Price, and N. Srebro, “Equality of opportunity in supervised learning,” in NIPS, 2016.
- S. Corbett-Davies, E. Pierson, A. Feller, S. Goel, and A. Huq, “Algorithmic decision making and the cost of fairness,” in Proceedings of the 23rd acm sigkdd international conference on knowledge discovery and data mining, 2017, pp. 797–806.
- A. K. Menon and R. C. Williamson, “The cost of fairness in binary classification,” in Conference on Fairness, accountability and transparency. PMLR, 2018, pp. 107–118.
- J. Schulman, P. Moritz, S. Levine, M. Jordan, and P. Abbeel, “High-dimensional continuous control using generalized advantage estimation,” arXiv preprint arXiv:1506.02438, 2015.
- R. S. Sutton, D. McAllester, S. Singh, and Y. Mansour, “Policy gradient methods for reinforcement learning with function approximation,” Advances in neural information processing systems, vol. 12, 1999.
- J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” arXiv preprint arXiv:1707.06347, 2017.
- J. Schulman, S. Levine, P. Abbeel, M. Jordan, and P. Moritz, “Trust region policy optimization,” in International conference on machine learning. PMLR, 2015, pp. 1889–1897.
- S. Barocas, M. Hardt, and A. Narayanan, “Fairness in machine learning,” Nips tutorial, vol. 1, p. 2017, 2017.
- M. Girvan and M. E. Newman, “Community structure in social and biological networks,” Proceedings of the national academy of sciences, vol. 99, no. 12, pp. 7821–7826, 2002.
- Yaowei Hu (4 papers)
- Jacob Lear (2 papers)
- Lu Zhang (373 papers)