Fair Machine Guidance to Enhance Fair Decision Making in Biased People (2404.05228v1)
Abstract: Teaching unbiased decision-making is crucial for addressing biased decision-making in daily life. Although both raising awareness of personal biases and providing guidance on unbiased decision-making are essential, the latter topics remains under-researched. In this study, we developed and evaluated an AI system aimed at educating individuals on making unbiased decisions using fairness-aware machine learning. In a between-subjects experimental design, 99 participants who were prone to bias performed personal assessment tasks. They were divided into two groups: a) those who received AI guidance for fair decision-making before the task and b) those who received no such guidance but were informed of their biases. The results suggest that although several participants doubted the fairness of the AI system, fair machine guidance prompted them to reassess their views regarding fairness, reflect on their biases, and modify their decision-making criteria. Our findings provide insights into the design of AI systems for guiding fair decision-making in humans.
- A Reductions Approach to Fair Classification. In Proceedings of the 2018 International Conference on Machine Learning. https://doi.org/10.48550/arXiv.1803.02453
- Machine Bias. (2022), 254–264.
- Becoming Homeless: A Human Experience. In Proceedings of the 2018 ACM SIGGRAPH Virtual, Augmented, and Mixed Reality. https://doi.org/10.1145/3226552.3226576
- CrowDEA: Multi-view Idea Prioritization with Crowds. In Proceedings of the 2020 AAAI Conference on Human Computation and Crowdsourcing. https://doi.org/10.48550/arXiv.2008.02354
- “If I Had All the Time in the World”: Ophthalmologists’ Perceptions of Anchoring Bias Mitigation in Clinical AI Support. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3544548.3581513
- Barry Becker and Ronny Kohavi. 1996. Adult. UCI Machine Learning Repository. https://doi.org/10.24432/C5XW20
- Marianne Bertrand and Sendhil Mullainathan. 2004. Are Emily and Greg More Employable than Lakisha and Jamal? A Field Experiment on Labor Market Discrimination. American Economic Review 94, 4 (2004), 991–1013. https://doi.org/10.1257/0002828042002561
- Jack W. Brehm. 1966. A Theory of Psychological Reactance. 135 (1966).
- A Systematic Review of Algorithm Aversion in Augmented Decision Making. Journal of Behavioral Decision Making 33, 2 (2020), 220–239. https://doi.org/10.1002/bdm.2155
- Toon Calders and Sicco Verwer. 2010. Three Naive Bayes Approaches for Discrimination-free Classification. Data Mining and Knowledge Discovery 21, 2 (2010), 277–292. https://doi.org/10.1007/s10618-010-0190-x
- The Mixed Effects of Online Diversity Training. National Academy of Sciences 116, 16 (2019), 7778–7783. https://doi.org/10.1073/pnas.1816076116
- News Literacy Education in a Polarized Political Climate: How Games Can Teach Youth to Spot Misinformation. Harvard Kennedy School Misinformation Review 1 (2020). https://doi.org/10.37016/mr-2020-020
- Near-optimal Machine Teaching via Explanatory Teaching Sets. In Proceedings of the 2018 International Conference on Artificial Intelligence and Statistics.
- How Child Welfare Workers Reduce Racial Disparities in Algorithmic Decisions. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3491102.3501831
- Soliciting Stakeholders’ Fairness Notions in Child Maltreatment Predictive Systems. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3411764.3445308
- Are Two Heads Better than One in AI-assisted Decision Making? Comparing the Behavior and Performance of Groups and Individuals in Human-AI Collaborative Recidivism Risk Assessment. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3544548.3581015
- A Case for Humans-in-the-Loop: Decisions in the Presence of Erroneous Algorithmic Scores. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3313831.3376638
- Algorithm Aversion: People Erroneously Avoid Algorithms after Seeing Them Err. Journal of Experimental Psychology: General 144, 1 (2015), 114. https://doi.org/10.2139/ssrn.2466040
- Method for Appropriating the Brief Implicit Association Test to Elicit Biases in Users. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3491102.3517570
- Fairness through Awareness. In Proceedings of the 2012 Innovations in Theoretical Computer Science Conference. https://doi.org/10.1145/2090236.2090255
- Diversity of Inference Strategies can Enhance the ‘Wisdom-of-Crowds’ effect. Palgrave Communications 4, 1 (2018), 1â9. https://doi.org/10.1057/s41599-018-0161-1
- Krzysztof Z. Gajos and Lena Mamykina. 2022. Do People Engage Cognitively with AI? Impact of AI Assistance on Incidental Learning. In Proceedings of the 2022 International Conference on Intelligent User Interfaces. https://doi.org/10.1145/3490099.3511138
- Adam D. Galinsky and Gordon B. Moskowitz. 2007. Further Ironies of Suppression: Stereotype and Counterstereotype Accessibility. Journal of Experimental Social Psychology 43, 5 (2007), 833–841. https://doi.org/10.1016/j.jesp.2006.09.001
- Meric Altug Gemalmaz and Ming Yin. 2022. Understanding Decision Subjects’ Fairness Perceptions and Retention in Repeated Interactions with AI-based Decision Systems. In Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society. https://doi.org/10.1145/3514094.3534201
- Alexandra Goedderz and Adam Hahn. 2022. Biases Left Unattended: People are Surprised at Racial Bias Feedback until They Pay Attention to Their Biased Reactions. Journal of Experimental Social Psychology 102 (2022), 104374. https://doi.org/10.1016/j.jesp.2022.104374
- Claudia Goldin and Cecilia Rouse. 2000. Orchestrating Impartiality: The Impact of “Blind” Auditions on Female Musicians. American Economic Review 90, 4 (2000), 715–741. https://doi.org/10.1257/aer.90.4.715
- Measuring Individual Differences in Implicit Cognition: The Implicit Association Test. Journal of Personality and Social Psychology 74, 6 (1998), 1464. https://doi.org/10.1037/0022-3514.74.6.1464
- Marie-Pascale Grimon and Christopher Mills. 2022. The Impact of Algorithmic Tools on Child Protection: Evidence from a Randomized Controlled Trial. Job Market Paper (2022).
- Kent D. Harber. 1998. Feedback to Minorities: Evidence of a Positive Bias. Journal of Personality and Social Psychology 74, 3 (1998), 622. https://doi.org/10.1037/0022-3514.74.3.622
- Equality of Opportunity in Supervised Learning. Advances in Neural Information Processing Systems 29 (2016). https://doi.org/10.48550/arXiv.1610.02413
- Christopher Harris. 2020. Mitigating Cognitive Biases in Machine Learning Algorithms for Decision Making. In Proceedings of the 2020 Companion Proceedings of the Web Conference. https://doi.org/10.1145/3366424.3383562
- Peter Hase and Mohit Bansal. 2020. Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior?. In Proceedings of the 2020 Annual Meeting of the Association for Computational Linguistics. https://doi.org/10.18653/v1/2020.acl-main.491
- Knowing about Knowing: An Illusion of Human Competence Can Hinder Appropriate Reliance on AI Systems. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3544548.3581025
- Edward R. Hirt and Keith D. Markman. 1995. Multiple Explanation: A Consider-an-alternative Strategy for Debiasing Judgments. Journal of Personality and Social Psychology 69, 6 (1995), 1069. https://doi.org/10.1037/0022-3514.69.6.1069
- Hans Hofmann. 1994. Statlog (German Credit Data). UCI Machine Learning Repository. https://doi.org/10.24432/C5NC77
- Teaching Multiple Concepts to a Forgetful Learner. Advances in Neural Information Processing Systems 32 (2019). https://doi.org/10.5555/3454287.3454651
- Generalized Demographic Parity for Group Fairness. In Proceedings of the 2022 International Conference on Learning Representations.
- “Because AI is 100% Right and Safe”: User Attitudes and Sources of AI Authority in India. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3491102.3517533
- Jennifer Y. Kim and Loriann Roberson. 2022. I’m Biased and so are You. What Should Organizations Do? A Review of Organizational Implicit-bias Training Programs. Consulting Psychology Journal 74, 1 (2022), 19. https://doi.org/10.1037/cpb0000211
- Sneaking in through the Back Door: How Category-based Stereotype Suppression Leads to Rebound in Feature-based Effects. Journal of Experimental Social Psychology 44, 3 (2008), 833–839.
- Regina König and Angela Heine. 2023. Learning to Detect Sexism: An Evaluation of the Effects of a Brief Video-based Intervention Using ROC Analysis. Frontiers in Psychology 13 (2023), 1005633. https://doi.org/10.1016/j.jesp.2007.07.011
- To Self-persuade or be Persuaded: Examining Interventions for Users’ Privacy Setting Selection. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3491102.3502009
- Boosting Medical Diagnostics by Pooling Independent Judgments. Proceedings of the National Academy of Sciences of the United States of America 113, 31 (2016), 8777–8782. https://doi.org/10.1073/pnas.1601827113
- Towards a Science of Human-AI Decision Making: An Overview of Design Space in Empirical Human-subject Studies. In Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency. https://doi.org/10.1145/3593013.3594087
- Richard P. Larrick. 2004. Debiasing. Blackwell Handbook of Judgment and Decision Making (2004), 316–338. https://doi.org/10.1002/9780470752937.ch16
- Race, Gender and Beauty: The Effect of Information Provision on Online Hiring Biases. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3313831.3376874
- Yiqiao Liao and Parinaz Naghizadeh. 2023. Social Bias Meets Data Bias: The Impacts of Labeling and Measurement Errors on Fairness Criteria. In Proceedings of the 2023 AAAI Conference on Artificial Intelligence. https://doi.org/10.1609/aaai.v37i7.26054
- Sarah Lichtenstein and Baruch Fischhoff. 1980. Training for Calibration. Organizational Behavior and Human Performance 26, 2 (1980), 149–171. https://doi.org/10.1016/0030-5073(80)90052-5
- Human Perceptions on Moral Responsibility of AI: A Case Study in AI-assisted Bail Decision-making. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3411764.3445260
- Iterative Machine Teaching. In Proceedings of the 2017 International Conference on Machine Learning. https://doi.org/10.48550/arXiv.1705.10470
- Towards Black-box Iterative Machine Teaching. In Proceedings of the 2018 International Conference on Machine Learning. https://doi.org/10.48550/arXiv.1710.07742
- Iterative Teaching by Label Synthesis. Advances in Neural Information Processing Systems 34 (2021), 21681–21695. https://doi.org/10.48550/arXiv.2110.14432
- Boosting People’s Ability to Detect Microtargeted Advertising. Scientific Reports 11, 1 (2021), 15541. https://doi.org/10.1038/s41598-021-94796-z
- The Chicago Face Database: A Free Stimulus Set of Faces and Norming Data. Behavior Research Methods 47, 4 (2015), 1122–1135. https://doi.org/10.3758/s13428-014-0532-5
- Who Should I Trust: AI or Myself? Leveraging Human and AI Correctness Likelihood to Promote Appropriate Trust in AI-assisted Decision-making. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3544548.3581058
- Zilin Ma and Krzysztof Z. Gajos. 2022. Not Just a Preference: Reducing Biased Decision-making on Dating Websites. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3491102.3517587
- A Survey on Bias and Fairness in Machine Learning. Comput. Surveys 54, 6 (2021), 1–35. https://doi.org/10.1145/3457607
- Consequences of Stereotype Suppression: Stereotypes on AND not on the Rebound. Journal of Experimental Social Psychology 34, 4 (1998), 355–377. https://doi.org/10.1006/jesp.1998.1355
- Toward Involving End-users in Interactive Human-in-the-loop AI Fairness. ACM Transactions on Interactive Intelligent Systems 12, 3 (2022), 1–30. https://doi.org/10.1145/3514258
- Dong Nguyen. 2018. Comparing Automatic and Human Evaluation of Local Explanations for Text Classification. In Proceedings of the 2018 North American Chapter of the Association for Computational Linguistics. https://doi.org/10.18653/v1/N18-1097
- Individuals Fail to Reap the Collective Benefits of Diversity Because of Over-reliance on Personal Information. Journal of the Royal Society Interface 15, 142 (2018), 20180155. https://doi.org/10.1098/rsif.2018.0155
- Tomoko Oe and Takashi Oka. 2003. Overcoming the Ironic Rebound: Effective and Ineffective Strategies for Stereotype Suppression. Progress in Asian Social Psychology: Conceptual and Empirical Contributions. 331 (2003), 233–246. https://doi.org/10.1177/2372732220983840
- The Relative Influence of Advice from Human Experts and Statistical Methods on Forecast Adjustments. Journal of Behavioral Decision Making 22, 4 (2009), 390–409. https://doi.org/10.1002/bdm.637
- Understanding the Impact of Explanations on Advice-taking: A User Study for AI-based Clinical Decision Support Systems. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3491102.3502104
- Designing Fair AI in Human Resource Management: Understanding Tensions Surrounding Algorithmic Evaluation and Envisioning Stakeholder-centered Solutions. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3491102.3517672
- Putting Yourself in the Skin of a Black Avatar Reduces Implicit Racial Bias. Consciousness and Cognition 22, 3 (2013), 779–787. https://doi.org/10.1016/j.concog.2013.04.016
- What You See is What You Get? The Impact of Representation Criteria on Human Bias in Hiring. In Proceedings of the 2019 AAAI Conference on Human Computation and Crowdsourcing. https://doi.org/10.48550/arXiv.1909.03567
- Attitudes and Attitude Change. Annual Review of Psychology 48, 1 (1997), 609–647. https://doi.org/10.1146/annurev-psych-122216-011911
- Manipulating and Measuring Model Interpretability. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3411764.3445315
- Iterative Teaching by Data Hallucination. In Proceedings of the 2023 International Conference on Artificial Intelligence and Statistics. https://doi.org/10.48550/arXiv.2210.17467
- Amy Rechkemmer and Ming Yin. 2022. When Confidence Meets Accuracy: Exploring the Effects of Multiple Performance Indicators on Trust in Machine Learning Models. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3491102.3501967
- Guidelines for Adolescent Well Care: Is There Consensus? Current Opinion in Pediatrics 18, 4 (2006), 365–370. https://doi.org/10.1097/01.mop.0000236383.41531.8e
- Jon Roozenbeek and Sander Van der Linden. 2019. Fake News Game Confers Psychological Resistance Against Online Misinformation. Palgrave Communications 5, 1 (2019), 1–10. https://doi.org/10.2478/plc-2013-0009
- Cheap Talk and Credibility: The Consequences of Confidence and Accuracy on Advisor Credibility and Persuasiveness. Organizational Behavior and Human Decision Processes 121, 2 (2013), 246–255. https://doi.org/10.1016/j.obhdp.2013.02.001
- Measuring Non-expert Comprehension of Machine Learning Fairness Metrics. In Proceedings of the 2020 International Conference on Machine Learning. https://doi.org/10.48550/arXiv.2001.00089
- A Framework of High-stakes Algorithmic Decision-making for the Public Sector Developed through a Case Study of Child-welfare. Proceedings of the ACM on Human-Computer Interaction 5, CSCW2 (2021), 1–41. https://doi.org/10.1145/3476089
- Algorithmic Appreciation or Aversion? The Moderating Effects of Uncertainty on Algorithmic Decision Making. In Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3544549.3585908
- Effects of Distance Between Initial Estimates and Advice on Advice Utilization. Judgment and Decision Making 10, 2 (2015), 144–171. https://doi.org/10.1017/S1930297500003922
- Ignore, Trust, or Negotiate: Understanding Clinician Acceptance of AI-based Treatment Recommendations in Health Care. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3544548.3581075
- Shyam Sundar and Jinyoung Kim. 2019. Machine Heuristic: When We Trust Computers More than Humans with Our Personal Information. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3290605.3300768
- Capable but Amoral? Comparing AI and Human Expert Collaboration in Ethical Decision Making. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3491102.3517732
- Teaching an Active Learner with Contrastive Examples. Advances in Neural Information Processing Systems 34 (2021), 17968–17980. https://doi.org/10.48550/arXiv.2110.14888
- Factors Influencing Perceived Fairness in Algorithmic Decision-making: Algorithm Outcomes, Development Procedures, and Individual Differences. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. https://doi.org/10.1145/3313831.3376813
- Xinru Wang and Ming Yin. 2021. Are Explanations Helpful? A Comparative Study of the Effects of Explanations in AI-assisted Decision-making. In Proceedings of the 2021 International Conference on Intelligent User Interfaces. https://doi.org/10.1145/3397481.3450650
- Timothy D. Wilson and Nancy Brekke. 1994. Mental Contamination and Mental Correction: Unwanted Influences on Judgments and Evaluations. Psychological Bulletin 116, 1 (1994), 117. https://doi.org/10.1037/0033-2909.116.1.117
- Iterative Classroom Teaching. In Proceedings of the 2019 AAAI Conference on Artificial Intelligence. https://doi.org/10.1609/aaai.v33i01.33015684
- Mitigating Unwanted Biases with Adversarial Learning. In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society. https://doi.org/10.1145/3278721.3278779
- Unlearn What You Have Learned: Adaptive Crowd Teaching with Exponentially Decayed Memory Learners. In Proceedings of the 2018 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. https://doi.org/10.1145/3219819.3219952
- Mingzhe Yang (9 papers)
- Hiromi Arai (9 papers)
- Naomi Yamashita (10 papers)
- Yukino Baba (13 papers)