Designing Algorithmic Recommendations to Achieve Human-AI Complementarity (2405.01484v2)
Abstract: Algorithms frequently assist, rather than replace, human decision-makers. However, the design and analysis of algorithms often focus on predicting outcomes and do not explicitly model their effect on human decisions. This discrepancy between the design and role of algorithmic assistants becomes particularly concerning in light of empirical evidence that suggests that algorithmic assistants again and again fail to improve human decisions. In this article, we formalize the design of recommendation algorithms that assist human decision-makers without making restrictive ex-ante assumptions about how recommendations affect decisions. We formulate an algorithmic-design problem that leverages the potential-outcomes framework from causal inference to model the effect of recommendations on a human decision-maker's binary treatment choice. Within this model, we introduce a monotonicity assumption that leads to an intuitive classification of human responses to the algorithm. Under this assumption, we can express the human's response to algorithmic recommendations in terms of their compliance with the algorithm and the active decision they would take if the algorithm sends no recommendation. We showcase the utility of our framework using an online experiment that simulates a hiring task. We argue that our approach can make sense of the relative performance of different recommendation algorithms in the experiment and can help design solutions that realize human-AI complementarity. Finally, we leverage our approach to derive minimax optimal recommendation algorithms that can be implemented with machine learning using limited training data.
- Combining Human Expertise with Artificial Intelligence: Experimental Evidence from Radiology.
- Andrews, Isaiah and Jesse M. Shapiro (2021). A Model of Scientific Communication. Econometrica, 89(5):2117–2142.
- Algorithmic Recommendations and Human Discretion.
- Identification of Causal Effects Using Instrumental Variables. Journal of the American Statistical Association, 91(434):444–455.
- The Allocation of Decision Authority to Human and Artificial Intelligence. AEA Papers and Proceedings, 110:80–84.
- Policy Optimization for Personalized Interventions in Behavioral Health. arXiv:2303.12206 [cs].
- Improving Human-Algorithm Collaboration: Causes and Mitigation of Over- and Under-Adherence.
- Beyond Accuracy: The Role of Mental Models in Human-AI Team Performance. Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 7:2–11.
- Improving Human Decision-Making with Machine Learning. arXiv:2108.08454 [cs].
- Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment. arXiv:2109.11679 [cs, stat].
- Bergemann, Dirk and Stephen Morris (2019). Information Design: A Unified Perspective. Journal of Economic Literature, 57(1):44–95.
- Bertsimas, Dimitris and Nathan Kallus (2020). From Predictive to Prescriptive Analytics. Management Science, 66(3):1025–1044.
- Caro, Felipe and Anna Sáez de Tejada Cuenca (2023). Believing in Analytics: Managers’ Adherence to Price Recommendations from a DSS. Manufacturing & Service Operations Management, page msom.2022.1166.
- Human-Algorithm Collaboration: Achieving Complementarity and Avoiding Unfairness. arXiv:2202.08821 [cs].
- A Case for Humans-in-the-Loop: Decisions in the Presence of Misestimated Algorithmic Scores.
- Green, Ben and Yiling Chen (2019). The Principles and Limits of Algorithm-in-the-Loop Decision Making. Proceedings of the ACM on Human-Computer Interaction, 3(CSCW):50:1–50:24.
- Human-AI Complementarity in Hybrid Intelligence Systems: A Structured Literature Review. PACIS 2021 Proceedings.
- Eliciting Human Judgment for Prediction Algorithms. Management Science, 67(4):2314–2325.
- Experimental Evaluation of Algorithm-Assisted Human Decision-Making: Application to Pretrial Public Safety Assessment. arXiv:2012.02845 [cs, stat].
- Imbens, Guido W. and Joshua D. Angrist (1994). Identification and Estimation of Local Average Treatment Effects. Econometrica, 62(2):467.
- Kamenica, Emir and Matthew Gentzkow (2011). Bayesian Persuasion. American Economic Review, 101(6):2590–2615.
- Towards a Science of Human-AI Decision Making: A Survey of Empirical Studies. arXiv:2112.11471 [cs].
- Lee, Sokbae and Bernard Salanié (2018). Identifying effects of multivalued treatments. Econometrica, 86(6):1939–1963.
- Algorithmic Hiring in Practice: Recruiter and HR Professional’s Perspectives on AI Use in Hiring. In Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society, AIES ’21, pages 166–176, New York, NY, USA. Association for Computing Machinery.
- Manski, Charles F. (2021). Econometrics for Decision Making: Building Foundations Sketched by Haavelmo and Wald. Econometrica, 89(6):2827–2853.
- Manski, Charles F. (2023). Probabilistic prediction for binary treatment choice: With focus on personalized medicine. Journal of Econometrics, 234(2):647–663.
- Artificial Intelligence and Its Effect on Dermatologists’ Accuracy in Dermoscopic Melanoma Image Classification: Web-Based Survey Study. Journal of Medical Internet Research, 22(9):e18091.
- McLaughlin, Bryce and Jann Spiess (2022). Algorithmic Assistance with Recommendation-Dependent Preferences. arXiv:2208.07626 [cs, econ, q-fin].
- Mills, Chris and Marie-Pascale Grimon (2022). The Impact of Algorithmic Tools on Child Protection: Evidence from a Randomized Controlled Trial.
- Mogstad, Magne and Alexander Torgovitsky (2018). Identification and Extrapolation of Causal Effects with Instrumental Variables. Annual Review of Economics, 10(1):577–613.
- Noti, Gali and Yiling Chen (2023). Learning When to Advise Human Decision Makers. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, pages 3038–3048, Macau, SAR China. International Joint Conferences on Artificial Intelligence Organization.
- Algorithm, Human, or the Centaur: How to Enhance Clinical Care?
- Mitigating bias in algorithmic hiring: evaluating claims and practices. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, FAT* ’20, pages 469–481, New York, NY, USA. Association for Computing Machinery.
- The Algorithmic Automation Problem: Prediction, Triage, and Human Effort. arXiv:1903.12220 [cs].
- Rubin, Donald (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5):688–701.
- Saghafian, Soroush (2023). Ambiguous Dynamic Treatment Regimes: A Reinforcement Learning Approach. Management Science, page mnsc.2022.00883.
- Spiess, Jann (2022). Optimal Estimation when Researcher and Social Preferences are Misaligned.
- Stevenson, Megan T. and Jennifer L. Doleac (2022). Algorithmic Risk Assessment in the Hands of Humans.
- Predicting Human Discretion to Adjust Algorithmic Prescription: A Large-Scale Field Experiment in Warehouse Operations. Management Science, 68(2):846–865.
- An overview of clinical decision support systems: benefits, risks, and strategies for success. npj Digital Medicine, 3(1):1–10.
- Vytlacil, Edward (2002). Independence, Monotonicity, and Latent Index Models: An Equivalence Result. Econometrica, 70(1):331–341.
- The Rational Agent Benchmark for Data Visualization. arXiv:2304.03432 [cs].
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.