The Value of Context: Human versus Black Box Evaluators (2402.11157v2)
Abstract: Machine learning algorithms are now capable of performing evaluations previously conducted by human experts (e.g., medical diagnoses). How should we conceptualize the difference between evaluation by humans and by algorithms, and when should an individual prefer one over the other? We propose a framework to examine one key distinction between the two forms of evaluation: Machine learning algorithms are standardized, fixing a common set of covariates by which to assess all individuals, while human evaluators customize which covariates are acquired to each individual. Our framework defines and analyzes the advantage of this customization -- the value of context -- in environments with high-dimensional data. We show that unless the agent has precise knowledge about the joint distribution of covariates, the benefit of additional covariates generally outweighs the value of context.
- Acemoglu, D., V. Chernozhukov, and M. Yildiz (2015): “Fragility of Asymptotic Agreement under Bayesian Learning,” Theoretical Economics, 11, 187–225.
- Acosta, J., G. Falcone, P. Rajpurkar, and E. Topol (2022): “Multimodal biomedical AI,” Nature Medicine, 28, 1773–1784.
- Agarwal, N., A. Moehring, P. Rajpurkar, and T. Salz (2023): “Combining Human Expertise with Artificial Intelligence: Experimental Evidence from Radiology,” Working Paper 31422, National Bureau of Economic Research.
- Angelova, V., W. Dobbie, , and C. S. Yang (2022): “Algorithmic Recommendations and Human Discretion,” Working Paper.
- Antic, N. and A. Chakraborty (2023): “Selected Facts,” Working Paper.
- Arnold, B. C. and R. A. Groeneveld (1979): “Bounds on expectations of linear systematic statistics based on dependent samples,” The Annals of Statistics, 220–223.
- Bardhi, A. (2023): “Attributes: Selective Learning and Influence,” Working Paper.
- Bastani, H., O. Bastani, and W. P. Sinchaisri (2022): “Improving Human Decision-Making with Machine Learning,” .
- Berman, S. M. (1964): “Limit Theorems for the Maximum Term in Stationary Sequences,” The Annals of Mathematical Statistics, 35, 502 – 516.
- Blackwell, D. and L. Dubins (1962): “Merging of Opinions with Increasing Information,” The Annals of Mathematical Statistics.
- Chalfin, A., O. Danieli, A. Hillis, Z. Jelveh, M. Luca, J. Ludwig, and S. Mullainathan (2016): “Productivity and Selection of Human Capital with Machine Learning,” American Economic Review, 106, 124–27.
- Chernozhukov, V., D. Chetverikov, and K. Kato (2013): “Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors,” .
- Crawford, V. P. and J. Sobel (1982): “Strategic information transmission,” Econometrica: Journal of the Econometric Society, 1431–1451.
- Cripps, M., J. Ely, G. Mailath, and L. Samuelson (2008): “Common Learning,” Econometrica, 76, 909–933.
- Di Tillio, A., M. Ottaviani, and P. N. Sørensen (2021): “Strategic Sample Selection,” Econometrica, 89, 911–953.
- Dworczak, P. and G. Martini (2019): “The Simple Economics of Optimal Persuasion,” Journal of Political Economy, 127, 1993–2048.
- Dye, R. A. (1985): “Disclosure of Nonproprietary Information,” Journal of Accounting Research, 23, 123–145.
- Farina, A., G. Frechette, A. Lizzeri, and J. Perego (2023): “The Selective Disclosure of Evidence: An Experiment,” Working Paper.
- Frankel, A. (2014): “Aligned Delegation,” American Economic Review, 104, 66–83.
- Gillis, T., B. McLaughlin, and J. Spiess (2021): “On the Fairness of Machine-Assisted Human Decisions,” Working Paper.
- Glazer, J. and A. Rubinstein (2004): “On optimal rules of persuasion,” Econometrica, 72, 1715–1736.
- Haghtalab, N., M. Jackson, and A. Procaccia (2021): “Belief polarization in a complex world: A learning theory perspective,” PNAS, 118, 141–73.
- Hoffman, M., L. B. Kahn, and D. Li (2017): “Discretion in Hiring*,” The Quarterly Journal of Economics, 133, 765–800.
- Jha, S. (2020): “Can you sue an algorithm for malpractice? It depends,” .
- Jin, G. Z., M. Luca, and D. Martin (2021): “Is No News (Perceived As) Bad News? An Experimental Investigation of Information Disclosure,” American Economic Journal: Microeconomics, 13, 141–73.
- Jung, J., C. Concannon, R. Shroff, S. Goel, and D. G. Goldstein (2017): “Simple rules for complex decisions,” Working Paper.
- Jussupow, Ekaterina; Benbasat, I. and A. Heinzl (2020): “Why are we averse towards algorithms? A comprehensive literature review on algorithm aversion,” in In Proceedings of the 28th European Conference on Information Systems.
- Kamenica, E. and M. Gentzkow (2011): “Bayesian Persuasion,” American Economic Review, 101, 2590–2615.
- Klabjan, D., W. Olszewski, and A. Wolinsky (2014): “Attributes,” Games and Economic Behavior, 88, 190–206.
- Kleinberg, J., H. Lakkaraju, J. Leskovec, J. Ludwig, and S. Mullainathan (2017): “Human Decisions and Machine Predictions,” The Quarterly Journal of Economics, 133, 237–293.
- Lai, V., C. Chen, A. Smith-Renner, Q. V. Liao, and C. Tan (2023): “Towards a Science of Human-AI Decision Making: An Overview of Design Space in Empirical Human-Subject Studies,” in Proceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency, New York, NY, USA: Association for Computing Machinery, FAccT ’23, 1369–1385.
- Liang, A., X. Mu, and V. Syrgkanis (2022): “Dynamically Aggregating Diverse Information,” Econometrica, 90, 47–80.
- Longoni, C., A. Bonezzi, and C. K. Morewedge (2019): “Resistance to Medical Artificial Intelligence,” Journal of Consumer Research, 46, 629–650.
- McLaughlin, B. and J. Spiess (2022): “Algorithmic Assistance with Recommendation-Dependent Preferences,” Working Paper.
- Milgrom, P. R. (1981): “Good News and Bad News: Representation Theorems and Applications,” The Bell Journal of Economics, 12, 380–391.
- Morris, S. and M. Yildiz (2019): “Crises: Equilibrium Shifts and Large Shocks,” American Economic Review, 109, 2823–54.
- Obermeyer, Z. and E. J. Emanuel (2016): “Predicting the Future - Big Data, Machine Learning, and Clinical Medicine,” The New England Journal of Medicine, 375, 1216–9.
- Rajpurkar, P., J. Irvin, K. Zhu, B. Yang, H. Mehta, T. Duan, D. Ding, A. Bagul, C. Langlotz, K. Shpanskaya, M. P. Lungren, and A. Y. Ng (2017): “CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning,” Working Paper.
- Spiegler, R. (2020): “Behavioral Implications of Causal Misperceptions,” Annual Review of Economics, 12, 81–106.