Designing Optimal Behavioral Experiments Using Machine Learning (2305.07721v2)
Abstract: Computational models are powerful tools for understanding human cognition and behavior. They let us express our theories clearly and precisely, and offer predictions that can be subtle and often counter-intuitive. However, this same richness and ability to surprise means our scientific intuitions and traditional tools are ill-suited to designing experiments to test and compare these models. To avoid these pitfalls and realize the full potential of computational modeling, we require tools to design experiments that provide clear answers about what models explain human behavior and the auxiliary assumptions those models must make. Bayesian optimal experimental design (BOED) formalizes the search for optimal experimental designs by identifying experiments that are expected to yield informative data. In this work, we provide a tutorial on leveraging recent advances in BOED and machine learning to find optimal experiments for any kind of model that we can simulate data from, and show how by-products of this procedure allow for quick and straightforward evaluation of models and their parameters against real experimental data. As a case study, we consider theories of how people balance exploration and exploitation in multi-armed bandit decision-making tasks. We validate the presented approach using simulations and a real-world experiment. As compared to experimental designs commonly used in the literature, we show that our optimal designs more efficiently determine which of a set of models best account for individual human behavior, and more efficiently characterize behavior given a preferred model. At the same time, formalizing a scientific question such that it can be adequately addressed with BOED can be challenging and we discuss several potential caveats and pitfalls that practitioners should be aware of. We provide code and tutorial notebooks to replicate all analyses.
- Baker, Monya. 1,500 scientists lift the lid on reproducibility. Nature. 2016 May; 533(7604):452–454.
- Perceptual decision making: drift-diffusion model is equivalent to a Bayesian model. Frontiers in human neuroscience. 2014; 8:102.
- Science and data science. Proceedings of the National Academy of Sciences. 2017; 114(33):8689–8692.
- Intuitive experimentation in the physical world. Cognitive psychology. 2018; 105:9–38.
- Mining gold from implicit models to improve likelihood-free inference. Proceedings of the National Academy of Sciences. 2020; 117(10):5242–5249.
- Breiman, L. Random Forests. Machine Learning. 2001 10; 45:5–32.
- Neural Approximate Sufficient Statistics for Implicit Models. In: International Conference on Learning Representations (ICLR); 2021. .
- Clark, Herbert H. The language-as-fixed-effect fallacy: A critique of language statistics in psychological research. Journal of verbal learning and verbal behavior. 1973; 12(4):335–359.
- Frequency-dependent selection in vaccine-associated pneumococcal population dynamics. Nature Ecology & Evolution. 2017; 1:1950–1960.
- The frontier of simulation-based inference. Proceedings of the National Academy of Sciences. 2020; 117(48):30055–30062.
- Christine S.M. Currie and John W. Fowler and Kathy Kotiadis and Thomas Monks and Bhakti Stephan Onggo and Duncan A. Robertson and Antuela A. Tako. How simulation modelling can help reduce the impact of COVID-19. Journal of Simulation. 2020; 14(2):83–97.
- Dale, Anders M. Optimal experimental design for event-related fMRI. Human brain mapping. 1999; 8(2-3):109–114.
- Models that learn how humans learn: the case of decision-making and its disorders. PLoS computational biology. 2019; 15(6):e1006903.
- Sensitivity and specificity of information criteria. Briefings in bioinformatics. 2020; 21(2):553–565.
- Bayesian Optimization with Informative Covariance. Transactions on Machine Learning Research. 2023; .
- Thomas Elsken and Jan Hendrik Metzen and Frank Hutter. Neural Architecture Search: A Survey. Journal of Machine Learning Research. 2019; 20(55):1–21.
- Fisher, Ronald A. Frequency distribution of the values of the correlation coefficient in samples from an indefinitely large population. Biometrika. 1915; 10(4):507–521.
- Variational Bayesian Optimal Experimental Design. In: Advances in Neural Information Processing Systems; 2019. .
- Jerome H. Friedman. Greedy function approximation: A gradient boosting machine. The Annals of Statistics. 2001; 29(5):1189 – 1232.
- Hierarchical Reinforcement Learning Explains Task Interleaving Behavior. Computational Brain & Behavior. 2020; .
- Gershman, Samuel J. Deconstructing the human algorithms for exploration. Cognition. 2018; 173:34–42.
- Gershman, Samuel J. Origin of perseveration in the trade-off between reward and complexity. Cognition. 2020; 204:104394.
- Indirect inference for dynamic panel models. Journal of Econometrics. 2010; 157(1):68–77.
- Griffiths, Thomas L. Manifesto for a new (computational) cognitive revolution. Cognition. 2015; 135:21–23.
- How computational modeling can force theory building in psychological science. Perspectives on Psychological Science. 2021; 16(4):789–802.
- Mapping anhedonia onto reinforcement learning: a behavioural meta-analysis. Biology of mood & anxiety disorders. 2013; 3(1):1–16.
- Ioannidis, John PA. Why most published research findings are false. PLoS medicine. 2005; 2(8):e124.
- Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods. 35th Conference on Neural Information Processing Systems (NeurIPS 2021). 2021; .
- Unfalsifiability and mutual translatability of major modeling schemes for choice reaction time. Psychological review. 2014; 121(1):1.
- Highly accurate protein structure prediction with AlphaFold. Nature. 2021; p. 1–11.
- Parameter Inference for Computational Cognitive Models with Approximate Bayesian Computation. Cognitive Science. 2019; 43(6).
- Autistic traits, but not schizotypy, predict increased weighting of sensory information in Bayesian visual integration. ELife. 2018; 7.
- Sequential Bayesian Experimental Design for Implicit Models via Mutual Information. Bayesian Analysis. 2020; .
- Efficient Bayesian Experimental Design for Implicit Models. In: Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics, vol. 89 of Proceedings of Machine Learning Research PMLR; 2019. p. 476–485.
- Bayesian Experimental Design for Implicit Models by Mutual Information Neural Estimation. In: Proceedings of the 37th International Conference on Machine Learning Proceedings of Machine Learning Research, PMLR; 2020. .
- Gradient-based Bayesian Experimental Design for Implicit Models using Mutual Information Lower Bounds. arXiv e-prints. 2021 May; p. arXiv:2105.04379.
- Deep Learning. Nature. 2015 05; 521:436–44.
- Psychological models of human and optimal performance in bandit problems. Cog Sys Research. 2011; 12(2).
- Strategy selection as rational metareasoning. Psychological review. 2017; 124(6):762.
- Maximizing the Information Content of Experiments in Systems Biology. PLOS Computational Biology. 2013 01; 9(1):1–13.
- D. V. Lindley. On a Measure of the Information Provided by an Experiment. The Annals of Mathematical Statistics. 1956; 27(4):986–1005.
- Fundamentals and Recent Developments in Approximate Bayesian Computation. SYSTEMATIC BIOLOGY. 2017; 66:17.
- The Automatic Neuroscientist: A framework for optimizing experimental design with closed-loop real-time fMRI. NeuroImage. 2016; 129:320–334.
- Markov chain Monte Carlo without likelihoods. Proceedings of the National Academy of Sciences. 2003; 100(26):15324–15328.
- Martinez-Cantin, Ruben. BayesOpt: a Bayesian optimization library for nonlinear optimization, experimental design and bandits. J Mach Learn Res. 2014; 15(1):3735–3739.
- Prior knowledge elicitation: The past, present, and future. arXiv preprint arXiv:211201380. 2021; .
- Peter Müller. Simulation-Based Optimal Design. Bayesian Statistics. 1999; 6:459–474.
- A tutorial on adaptive design optimization. Journal of Mathematical Psychology. 2013; 57(3-4):53–67.
- Optimal experimental design for model discrimination. Psych Review. 2009; 116(3).
- BOCK : Bayesian Optimization with Cylindrical Kernels. In: Proceedings of the 35th International Conference on Machine Learning, vol. 80 of Proceedings of Machine Learning Research PMLR; 2018. p. 3868–3877.
- webppl-oed: A practical optimal experiment design system. In: Proceedings of the annual meeting of the cognitive science society; 2018. .
- Antony M. Overstall and David C. Woods. Bayesian Design of Experiments Using Approximate Coordinate Exchange. Technometrics. 2017; 59(4):458–470.
- Likelihood-free methods for cognitive science. Springer; 2018.
- The importance of falsification in computational cognitive modeling. Trends in cognitive sciences. 2017; 21(6):425–433.
- Paninski, Liam. Asymptotic Theory of Information-Theoretic Experimental Design. Neural Computation. 2005 07; 17(7):1480–1507.
- Editors’ Introduction to the Special Section on Replicability in Psychological Science: A Crisis of Confidence? Perspectives on Psychological Science. 2012; 7(6):528–530. PMID: 26168108.
- Using large-scale experiments and machine learning to discover theories of human decision-making. Science. 2021; 372(6547):1209–1214.
- When a good fit can be bad. Trends in cognitive sciences. 2002; 6(10):421–425.
- On Variational Bounds of Mutual Information. In: Proceedings of the 36th International Conference on Machine Learning, vol. 97 of Proceedings of Machine Learning Research PMLR; 2019. p. 5171–5180.
- Modern bayesian experimental design. arXiv preprint arXiv:230214545. 2023; .
- A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions. ACM Comput Surv. 2021 may; 54(4).
- Robbins, Herbert. Some aspects of the sequential design of experiments. Bull Amer Math Soc. 1952 09; 58(5).
- Using approximate Bayesian computation to quantify cell–cell adhesion parameters in a cell migratory process. npj Systems Biology and Applications. 2017; 3(1):9.
- Learning representations by back-propagating errors. Nature. 1986 Oct; 323(6088):533–536.
- Inferring causation from time series in Earth system sciences. Nature communications. 2019; 10(1):1–13.
- A review of modern computational algorithms for Bayesian optimal design. Int Stat Review. 2016; 84(1).
- Toward a principled Bayesian workflow in cognitive science. Psychological methods. 2021; 26(1):103.
- Likelihood-Free Inference in Cosmology: Potential for the Estimation of Luminosity Functions. In: Statistical Challenges in Modern Astronomy V Springer New York; 2012. .
- Finding structure in multi-armed bandits. Cognitive psychology. 2020; 119:101261.
- God does not play dice: Causal determinism and preschoolers’ causal inferences. Child development. 2006; 77(2):427–442.
- Taking the Human Out of the Loop: A Review of Bayesian Optimization. In: Proceedings of the IEEE; 2015. .
- Shannon, Claude Elwood. A Mathematical Theory of Communication. The Bell System Technical Journal. 1948; 27:379–423.
- Sequential monte carlo without likelihoods. Proceedings of the National Academy of Sciences. 2007; 104(6):1760–1765.
- A Bayesian analysis of human decision-making on bandit problems. Journal of Mathematical Psychology. 2009; 53(3).
- Reinforcement learning: an introduction. 2nd ed. Adaptive computation and machine learning series, The MIT Press; 2018.
- Thompson, William R. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika. 1933; 25(3-4):285–294.
- Approaches to analysis in model-based cognitive neuroscience. Journal of Mathematical Psychology. 2017; 76:65–79.
- Competing theories of multialternative, multiattribute preferential choice. Psychological review. 2018; 125(3):329.
- A generalized, likelihood-free method for posterior estimation. Psychonomic bulletin & review. 2014; 21(2):227–250.
- A tutorial on approximate Bayesian computation. Journal of Mathematical Psychology. 2012; 56(2):69–85.
- Learning physical parameters from dynamic scenes. Cognitive psychology. 2018; 104:57–82.
- Bayesian Optimal Experimental Design for Simulator Models of Cognition. arXiv preprint arXiv:211015632. 2021; .
- Contextual sensitivity in scientific reproducibility. Proceedings of the National Academy of Sciences. 2016; 113(23):6454–6459.
- Attention is all you need. Advances in neural information processing systems. 2017; 30.
- Prefrontal cortex as a meta-reinforcement learning system. Nature neuroscience. 2018; 21(6):860–868.
- Application of computerized adaptive testing to educational problems. Journal of educational measurement. 1984; 21(4):361–375.
- Welch, Bernard L. The generalization of ‘STUDENT’S’problem when several different population varlances are involved. Biometrika. 1947; 34(1-2):28–35.
- Ten simple rules for the computational modeling of behavioral data. Elife. 2019; 8:e49547.
- Wolpert, David H. The lack of a priori distinctions between learning algorithms. Neural computation. 1996; 8(7):1341–1390.
- Yarkoni, Tal. The generalizability crisis. Behavioral and Brain Sciences. 2022; 45:e1.
- Optimal experimental design for a class of bandit problems. Journal of Mathematical Psychology. 2010; 54(6).
- Human and optimal exploration and exploitation in bandit problems. Ratio. 2009; 13(14):15.
- Jie Zhou and Ganqu Cui and Shengding Hu and Zhengyan Zhang and Cheng Yang and Zhiyuan Liu and Lifeng Wang and Changcheng Li and Maosong Sun. Graph neural networks: A review of methods and applications. AI Open. 2020; 1:57–81.
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.