Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
125 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Active Adaptive Experimental Design for Treatment Effect Estimation with Covariate Choices (2403.03589v2)

Published 6 Mar 2024 in stat.ME, cs.LG, econ.EM, and stat.ML

Abstract: This study designs an adaptive experiment for efficiently estimating average treatment effects (ATEs). In each round of our adaptive experiment, an experimenter sequentially samples an experimental unit, assigns a treatment, and observes the corresponding outcome immediately. At the end of the experiment, the experimenter estimates an ATE using the gathered samples. The objective is to estimate the ATE with a smaller asymptotic variance. Existing studies have designed experiments that adaptively optimize the propensity score (treatment-assignment probability). As a generalization of such an approach, we propose optimizing the covariate density as well as the propensity score. First, we derive the efficient covariate density and propensity score that minimize the semiparametric efficiency bound and find that optimizing both covariate density and propensity score minimizes the semiparametric efficiency bound more effectively than optimizing only the propensity score. Next, we design an adaptive experiment using the efficient covariate density and propensity score sequentially estimated during the experiment. Lastly, we propose an ATE estimator whose asymptotic variance aligns with the minimized semiparametric efficiency bound.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (56)
  1. Adusumilli, K. Minimax policies for best arm identification with two arms, 2022. arXiv:2204.05527.
  2. Identification of causal effects using instrumental variables. Journal of the American Statistical Association, 91(434):444–455, 1996.
  3. Best arm identification in multi-armed bandits. In Conference on Learning Theory (COLT), pp.  41–53, 2010.
  4. Sequential nonparametric testing with the law of the iterated logarithm. In Conference on Uncertainty in Artificial Intelligence (UAI), pp.  42–51, 2016.
  5. Sequential Identification and Ranking Procedures: With Special Reference to Koopman-Darmois Populations. University of Chicago Press, 1968.
  6. Pure exploration in multi-armed bandits problems. In Algorithmic Learning Theory (ALT), pp.  23–37. Springer Berlin Heidelberg, 2009.
  7. Pure exploration in finitely-armed and continuous-armed bandits. Theoretical Computer Science, 2011.
  8. Double/debiased machine learning for treatment and structural parameters. Econometrics Journal, 21:C1–C68, 2018.
  9. Adaptive Design Methods in Clinical Trials. Chapman and Hall/CRC, 2 edition, 2011.
  10. Statistical consideration of adaptive methods in clinical development. Journal of Biopharmaceutical Statistics, 2005.
  11. Semiparametric efficient inference in adaptive experiments, 2023. arXiv:2311.18274.
  12. FDA. Adaptive designs for clinical trials of drugs and biologics: Guidance for industry. Technical report, U.S. Department of Health and Human Services Food and Drug Administration (FDA), Center for Drug Evaluation and Research (CDER), Center for Biologics Evaluation and Research (CBER), 2019.
  13. Optimal best arm identification with fixed confidence. In Conference on Learning Theory (COLT), 2016.
  14. Calculus of variations. Courier Corporation, 2000.
  15. Bayesian Data Analysis. Chapman and Hall/CRC, 2004.
  16. Confidence intervals for policy evaluation in adaptive experiments, 2019.
  17. Hahn, J. On the role of the propensity score in efficient semiparametric estimation of average treatment effects. Econometrica, 66(2):315–331, 1998.
  18. Adaptive experimental design using the propensity score. Journal of Business and Economic Statistics, 2011.
  19. Hamilton, J. Time series analysis. Princeton Univ. Press, 1994.
  20. Randomised controlled trials - the gold standard for effectiveness research: Study design: randomised controlled trials. International Journal of Obstetrics & Gynaecology, 125(13), 2018.
  21. A paradox concerning nuisance parameters and projected estimating functions. Biometrika, 2004.
  22. Efficient estimation of average treatment effects using the estimated propensity score. Econometrica, 2003.
  23. A puzzling phenomenon in semiparametric estimation problems with infinite-dimensional nuisance parameters. Econometric Theory, 24(6):1717–1728, 2008.
  24. Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. Cambridge University Press, 2015.
  25. Adaptive experimental design for efficient treatment effect estimation: Randomized allocation via contextual bandit algorithm. arXiv:2002.05308, 2020.
  26. The adaptive doubly robust estimator and a paradox concerning logging policy. In Advances in Neural Information Processing Systems (NeurIPS), 2021.
  27. Asymptotically optimal fixed-budget best arm identification with variance-dependent bounds, 2023. arXiv:2302.02988.
  28. On the complexity of best-arm identification in multi-armed bandit models. Journal of Machine Learning Research, 17(1):1–42, 2016.
  29. Klaassen, C. A. J. Consistent estimation of the influence function of locally asymptotically linear estimators. Annal of Statistics, 1987.
  30. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 1985.
  31. Loeve, M. Probability Theory. Graduate Texts in Mathematics. Springer, 1977.
  32. Statistical inference for the mean outcome under a possibly non-unique optimal treatment strategy. Annals of statistics, 44(2):713–742, Apr 2016.
  33. Nadaraya, E. A. On estimating regression. Theory of Probability & Its Applications, 9(1):141–142, 1964.
  34. Efficient counterfactual learning from bandit feedback. In AAAI Conference on Artificial Intelligence (AAAI), 2019.
  35. Neyman, J. Sur les applications de la theorie des probabilites aux experiences agricoles: Essai des principes. Statistical Science, 5:463–472, 1923.
  36. Neyman, J. On the two different aspects of the representative method: the method of stratified sampling and the method of purposive selection. Journal of the Royal Statistical Society, 97:123–150, 1934.
  37. Owen, A. B. Monte Carlo theory, methods and examples. https://artowen.su.domains/mc/, 2013.
  38. Kernel estimation and model combination in a bandit problem with covariates. Journal of Machine Learning Research, 2016.
  39. Rockafellar, R. T. Convex analysis. Princeton university press, 2015.
  40. Rubin, D. B. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5):688, 1974.
  41. A/b/n testing with control in the presence of subpopulations. In Advances in Neural Information Processing Systems (NeurIPS), 2021.
  42. Shimodaira, H. Improving predictive inference under covariate shift by weighting the log-likelihood function. Journal of Statistical Planning and Inference, 90(2):227–244, 2000. ISSN 0378-3758.
  43. Sugiyama, M. Active learning in approximately linear regression based on conditional expectation of generalization error. Journal of Machine Learning Research, 7(6):141–166, 2006.
  44. Pool-based active learning in approximate linear regression. Machine Learning, 75(3):249–274, Jun 2009.
  45. Tabord-Meehan, M. Stratification Trees for Adaptive Randomisation in Randomised Controlled Trials. The Review of Economic Studies, 90(5):2646–2673, 12 2022. ISSN 0034-6527.
  46. Thompson, W. R. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 1933.
  47. Off-policy evaluation and learning for external validity under a covariate shift. In Advances in Neural Information Processing Systems (NeurIPS), volume 33, pp.  49–61, 2020.
  48. van der Laan, M. J. The construction and analysis of adaptive group sequential designs, 2008. URL https://biostats.bepress.com/ucbbiostat/paper232. U.C. Berkeley Division of Biostatistics Working Paper Series.
  49. Online targeted learning, 2014. URL https://biostats.bepress.com/ucbbiostat/paper330.
  50. van der Vaart, A. Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 1998.
  51. Vershynin, R. High-dimensional probability, volume 7. Cambridge University Press, Cambridge, 2018.
  52. Multi-armed bandit models for the optimal design of clinical trials: Benefits and challenges. Statistical Science, 30:199–215, 2015.
  53. Watson, G. S. Smooth regression analysis. Sankhyā: The Indian Journal of Statistics, Series A (1961-2002), 26(4):359–372, 1964.
  54. White, H. Chapter v - central limit theory. In Asymptotic Theory for Econometricians, Economic Theory, Econometrics, and Mathematical Economics, pp.  107–131. 1984.
  55. Wu, D. Pool-based sequential active learning for regression. IEEE Transactions on Neural Networks and Learning Systems, 30(5):1348–1359, 2019.
  56. Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates. Annals of Statistics, 30(1):100–121, 2002.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com