Learning the Covariance of Treatment Effects Across Many Weak Experiments (2402.17637v2)
Abstract: When primary objectives are insensitive or delayed, experimenters may instead focus on proxy metrics derived from secondary outcomes. For example, technology companies often infer the long-term impacts of product interventions from their effects on short-term user engagement signals. We consider the meta-analysis of many historical experiments to learn the covariance of treatment effects on these outcomes, which can support the construction of such proxies. Even when experiments are plentiful, if treatment effects are weak, the covariance of estimated treatment effects across experiments can be highly biased. We overcome this with techniques inspired by weak instrumental variable analysis. We show that Limited Information Maximum Likelihood (LIML) learns a parameter equivalent to fitting total least squares to a transformation of the scatterplot of treatment effects, and that Jackknife Instrumental Variables Estimation (JIVE) learns another parameter computable from the average of Jackknifed covariance matrices across experiments. We also present a total covariance estimator for the latter estimand under homoskedasticity, which is equivalent to a $k$-class estimator. We show how these parameters can be used to construct unbiased proxy metrics under various structural models. Lastly, we discuss the real-world application of our methods at Netflix.
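The bias described in the abstract can be illustrated with a small simulation. The sketch below is not the paper's LIML or JIVE estimator. It assumes every experiment shares one known sampling-noise covariance (a homoskedasticity assumption) and corrects the naive covariance by subtracting that noise covariance. The function name `simulate` and all numeric values are hypothetical choices made for illustration.

```python
import numpy as np

def simulate(n_experiments=200_000, seed=0):
    """Compare naive and noise-corrected covariance of estimated effects.

    All numbers are illustrative assumptions, not values from the paper.
    """
    rng = np.random.default_rng(seed)

    # True treatment effects on (proxy, primary) outcomes: weak, positively correlated.
    true_cov = np.array([[0.04, 0.03],
                         [0.03, 0.04]])
    effects = rng.multivariate_normal(np.zeros(2), true_cov, size=n_experiments)

    # Sampling noise in each experiment's effect estimates. Assumed known and
    # identical across experiments, with negatively correlated errors.
    noise_cov = np.array([[1.0, -0.3],
                          [-0.3, 1.0]])
    noise = rng.multivariate_normal(np.zeros(2), noise_cov, size=n_experiments)
    estimates = effects + noise

    # Naive approach: covariance of the estimated effects across experiments.
    # Its expectation is true_cov + noise_cov, so noise dominates weak effects.
    naive = np.cov(estimates, rowvar=False)

    # Simple correction under the stated assumption: subtract the noise covariance.
    corrected = naive - noise_cov
    return true_cov, naive, corrected

true_cov, naive, corrected = simulate()
print("true covariance:\n", true_cov)
print("naive covariance of estimates:\n", naive)
print("noise-corrected covariance:\n", corrected)
```

In this setup the naive off-diagonal entry takes the sign of the noise correlation rather than that of the true effects, which is the failure mode that motivates estimators robust to weak treatment effects.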