Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Integrating Active Learning in Causal Inference with Interference: A Novel Approach in Online Experiments (2402.12710v1)

Published 20 Feb 2024 in stat.ME, cs.LG, and stat.ML

Abstract: In the domain of causal inference research, the prevalent potential outcomes framework, notably the Rubin Causal Model (RCM), often overlooks individual interference and assumes independent treatment effects. This assumption, however, is frequently misaligned with the intricate realities of real-world scenarios, where interference is not merely a possibility but a common occurrence. Our research endeavors to address this discrepancy by focusing on the estimation of direct and spillover treatment effects under two assumptions: (1) network-based interference, where treatments on neighbors within connected networks affect one's outcomes, and (2) non-random treatment assignments influenced by confounders. To improve the efficiency of estimating potentially complex effects functions, we introduce an novel active learning approach: Active Learning in Causal Inference with Interference (ACI). This approach uses Gaussian process to flexibly model the direct and spillover treatment effects as a function of a continuous measure of neighbors' treatment assignment. The ACI framework sequentially identifies the experimental settings that demand further data. It further optimizes the treatment assignments under the network interference structure using genetic algorithms to achieve efficient learning outcome. By applying our method to simulation data and a Tencent game dataset, we demonstrate its feasibility in achieving accurate effects estimations with reduced data requirements. This ACI approach marks a significant advancement in the realm of data efficiency for causal inference, offering a robust and efficient alternative to traditional methodologies, particularly in scenarios characterized by complex interference patterns.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (38)
  1. Alberto Abadie and Guido Imbens. 2002. Simple and bias-corrected matching estimators for average treatment effects.
  2. Alberto Abadie and Guido W Imbens. 2011. Bias-corrected matching estimators for average treatment effects. Journal of Business & Economic Statistics 29, 1 (2011), 1–11.
  3. Peter M Aronow and Cyrus Samii. 2017. Estimating average causal effects under general interference, with application to a social network experiment. (2017).
  4. Designing experiments to measure spillover effects. (2014).
  5. Reasoning about interference between units: A general framework. Political Analysis 21, 1 (2013), 97–124.
  6. Double/debiased machine learning for treatment and structural parameters.
  7. David Roxbee Cox. 1958. Planning of experiments. (1958).
  8. Estimating peer effects in networks with peer encouragement designs. Proceedings of the National Academy of Sciences 113, 27 (2016), 7316–7322.
  9. Identification and estimation of treatment and interference effects in observational studies on networks. J. Amer. Statist. Assoc. 116, 534 (2021), 901–918.
  10. Estimating causal effects under network interference with Bayesian generalized propensity scores. The Journal of Machine Learning Research 23, 1 (2022), 13101–13161.
  11. Measuring the effects of segregation in the presence of social spillovers: a nonparametric approach. Technical Report. National Bureau of Economic Research.
  12. M Elizabeth Halloran and Claudio J Struchiner. 1991. Study designs for dependent happenings. Epidemiology 2, 5 (1991), 331–338.
  13. M Elizabeth Halloran and Claudio J Struchiner. 1995. Causal inference in infectious diseases. Epidemiology (1995), 142–151.
  14. Miguel A Hernán and James M Robins. 2010. Causal inference.
  15. Guanglei Hong and Stephen W Raudenbush. 2006. Evaluating kindergarten retention policy: A case study of causal inference for multilevel observational data. J. Amer. Statist. Assoc. 101, 475 (2006), 901–910.
  16. Robust causal learning for the estimation of average treatment effects. In 2022 International Joint Conference on Neural Networks (IJCNN). IEEE, 1–9.
  17. Michael G Hudgens and M Elizabeth Halloran. 2008. Toward causal inference with interference. J. Amer. Statist. Assoc. 103, 482 (2008), 832–842.
  18. Adam Kołacz and Przemysław Grzegorzewski. 2016. Measures of dispersion for multidimensional data. European Journal of Operational Research 251, 3 (2016), 930–937.
  19. Michael P Leung and Pantelis Loupos. 2022. Unconfoundedness with network interference. arXiv preprint arXiv:2211.07823 (2022).
  20. Balancing covariates via propensity score weighting. J. Amer. Statist. Assoc. 113, 521 (2018), 390–400.
  21. On inverse probability-weighted estimators in the presence of interference. Biometrika 103, 4 (2016), 829–842.
  22. Xinwei Ma and Jingshen Wang. 2020. Robust inference using inverse probability weighting. J. Amer. Statist. Assoc. 115, 532 (2020), 1851–1860.
  23. Causal inference for social network data. J. Amer. Statist. Assoc. (2022), 1–15.
  24. Elizabeth L Ogburn and Tyler J VanderWeele. 2014. Causal diagrams for interference. (2014).
  25. Gaussian processes for machine learning. Vol. 1. Springer.
  26. Estimation of regression coefficients when some regressors are not always observed. Journal of the American statistical Association 89, 427 (1994), 846–866.
  27. Paul R Rosenbaum. 2007. Interference between units in randomized experiments. Journal of the american statistical association 102, 477 (2007), 191–200.
  28. Paul R Rosenbaum and Donald B Rubin. 1983. The central role of the propensity score in observational studies for causal effects. Biometrika 70, 1 (1983), 41–55.
  29. Donald B Rubin. 1974. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology 66, 5 (1974), 688.
  30. Donald B Rubin. 1980. Randomization analysis of experimental data: The Fisher randomization test comment. Journal of the American statistical association 75, 371 (1980), 591–593.
  31. A tutorial on Gaussian process regression: Modelling, exploring, and exploiting functions. Journal of Mathematical Psychology 85 (2018), 1–16.
  32. Eli Sherman and Ilya Shpitser. 2018. Identification and estimation of causal effects from dependent data. Advances in neural information processing systems 31 (2018).
  33. Michael E Sobel. 2006. What do randomized studies of housing mobility demonstrate? Causal inference in the face of interference. J. Amer. Statist. Assoc. 101, 476 (2006), 1398–1407.
  34. Eric J Tchetgen Tchetgen and Tyler J VanderWeele. 2012. On causal inference in the presence of interference. Statistical methods in medical research 21, 1 (2012), 55–75.
  35. Mark J Van der Laan. 2014. Causal inference for a population of causally connected units. Journal of Causal Inference 2, 1 (2014), 13–74.
  36. Mediation and spillover effects in group-randomized trials: a case study of the 4Rs educational intervention. J. Amer. Statist. Assoc. 108, 502 (2013), 469–482.
  37. Francesco Vivarelli and Christopher Williams. 1998. Discovering hidden features with Gaussian processes regression. Advances in Neural Information Processing Systems 11 (1998).
  38. Shu Yang and Yunshu Zhang. 2023. Multiply robust matching estimators of average and quantile treatment effects. Scandinavian Journal of Statistics 50, 1 (2023), 235–265.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Hongtao Zhu (3 papers)
  2. Sizhe Zhang (4 papers)
  3. Yang Su (147 papers)
  4. Zhenyu Zhao (34 papers)
  5. Nan Chen (98 papers)
Citations (1)