OpportunityFinder: A Framework for Automated Causal Inference (2309.13103v1)
Abstract: We introduce OpportunityFinder, a code-less framework that lets non-expert users perform a variety of causal inference studies with panel data. In its current state, OpportunityFinder only requires users to provide raw observational data and a configuration file. A pipeline is then triggered that inspects and processes the data, chooses suitable algorithm(s), and executes the causal study. It returns the causal impact of the treatment on the configured outcome, together with sensitivity and robustness results. Causal inference is widely studied and used to estimate the downstream impact of individuals' interactions with products and features. These causal studies are commonly performed periodically by scientists and/or economists, so business stakeholders are often bottlenecked on scientist or economist bandwidth. We offer OpportunityFinder as a solution for commonly performed causal studies with four key features: (1) ease of use for both Business Analysts and Scientists, (2) abstraction of multiple algorithms under a single I/O interface, (3) support for causal impact analysis under binary treatment with panel data, and (4) dynamic selection of algorithm based on the scale of the data.
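To make the workflow described in the abstract concrete, here is a minimal sketch of a config-driven study on panel data with a binary treatment. It is not OpportunityFinder's code: the abstract gives neither the configuration schema nor the algorithms, so every key in `config`, the function names, the algorithm labels, and the scale threshold are hypothetical. A plain two-period difference-in-differences stands in for whatever estimator the framework would actually select.

```python
from statistics import mean

# Hypothetical configuration: all keys are assumptions for illustration,
# not the real OpportunityFinder schema.
config = {
    "unit_col": "unit",
    "time_col": "period",
    "treatment_col": "treated",   # binary: 1 = unit belongs to the treated group
    "outcome_col": "outcome",
    "treatment_start": 2,         # first post-treatment period
    "large_data_threshold": 1000, # assumed cut-off for "scale of data"
}

# Toy panel: two treated units (a, c) and two control units (b, d).
panel = [
    {"unit": "a", "period": 1, "treated": 1, "outcome": 10.0},
    {"unit": "a", "period": 2, "treated": 1, "outcome": 15.0},
    {"unit": "b", "period": 1, "treated": 0, "outcome": 8.0},
    {"unit": "b", "period": 2, "treated": 0, "outcome": 10.0},
    {"unit": "c", "period": 1, "treated": 1, "outcome": 12.0},
    {"unit": "c", "period": 2, "treated": 1, "outcome": 17.0},
    {"unit": "d", "period": 1, "treated": 0, "outcome": 9.0},
    {"unit": "d", "period": 2, "treated": 0, "outcome": 11.0},
]


def choose_algorithm(rows, cfg):
    """Pick a (hypothetical) algorithm label from the number of distinct units."""
    n_units = len({r[cfg["unit_col"]] for r in rows})
    if n_units >= cfg["large_data_threshold"]:
        return "large_scale_estimator"
    return "small_scale_estimator"


def diff_in_diff(rows, cfg):
    """Two-group, two-period difference-in-differences on group means."""
    def cell_mean(treated, post):
        return mean(
            r[cfg["outcome_col"]]
            for r in rows
            if r[cfg["treatment_col"]] == treated
            and (r[cfg["time_col"]] >= cfg["treatment_start"]) == post
        )

    treated_change = cell_mean(1, True) - cell_mean(1, False)
    control_change = cell_mean(0, True) - cell_mean(0, False)
    return treated_change - control_change


def run_study(rows, cfg):
    """Single entry point: data and config in, algorithm label and effect out."""
    return {
        "algorithm": choose_algorithm(rows, cfg),
        "effect": diff_in_diff(rows, cfg),
    }


result = run_study(panel, config)
print(result)
```

On this toy panel the treated group's mean outcome rises by 5 and the control group's by 2, so the estimated effect is 3. The sketch omits the sensitivity and robustness checks that the abstract says the framework also returns.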