2000 character limit reached
Towards Automated Causal Discovery: a case study on 5G telecommunication data (2402.14481v1)
Published 22 Feb 2024 in cs.LG and stat.ME
Abstract: We introduce the concept of Automated Causal Discovery (AutoCD), defined as any system that aims to fully automate the application of causal discovery and causal reasoning methods. AutoCD's goal is to deliver all causal information that an expert human analyst would and answer a user's causal queries. We describe the architecture of such a platform, and illustrate its performance on synthetic data sets. As a case study, we apply it on temporal telecommunication data. The system is general and can be applied to a plethora of causal discovery problems.
- Scoring bayesian networks of mixed variables. International Journal of Data Science and Analytics, 6:3–18, 2017.
- Learning high-dimensional directed acyclic graphs with mixed data-types. Proceedings of machine learning research, 104:4–21, 2019.
- Out-of-sample tuning for causal discovery. IEEE transactions on neural networks and learning systems, PP, 2022.
- Dowhy-gcm: An extension of dowhy for causal inference in graphical causal models, 2022.
- Giorgos Borboudakis and I. Tsamardinos. Forward-backward selection with early dropping. J. Mach. Learn. Res., 20:8:1–8:39, 2019a.
- Forward-backward selection with early dropping. Journal of Machine Learning Research, 20(8):1–39, 2019b.
- Tools and algorithms for causally interpreting directed edges in maximal ancestral graphs. In Proceedings of the Sixth European Workshop on Probabilistic Graphical Models (PGM 2012), 2012.
- David Maxwell Chickering. Learning equivalence classes of bayesian-network structures. J. Mach. Learn. Res., 2002a.
- David Maxwell Chickering. Optimal structure identification with greedy search. J. Mach. Learn. Res., 3:507–554, 2002b.
- Order-independent constraint-based causal structure learning. J. Mach. Learn. Res., 15:3741–3782, 2014.
- Learning high-dimensional directed acyclic graphs with latent and selection variables. The Annals of Statistics, 40(1):294–321, 2012.
- A bayesian method for the induction of probabilistic networks from data. Mach. Learn., 9(4):309–347, October 1992.
- Bootstrap aggregation and confidence measures to improve time series causal discovery, 2023.
- CausalTune: A Python package for Automated Causal Inference model estimation and selection. https://github.com/py-why/causaltune, 2022.
- Efficient prefdiv algorithms for effective top-k result diversification. In International Conference on Extending Database Technology, 2020.
- Causalmgm: an interactive web-based causal discovery tool. Nucleic Acids Research, 48:W597 – W602, 2020.
- High-recall causal discovery for autocorrelated time series with latent confounders. ArXiv, abs/2007.01884, 2020.
- Learning bayesian networks: The combination of knowledge and statistical data. Machine Learning, 20:197–243, 1995.
- Estimation of a structural vector autoregression model using non-gaussianity. Journal of Machine Learning Research, 11(56):1709–1731, 2010.
- Structural agnostic modeling: Adversarial learning of causal graphs. J. Mach. Learn. Res., 23:219:1–219:62, 2018.
- Feature selection with the r package mxm: Discovering statistically-equivalent feature subsets. ArXiv, 2016.
- Feature selection with the R package MXM: Discovering statistically equivalent feature subsets. Journal of Statistical Software, 80(7), 2017.
- Estimating high-dimensional intervention effects from observational data. Annals of Statistics, 37:3133–3164, 2009.
- Causal structure learning from multivariate time series in settings with unmeasured confounding. In CD@KDD, 2018.
- Richard Neapolitan. Learning Bayesian Networks. Pearson Prentice Hall, 01 2003.
- Opportunityfinder: A framework for automated causal inference. In KDD 2023 Workshop on Causal Inference and Machine Learning in Practice: Use cases for Product, Brand, Policy and Beyond, 2023.
- A hybrid causal search algorithm for latent variable models. In Proceedings of the Eighth International Conference on Probabilistic Graphical Models, volume 52, pages 368–379. PMLR, 09 2016.
- Dynotears: Structure learning from time-series data. In Silvia Chiappa and Roberto Calandra, editors, Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, volume 108 of Proceedings of Machine Learning Research, pages 1595–1605. PMLR, 2020.
- Judea Pearl. Causality: Models, Reasoning and Inference. Cambridge University Press, USA, 2nd edition, 2009.
- J. Pellet and A. Elisseeff. Finding latent causes in causal networks: an efficient approach based on markov blankets. In NIPS, 2008.
- Causal inference on time series using restricted structural equation models. In C.J. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K.Q. Weinberger, editors, Advances in Neural Information Processing Systems, volume 26. Curran Associates, Inc., 2013.
- Evaluation of causal structure learning methods on mixed data types. Proceedings of machine learning research, 92:48–65, 2018a.
- Comparison of strategies for scalable causal discovery of latent variable models from mixed data. International Journal of Data Science and Analytics, 6:33 – 45, 2018b.
- Adjacency-faithfulness and conservative causal inference. Proceedings of the 22nd Conference on Uncertainty in Artificial Intelligence, UAI 2006, 06 2012.
- A million variables and more: the fast greedy equivalence search algorithm for learning high-dimensional graphical causal models, with an application to functional magnetic resonance images. International Journal of Data Science and Analytics, 3:121–129, 2016.
- Microsoft Research. EconML: A Python Package for ML-Based Heterogeneous Treatment Effects Estimation. https://github.com/microsoft/EconML, 2019. Version 0.x.
- Ancestral graph markov models. Ann. Statist., 30(4):962–1030, 08 2002a.
- Ancestral graph markov models. Annals of Statistics, 30:962–1030, 2002b.
- Jakob Runge. Causal network reconstruction from time series: From theoretical assumptions to practical estimation. Chaos, 28 7:075310, 2018.
- Jakob Runge. Discovering contemporaneous and lagged causal relations in autocorrelated nonlinear time series datasets. In UAI, 2020.
- Detecting and quantifying causal associations in large nonlinear time series datasets. Science Advances, 5, 2019.
- Gideon Schwarz. Estimating the dimension of a model. Ann. Statist., 6(2):461–464, 03 1978.
- Learning mixed graphical models with separate sparsity parameters and stability-based model selection. BMC Bioinformatics, 17, 2016.
- Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome research, 13 11:2498–504, 2003.
- Dowhy: An end-to-end library for causal inference. arXiv preprint arXiv:2011.04216, 2020.
- A linear non-gaussian acyclic model for causal discovery. J. Mach. Learn. Res., 7:2003–2030, 2006.
- Causation, Prediction, and Search. Mit Press: Cambridge, 2000.
- Auto-weka: combined selection and hyperparameter optimization of classification algorithms. In KDD ’13, 2012.
- Sofia Triantafillou and I. Tsamardinos. Constraint-based causal discovery from multiple interventions over overlapping variable sets. ArXiv, abs/1403.2150, 2014.
- Sofia Triantafillou and I. Tsamardinos. Score-based vs constraint-based causal learning in the presence of confounders. In CFA@UAI, 2016.
- Towards principled feature selection: Relevancy, filters and wrappers. In Christopher M. Bishop and Brendan J. Frey, editors, Proceedings of the Ninth International Workshop on Artificial Intelligence and Statistics, volume R4 of Proceedings of Machine Learning Research, pages 300–307. PMLR, 03–06 Jan 2003.
- The max-min hill-climbing bayesian network structure learning algorithm. Machine Learning, 65:31–78, 2006.
- Bootstrapping the out-of-sample predictions for efficient and accurate cross-validation. Machine Learning, 107:1895 – 1922, 2018.
- Just add data: automated predictive modeling for knowledge discovery and feature selection. NPJ Precision Oncology, 6, 2022.
- On scoring maximal ancestral graphs with the max-min hill climbing algorithm. Int. J. Approx. Reason., 102:74–85, 2018.
- Jiji Zhang. Causal reasoning with ancestral graphs. J. Mach. Learn. Res., 9:1437–1474, 2008.
- Dags with no tears: Continuous optimization for structure learning. In Neural Information Processing Systems, 2018.
- Causal-learn: Causal discovery in python. arXiv preprint arXiv:2307.16405, 2023.
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.