Efficient Causal Graph Discovery Using Large Language Models (2402.01207v4)
Abstract: We propose a novel framework that leverages large language models (LLMs) for full causal graph discovery. Previous LLM-based methods query the model about each pair of variables, which requires a number of queries quadratic in the number of variables and quickly becomes impractical for larger causal graphs. In contrast, the proposed framework uses a breadth-first search (BFS) approach that requires only a linear number of queries. We also show that the proposed method can easily incorporate observational data, when available, to improve performance. In addition to being more time- and data-efficient, the proposed framework achieves state-of-the-art results on real-world causal graphs of varying sizes. The results demonstrate the effectiveness and efficiency of the proposed method in discovering causal relationships and its potential for broad applicability to causal graph discovery tasks across domains.
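To make the query-count argument concrete, the following is a minimal sketch of a BFS-style discovery loop, not the authors' implementation. It assumes two hypothetical oracle functions, `ask_roots` and `ask_children`, which stand in for LLM prompts and are answered here from a small hand-written toy graph. The function names, the toy variables, and the cycle check are all illustrative assumptions.

```python
from collections import deque
from typing import Callable, Dict, List, Set, Tuple

# Toy ground truth used only to simulate the expert; a real system would
# replace the two oracle functions below with LLM prompts (assumption).
TRUE_CHILDREN: Dict[str, List[str]] = {
    "smoking": ["tar", "cancer"],
    "tar": ["cancer"],
    "cancer": ["fatigue"],
    "fatigue": [],
}


def ask_roots(variables: List[str]) -> List[str]:
    """Hypothetical oracle: variables that no other variable causes."""
    caused = {c for cs in TRUE_CHILDREN.values() for c in cs}
    return [v for v in variables if v not in caused]


def ask_children(node: str, variables: List[str]) -> List[str]:
    """Hypothetical oracle: variables directly caused by `node`."""
    return TRUE_CHILDREN[node]


def creates_cycle(children: Dict[str, Set[str]], src: str, dst: str) -> bool:
    """Return True if adding src -> dst would close a directed cycle."""
    # A cycle appears exactly when src is already reachable from dst.
    stack, seen = [dst], set()
    while stack:
        node = stack.pop()
        if node == src:
            return True
        if node not in seen:
            seen.add(node)
            stack.extend(children[node])
    return False


def bfs_discover(
    variables: List[str],
    roots_oracle: Callable[[List[str]], List[str]],
    children_oracle: Callable[[str, List[str]], List[str]],
) -> Tuple[List[Tuple[str, str]], int]:
    """Build a DAG with one oracle query per visited node, plus one for roots."""
    children: Dict[str, Set[str]] = {v: set() for v in variables}
    queries = 1  # the initial query for root variables
    roots = roots_oracle(variables)
    frontier = deque(roots)
    visited = set(roots)
    while frontier:
        node = frontier.popleft()
        queries += 1  # one expansion query per visited node
        for child in children_oracle(node, variables):
            # Skip unknown names, self-loops, and edges that break acyclicity.
            if child not in children or child == node:
                continue
            if creates_cycle(children, node, child):
                continue
            children[node].add(child)
            if child not in visited:
                visited.add(child)
                frontier.append(child)
    edges = sorted((p, c) for p, cs in children.items() for c in cs)
    return edges, queries


def pairwise_query_count(n: int) -> int:
    """Number of unordered variable pairs a pairwise method must query."""
    return n * (n - 1) // 2


VARIABLES = list(TRUE_CHILDREN)
EDGES, BFS_QUERIES = bfs_discover(VARIABLES, ask_roots, ask_children)
```

In this sketch each variable is expanded at most once, so the loop issues at most n + 1 queries for n variables, while a pairwise scheme needs n(n - 1)/2. For 100 variables that is at most 101 queries against 4950. Variables that are neither roots nor named as a child by the oracle are never visited, which is a limitation of this simplified version.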