Causal Graph Discovery with Retrieval-Augmented Generation based Large Language Models (2402.15301v2)
Abstract: Causal graph recovery is traditionally done using statistical estimation-based methods or based on individuals' knowledge about the variables of interest. These approaches often suffer from data collection biases and the limitations of individuals' knowledge. Advances in LLMs provide opportunities to address these problems. We propose a novel method that leverages LLMs to deduce causal relationships in general causal graph recovery tasks. The method draws on knowledge compressed in LLMs, knowledge that LLMs extract from a scientific publication database, and experimental data about the factors of interest. It provides a prompting strategy to extract associational relationships among those factors and a mechanism to verify causality for these associations. Compared with other LLM-based methods that directly instruct LLMs to perform highly complex causal reasoning, our method shows a clear advantage in causal graph quality on benchmark datasets. More importantly, because causality among some factors may change as new research results emerge, our method shows sensitivity to new evidence in the literature and can provide useful information for updating causal graphs accordingly.
Authors:
- Yuzhe Zhang
- Yipeng Zhang
- Yidong Gan
- Lina Yao
- Chen Wang