Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement Learning (2401.16974v1)

Published 30 Jan 2024 in cs.LG and cs.AI

Abstract: Causal discovery is the challenging task of inferring causal structure from data. Motivated by Pearl's Causal Hierarchy (PCH), which tells us that passive observations alone are not enough to distinguish correlation from causation, there has been a recent push to incorporate interventions into machine learning research. Reinforcement learning provides a convenient framework for such an active approach to learning. This paper presents CORE, a deep reinforcement learning-based approach for causal discovery and intervention planning. CORE learns to sequentially reconstruct causal graphs from data while learning to perform informative interventions. Our results demonstrate that CORE generalizes to unseen graphs and efficiently uncovers causal structures. Furthermore, CORE scales to larger graphs with up to 10 variables and outperforms existing approaches in structure estimation accuracy and sample efficiency. All relevant code and supplementary material can be found at https://github.com/sa-and/CORE

Definition Search Book Streamline Icon: https://streamlinehq.com
References (41)
  1. Active learning of causal structures with deep reinforcement learning. Neural Networks 154 (10 2022), 22–30. https://doi.org/10.1016/J.NEUNET.2022.06.028
  2. A Medium-Scale Distributed System for Computer Science Research: Infrastructure for the Long Term. Computer 49, 5 (5 2016), 54–63. https://doi.org/10.1109/MC.2016.127
  3. On Pearl’s Hierarchy and the Foundations of Causal Inference. In Probabilistic and Causal Inference. ACM, New York, NY, USA, 507–556. https://doi.org/10.1145/3501714.3501743
  4. GFlowNet Foundations. Journal of Machine Learning Research 24, 210 (2023), 1–55. http://jmlr.org/papers/v24/22-0364.html
  5. Differentiable Causal Discovery from Interventional Data. In Advances in Neural Information Processing Systems, H Larochelle, M Ranzato, R Hadsell, M F Balcan, and H Lin (Eds.), Vol. 33. Curran Associates, Inc., 21865–21877. https://proceedings.neurips.cc/paper_files/paper/2020/file/f8b7aa3a0d349d9562b424160ad18612-Paper.pdf
  6. David Maxwell Chickering. 2003. Optimal Structure Identification with Greedy Search. J. Mach. Learn. Res. 3, null (3 2003), 507–554. https://doi.org/10.1162/153244303321897717
  7. Bayesian structure learning with generative flow networks. In Proceedings of the Thirty-Eighth Conference on Uncertainty in Artificial Intelligence (Proceedings of Machine Learning Research, Vol. 180), James Cussens and Kun Zhang (Eds.). PMLR, 518–528. https://proceedings.mlr.press/v180/deleu22a.html
  8. P. Erdós and A. Rényi. 1959. On random graphs. Publicationes Mathematicae (1959), 290–297.
  9. Factored Adaptation for Non-Stationary Reinforcement Learning. In Advances in Neural Information Processing Systems, S Koyejo, S Mohamed, A Agarwal, D Belgrave, K Cho, and A Oh (Eds.), Vol. 35. Curran Associates, Inc., 31957–31971. https://proceedings.neurips.cc/paper_files/paper/2022/file/cf4356f994917177213c55ff438ddf71-Paper-Conference.pdf
  10. Review of Causal Discovery Methods Based on Graphical Models. Frontiers in Genetics 10 (6 2019). https://doi.org/10.3389/fgene.2019.00524
  11. Uzma Hasan and Md Osman Gani. 2022. Kcrl: A prior knowledge based causal discovery framework with reinforcement learning. In Machine Learning for Healthcare Conference. 691–714.
  12. Alain Hauser and Buhlmann@stat Math Ethz Ch. 2012. Characterization and Greedy Learning of Interventional Markov Equivalence Classes of Directed Acyclic Graphs Peter B ¨ uhlmann. Journal of Machine Learning Research 13 (2012), 2409–2464.
  13. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural Computation 9, 8 (11 1997), 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
  14. Constraint-based Causal Discovery: Conflict Resolution with Answer Set Programming. In Conference on Uncertainty in Artificial Intelligence. 340–349.
  15. Calculus on MDPs: Potential Shaping as a Gradient. (8 2022). https://arxiv.org/abs/2208.09570v2
  16. Causal Discovery for Modular World Models. In NeurIPS 2022 Workshop on Neuro Causal and Symbolic AI (nCSI). https://openreview.net/forum?id=VfkjQzdGCH
  17. GFlowCausal: Generative Flow Networks for Causal Discovery. (2022). http://arxiv.org/abs/2210.08185
  18. Efficient Neural Causal Discovery without Acyclicity Constraints. In International Conference on Learning Representations.
  19. Amortized Inference for Causal Structure Learning. In Advances in Neural Information Processing Systems, S Koyejo, S Mohamed, A Agarwal, D Belgrave, K Cho, and A Oh (Eds.), Vol. 35. Curran Associates, Inc., 13104–13118. https://proceedings.neurips.cc/paper_files/paper/2022/file/54f7125dee9b8b3dc798bb9a082b09e2-Paper-Conference.pdf
  20. Predicting causal effects in large-scale systems from observational data. Nature Methods 2010 7:4 7, 4 (4 2010), 247–248. https://doi.org/10.1038/nmeth0410-247
  21. Causal Discovery and Reinforcement Learning: A Synergistic Integration. In Proceedings of The 11th International Conference on Probabilistic Graphical Models (Proceedings of Machine Learning Research, Vol. 186), Antonio Salmerón and Rafael Rumi (Eds.). PMLR, 421–432. https://proceedings.mlr.press/v186/mendez-molina22a.html
  22. Human-level control through deep reinforcement learning. Nature 518 (2015). https://doi.org/10.1038/nature14236
  23. Joint Causal Inference from Multiple Contexts. Journal of Machine Learning Research 21 (2020), 1–108. http://jmlr.org/papers/v21/17-123.html.
  24. Causal Induction from Visual Observations for Goal Directed Tasks. (2019).
  25. Policy invariance under reward transformations: Theory and application to reward shaping. ICML (1999), 278–287.
  26. Judea Pearl. 2009. Introduction to Probabilities, Graphs, and Causal Models. In Causality. Cambridge University Press, 1–40. https://doi.org/10.1017/CBO9780511803161.003
  27. Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data. Science 308, 5721 (4 2005), 523–529. https://doi.org/10.1126/science.1105809
  28. A Meta-Reinforcement Learning Algorithm for Causal Discovery. In Proceedings of the Second Conference on Causal Learning and Reasoning (Proceedings of Machine Learning Research, Vol. 213), Mihaela van der Schaar, Cheng Zhang, and Dominik Janzing (Eds.). PMLR, 602–619. https://proceedings.mlr.press/v213/sauter23a.html
  29. Learning Neural Causal Models with Active Interventions. (2022).
  30. Learning Causal Graphs with Small Interventions. In Advances in Neural Information Processing Systems, C Cortes, N Lawrence, D Lee, M Sugiyama, and R Garnett (Eds.), Vol. 28. Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2015/file/b865367fc4c0845c0682bd466e6ebf4c-Paper.pdf
  31. Causation, prediction, and search. MIT press.
  32. R.S. Sutton and A.G. Barto. 2018. Reinforcement learning: An introduction. MIT press.
  33. Interventions, Where and How? Experimental Design for Causal Models at Scale. Advances in Neural Information Processing Systems 35 (12 2022), 24130–24143. https://github.com/yannadani/cbed
  34. Meta learning for causal direction. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 9897–9905.
  35. D’ya Like DAGs? A Survey on Structure Learning and Causal Discovery. Comput. Surveys 55, 4 (4 2023), 1–36. https://doi.org/10.1145/3527154
  36. Ordering-Based Causal Discovery with Reinforcement Learning. (2021). https://arxiv.org/abs/2105.
  37. Christopher J C H Watkins and Peter Dayan. 1992. Q-Learning. 8 (1992), 279–292.
  38. DAG-GNN: DAG Structure Learning with Graph Neural Networks. In Proceedings of the 36th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). PMLR, 7154–7163. https://proceedings.mlr.press/v97/yu19a.html
  39. DAGs with NO TEARS: Continuous Optimization for Structure Learning. (2018). https://github.com/xunzheng/notears.
  40. Causal Discovery with Reinforcement Learning. (2019). http://arxiv.org/abs/1906.04477
  41. Matjaz Zwitter and Milan Soklic. 1988. Breast Cancer. UCI Machine Learning Repository.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Andreas W. M. Sauter (2 papers)
  2. Nicolò Botteghi (19 papers)
  3. Erman Acar (27 papers)
  4. Aske Plaat (76 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com