Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
120 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Estimating Joint interventional distributions from marginal interventional data (2409.01794v1)

Published 3 Sep 2024 in stat.ME, cs.LG, and stat.ML

Abstract: In this paper we show how to exploit interventional data to acquire the joint conditional distribution of all the variables using the Maximum Entropy principle. To this end, we extend the Causal Maximum Entropy method to make use of interventional data in addition to observational data. Using Lagrange duality, we prove that the solution to the Causal Maximum Entropy problem with interventional constraints lies in the exponential family, as in the Maximum Entropy solution. Our method allows us to perform two tasks of interest when marginal interventional distributions are provided for any subset of the variables. First, we show how to perform causal feature selection from a mixture of observational and single-variable interventional data, and, second, how to infer joint interventional distributions. For the former task, we show on synthetically generated data, that our proposed method outperforms the state-of-the-art method on merging datasets, and yields comparable results to the KCI-test which requires access to joint observations of all variables.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (35)
  1. A maximum entropy approach to natural language processing. Computational linguistics, 22(1):39–71, 1996.
  2. JAX: composable transformations of Python+NumPy programs, 2018. URL http://github.com/google/jax.
  3. Causal discovery from a mixture of experimental and observational data. In Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence, pp.  116–125, 1999.
  4. Integrating locally learned causal structures with overlapping variables. Advances in Neural Information Processing Systems, 21, 2008.
  5. On a least squares adjustment of a sampled frequency table when the expected marginal totals are known. The Annals of Mathematical Statistics, 11(4):427–444, 1940.
  6. Exact bayesian structure learning from uncertain interventions. In Meila, M. and Shen, X. (eds.), Proceedings of the Eleventh International Conference on Artificial Intelligence and Statistics, volume 2 of Proceedings of Machine Learning Research, pp.  107–114, San Juan, Puerto Rico, 21–24 Mar 2007. PMLR. URL https://proceedings.mlr.press/v2/eaton07a.html.
  7. Identification of average causal effects in confounded additive noise models. arXiv preprint arXiv:2407.10014, 2024.
  8. A minimax approach to supervised learning. Advances in Neural Information Processing Systems, 29, 2016.
  9. Obtaining causal information by merging datasets with maxent. In International Conference on Artificial Intelligence and Statistics, pp.  581–603. PMLR, 2022.
  10. Causal inference through the structural causal marginal problem. In International Conference on Machine Learning, pp. 7793–7824. PMLR, 2022.
  11. Invariant causal prediction for nonlinear models. Journal of Causal Inference, 6(2), 2018.
  12. Rice yield grown in different fertilizer combination and planting methods: Case study in buru island, indonesia. Open Agriculture, 7(1):871–881, 2022.
  13. Janzing, D. Causal versions of maximum entropy and principle of insufficient reason. Journal of Causal Inference, 9(1):285–301, 2021.
  14. Distinguishing cause and effect via second order exponential models. arXiv preprint arXiv:0910.5561, 2009.
  15. Jaynes, E. T. Information theory and statistical mechanics. Physical review, 106(4):620, 1957.
  16. Jaynes, E. T. Probability theory: The logic of science. Cambridge university press, 2003.
  17. Disentangling causal effects from sets of interventions in the presence of unobserved confounders. Advances in Neural Information Processing Systems, 35:27850–27861, 2022.
  18. Kellerer, H. G. Maßtheoretische marginalprobleme. Mathematische Annalen, 153(3):168–198, June 1964. doi: 10.1007/bf01360315. URL https://doi.org/10.1007/bf01360315.
  19. Probabilistic graphical models: principles and techniques. MIT press, 2009.
  20. Joint causal inference from multiple contexts. The Journal of Machine Learning Research, 21(1):3919–4026, 2020.
  21. Pearl, J. Causality. Cambridge university press, 2009.
  22. The Book of Why: The New Science of Cause and Effect. Basic Books, Inc., USA, 1st edition, 2018. ISBN 046509760X.
  23. Causal inference by using invariant prediction: identification and confidence intervals. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 78(5):947–1012, 2016.
  24. Effect of irrigation and fertilizer management on rice yield and nitrogen loss: A meta-analysis. Plants, 11(13):1690, 2022.
  25. Learning joint nonlinear effects from single-variable interventions in the presence of hidden confounders. In Conference on Uncertainty in Artificial Intelligence, pp. 300–309. PMLR, 2020.
  26. Bounding probabilities of causation through the causal marginal problem. arXiv preprint arXiv:2304.02023, 2023.
  27. Causal inference by choosing graphs with most plausible markov kernels. In Ninth International Symposium on Artificial Intelligence and Mathematics (AIMath 2006), pp.  1–11, 2006.
  28. Causal discovery from changes. In Proceedings of the Seventeenth conference on Uncertainty in artificial intelligence, pp.  512–521, 2001.
  29. A general identification condition for causal effects. eScholarship, University of California, 2002.
  30. Learning equivalence classes of acyclic models with latent and selection variables from multiple datasets with overlapping variables. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, pp.  3–15. JMLR Workshop and Conference Proceedings, 2011.
  31. Tillman, R. E. Structure learning with independent non-identically distributed data. In Proceedings of the 26th Annual International Conference on Machine Learning, pp.  1041–1048, 2009.
  32. Constraint-based causal discovery from multiple interventions over overlapping variable sets. The Journal of Machine Learning Research, 16(1):2147–2205, 2015.
  33. Graphical models, exponential families, and variational inference. Foundations and Trends® in Machine Learning, 1(1–2):1–305, 2008.
  34. Effect of fertilizer management on potassium dynamics and yield of rainfed lowland rice in indonesia. Chilean journal of agricultural research, 82(1):33–43, 2022.
  35. Kernel-based conditional independence test and application in causal discovery. In 27th Conference on Uncertainty in Artificial Intelligence (UAI 2011), pp.  804–813. AUAI Press, 2011.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets