Papers
Topics
Authors
Recent
Search
2000 character limit reached

Use of Prior Knowledge to Discover Causal Additive Models with Unobserved Variables and its Application to Time Series Data

Published 14 Jan 2024 in cs.LG, stat.ME, and stat.ML | (2401.07231v3)

Abstract: This paper proposes two methods for causal additive models with unobserved variables (CAM-UV). CAM-UV assumes that the causal functions take the form of generalized additive models and that latent confounders are present. First, we propose a method that leverages prior knowledge for efficient causal discovery. Then, we propose an extension of this method for inferring causality in time series data. The original CAM-UV algorithm differs from other existing causal function models in that it does not seek the causal order between observed variables, but rather aims to identify the causes for each observed variable. Therefore, the first proposed method in this paper utilizes prior knowledge, such as understanding that certain variables cannot be causes of specific others. Moreover, by incorporating the prior knowledge that causes precedes their effects in time, we extend the first algorithm to the second method for causal discovery in time series data. We validate the first proposed method by using simulated data to demonstrate that the accuracy of causal discovery increases as more prior knowledge is accumulated. Additionally, we test the second proposed method by comparing it with existing time series causal discovery methods, using both simulated data and real-world data.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (22)
  1. CAM: Causal additive models, high-dimensional order search and penalized regression. Annals of Statistics 42, 6 (2014), 2526–2556.
  2. Causal discovery with cascade nonlinear additive noise model. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19 (7 2019), International Joint Conferences on Artificial Intelligence Organization, pp. 1609–1615.
  3. Chickering, D. M. Optimal structure identification with greedy search. Journal of Machine Learning Research 3, Nov (2002), 507–554.
  4. Search for additive nonlinear time series causal models. Journal of Machine Learning Research 9, 32 (2008), 967–991.
  5. Causal discovery for climate research using graphical models. Journal of Climate 25, 17 (2012), 5648 – 5665.
  6. On causal discovery from time series data using FCI. In Proceedings of the 5th European Workshop on Probabilistic Graphical Models (2010).
  7. High-recall causal discovery for autocorrelated time series with latent confounders. In Advances in Neural Information Processing Systems (2020), H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, Eds., vol. 33, Curran Associates, Inc., pp. 12615–12625.
  8. Generalized additive models. Chapman and Hall/CRC, 1990.
  9. Estimation of a structural vector autoregression model using non-gaussianity. Journal of Machine Learning Research 11, 56 (2010), 1709–1731.
  10. Price discovery between carbonated soft drink manufacturers and retailers: A disaggregate analysis with PC and LiNGAM algorithms. Journal of Applied Economics 18, 1 (2015), 173–197.
  11. Causal additive models with unobserved variables. In Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence (2021), C. de Campos and M. H. Maathuis, Eds., vol. 161 of Proceedings of Machine Learning Research, PMLR, pp. 97–106.
  12. Discovery of causal additive models in the presence of unobserved variables. arXiv preprint arXiv: 2106.02234 (2021).
  13. Causal structure learning from multivariate time series in settings with unmeasured confounding. In Proceedings of 2018 ACM SIGKDD Workshop on Causal Disocvery (2018), T. D. Le, K. Zhang, E. Kıcıman, A. Hyvärinen, and L. Liu, Eds., vol. 92 of Proceedings of Machine Learning Research, PMLR, pp. 23–47.
  14. Causal inference on time series using restricted structural equation models. In Advances in Neural Information Processing Systems (2013), C. Burges, L. Bottou, M. Welling, Z. Ghahramani, and K. Weinberger, Eds., vol. 26, Curran Associates, Inc.
  15. Causal discovery with continuous additive noise models. The Journal of Machine Learning Research 15, 1 (2014), 2009–2053.
  16. Detecting and quantifying causal associations in large nonlinear time series datasets. Science Advances 5, 11 (2019).
  17. A linear non-Gaussian acyclic model for causal discovery. Journal of Machine Learning Research 7, Oct (2006), 2003–2030.
  18. DirectLiNGAM: a direct method for learning a linear non-Gaussian structural equation model. Journal of Machine Learning Research 12, Apr (2011), 1225–1248.
  19. Network modelling methods for FMRI. NeuroImage 54, 2 (2011), 875–891.
  20. An algorithm for fast recovery of sparse causal graphs. Social Science Computer Review 9, 1 (1991), 62–72.
  21. Causal discovery in the presence of latent variables and selection bias. In Computation, Causality, and Discovery, G. F. Cooper and C. N. Glymour, Eds. AAAI Press, 1999, pp. 211–252.
  22. DAGs with NO TEARS: Continuous optimization for structure learning. In Advances in Neural Information Processing Systems (2018), S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, Eds., vol. 31, Curran Associates, Inc.

Summary

  • The paper presents the CAM-UV-PK method, which incorporates explicit prior knowledge to rule out impossible causal links, enhancing precision and F-measure.
  • It introduces TS-CAM-UV for time series data, using temporal priority to accurately model cause-effect relationships in dynamic settings.
  • Both methods are validated on simulated and real-world datasets, demonstrating robust performance compared to existing causal discovery techniques.

Overview

A paper focusing on causal discovery has introduced two significant advancements targeting a specialized subset of causal inference problems. These problems involve situations with unobserved variables—factors that can influence the outcome but are not measured or included in the data. Traditional models often struggle with these hidden confounders, leading to incorrect causal relationships. To tackle this, the authors propose enhancements to the Causal Additive Models with Unobserved Variables (CAM-UV) framework.

Advancements in Causal Discovery

CAM-UV with Prior Knowledge (CAM-UV-PK)

Understanding causal relations when some variables remain unobserved is challenging. The CAM-UV-PK method builds on the existing CAM-UV algorithm. It integrates prior knowledge in the form of explicit statements ruling out certain causes, leading to more accurate causal analysis. The provided prior knowledge helps to define which variables cannot be a cause for others, enhancing causality detection accuracy.

Time Series CAM-UV (TS-CAM-UV)

The second method, TS-CAM-UV, focuses on time series data while incorporating the assumption that past events influence future events but not vice versa. This concept, known as "time priority," is used as prior knowledge to refine causal inferences. The novelty here is that this approach combines the power of functional causal models with the flexibility to include unobserved variables, which is a first in this field.

Validation through Simulated and Real-World Data

The validation of the proposed methods involved rigorous testing against both simulated and real-world datasets. For CAM-UV-PK, the improvement was clear: precision and the overall F-measure increased with the incorporation of more prior knowledge. This proved the usefulness of prior knowledge in refining causal inferences.

Moving to the dynamic field, TS-CAM-UV was compared with existing time series causal discovery methods, explicitly LPCMCI and VarLiNGAM. The results were promising, showing superior or comparable performance across precision, recall, and F-measure, especially as sample sizes increased.

Significance and Future Directions

The significance of these proposed methods lies in their ability to identify causal relationships in the presence of latent confounders accurately. This enhances our understanding of complex systems where not all influencing factors are observable.

While the current focus is on models with acyclical causal graphs, the researchers plan to extend their framework to cyclical models. As data slices become finer, the opportunity for capturing rapid causal effects increases, necessitating methods that can handle contemporaneous effects with cycles.

Conclusion

In conclusion, this paper bridges a critical gap in causal discovery for data with unobserved variables. Whether dealing with static snapshots or evolving over time, CAM-UV-PK and TS-CAM-UV provide robust tools for unraveling the intricate webs of causality that underpin the data-rich world around us.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 3 tweets with 30 likes about this paper.