Realization of Causal Representation Learning to Adjust Confounding Bias in Latent Space (2211.08573v9)
Abstract: Causal DAGs (Directed Acyclic Graphs) are usually considered in a 2D plane. Edges indicate the directions of causal effects and imply the time spans over which those effects occur. Due to the natural restrictions of statistical models, effect estimation is usually approximated by averaging individuals' correlations, i.e., observational changes over a specific time span. However, in machine learning on large-scale problems with complex DAGs, the slight biases introduced by this approximation can snowball and distort global models. More importantly, they have practically impeded the development of AI, for instance through the weak generalizability of causal models. In this paper, we redefine the causal DAG as a \emph{do-DAG}, in which variables' values are no longer time-stamp-dependent and timelines can be seen as axes. Through a geometric explanation of the multi-dimensional do-DAG, we identify the \emph{Causal Representation Bias} and its necessary factors, distinguishing it from common confounding biases. Accordingly, we propose a DL (Deep Learning)-based framework as the general solution, along with a realization method and experiments to verify its feasibility.