Partial Counterfactual Identification of Continuous Outcomes with a Curvature Sensitivity Model (2306.01424v3)
Abstract: Counterfactual inference aims to answer retrospective "what if" questions and thus belongs to the most fine-grained type of inference in Pearl's causality ladder. Existing methods for counterfactual inference with continuous outcomes aim at point identification and thus make strong and unnatural assumptions about the underlying structural causal model. In this paper, we relax these assumptions and aim at partial counterfactual identification of continuous outcomes, i.e., when the counterfactual query resides in an ignorance interval with informative bounds. We prove that, in general, the ignorance interval of the counterfactual queries has non-informative bounds, even when the functions of the structural causal models are continuously differentiable. As a remedy, we propose a novel sensitivity model called the Curvature Sensitivity Model. This allows us to obtain informative bounds by bounding the curvature of the level sets of the functions. We further show that existing point counterfactual identification methods are special cases of our Curvature Sensitivity Model when the bound on the curvature is set to zero. We then propose an implementation of our Curvature Sensitivity Model in the form of a novel deep generative model, which we call the Augmented Pseudo-Invertible Decoder. Our implementation employs (i) residual normalizing flows with (ii) variational augmentations. We empirically demonstrate the effectiveness of our Augmented Pseudo-Invertible Decoder. To the best of our knowledge, ours is the first partial identification model for Markovian structural causal models with continuous outcomes.