Contrastive Balancing Representation Learning for Heterogeneous Dose-Response Curves Estimation (2403.14232v1)
Abstract: Estimating the individuals' potential response to varying treatment doses is crucial for decision-making in areas such as precision medicine and management science. Most recent studies predict counterfactual outcomes by learning a covariate representation that is independent of the treatment variable. However, such independence constraints neglect much of the covariate information that is useful for counterfactual prediction, especially when the treatment variables are continuous. To tackle the above issue, in this paper, we first theoretically demonstrate the importance of the balancing and prognostic representations for unbiased estimation of the heterogeneous dose-response curves, that is, the learned representations are constrained to satisfy the conditional independence between the covariates and both of the treatment variables and the potential responses. Based on this, we propose a novel Contrastive balancing Representation learning Network using a partial distance measure, called CRNet, for estimating the heterogeneous dose-response curves without losing the continuity of treatments. Extensive experiments are conducted on synthetic and real-world datasets demonstrating that our proposal significantly outperforms previous methods.
- Permutation weighting. In International Conference on Machine Learning, 331–341. PMLR.
- Counterfactual representation learning with balancing weights. In International Conference on Artificial Intelligence and Statistics, 1972–1980. PMLR.
- Invertible residual networks. In International Conference on Machine Learning, 573–582. PMLR.
- Estimating the effects of continuous-valued interventions using generative adversarial networks. Advances in Neural Information Processing Systems, 33: 16434–16445.
- A simple framework for contrastive learning of visual representations. In International Conference on Machine Learning, 1597–1607. PMLR.
- Exploring simple siamese representation learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 15750–15758.
- Club: A contrastive log-ratio upper bound of mutual information. In International Conference on Machine Learning, 1779–1788. PMLR.
- Covariate balancing propensity score for a continuous treatment: Application to the efficacy of political advertisements. The Annals of Applied Statistics, 12(1): 156–177.
- Exploiting Contrastive Learning and Numerical Evidence for Confusing Legal Judgment Prediction. In Findings of the Association for Computational Linguistics: EMNLP 2023, 12174–12185.
- Generative adversarial networks. Communications of the ACM, 63(11): 139–144.
- Bootstrap your own latent-a new approach to self-supervised learning. Advances in neural information processing systems, 33: 21271–21284.
- Hahn, J. 1998. On the role of the propensity score in efficient semiparametric estimation of average treatment effects. Econometrica, 315–331.
- Hainmueller, J. 2012. Entropy balancing for causal effects: A multivariate reweighting method to produce balanced samples in observational studies. Political analysis, 20(1): 25–46.
- Hansen, B. B. 2008. The prognostic analogue of the propensity score. Biometrika, 95(2): 481–488.
- Momentum contrast for unsupervised visual representation learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 9729–9738.
- The propensity score with continuous treatments. Applied Bayesian modeling and causal inference from incomplete-data perspectives, 226164: 73–84.
- Holland, P. W. 1986. Statistics and causal inference. Journal of the American statistical Association, 81(396): 945–960.
- Estimation of mean response via the effective balancing score. Biometrika, 101(3): 613–624.
- Joint sufficient dimension reduction and estimation of conditional and average treatment effects. Biometrika, 104(3): 583–596.
- Towards the generalization of contrastive self-supervised learning. arXiv preprint arXiv:2111.00743.
- Covariate balancing propensity score. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 76(1): 243–263.
- Causal inference with general treatment regimes: Generalizing the propensity score. Journal of the American Statistical Association, 99(467): 854–866.
- Imbens, G. W. 2000. The role of the propensity score in estimating dose-response functions. Biometrika, 87(3): 706–710.
- Understanding Dimensional Collapse in Contrastive Self-supervised Learning. In International Conference on Learning Representations.
- Kallus, N. 2020. Deepmatch: Balancing deep covariate representations for causal inference using adversarial training. In International Conference on Machine Learning, 5067–5077. PMLR.
- Non-parametric methods for doubly robust estimation of continuous treatment effects. Journal of the Royal Statistical Society. Series B (Statistical Methodology), 79(4): 1229–1245.
- A tutorial on energy-based learning. Predicting structured data, 1(0).
- Review and comparison of treatment effect estimators using propensity and prognostic scores. The international journal of biostatistics, 18(2): 357–380.
- Propensity matters: Measuring and enhancing balancing for recommendation. In International Conference on Machine Learning, 20182–20194. PMLR.
- Trustworthy policy learning under the counterfactual no-harm criterion. In International Conference on Machine Learning, 20575–20598. PMLR.
- Who should be given incentives? counterfactual optimal treatment regimes learning for recommendation. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 1235–1247.
- Statistical inference for causal effects. Modern analysis of customer surveys: With applications using R, 171–192.
- Vcnet and functional targeted regularization for learning causal effects of continuous treatments. In International Conference on Learning Representations.
- Pearl, J. 2009. Causality. Cambridge university press.
- Leveraging “big data” in respiratory medicine–data science, causal inference, and precision medicine. Expert Review of Respiratory Medicine, 15(6): 717–721.
- The central role of the propensity score in observational studies for causal effects. Biometrika, 70(1): 41–55.
- Rubin, D. B. 1974. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology, 66(5): 688.
- Learning counterfactual representations for estimating individual dose-response curves. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, 5612–5619.
- Estimating individual treatment effect: generalization bounds and algorithms. In International Conference on Machine Learning, 3076–3085. PMLR.
- A selective review of negative control methods in epidemiology. Current epidemiology reports, 7(4): 190–202.
- Prognostic score–based balance measures can be a useful diagnostic for propensity score methods in comparative effectiveness research. Journal of clinical epidemiology, 66(8): S84–S90.
- Partial distance correlation with methods for dissimilarities. The Annals of Statistics, 42(6): 2382–2412.
- Nonparametric estimation of population average dose-response curves using entropy balancing weights for continuous exposures. Health Services and Outcomes Research Methodology, 21(1): 69–110.
- Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association, 113(523): 1228–1242.
- Optimal transport for treatment effect estimation. Advances in Neural Information Processing Systems.
- Understanding contrastive representation learning through alignment and uniformity on the hypersphere. In International Conference on Machine Learning, 9929–9939. PMLR.
- Instrumental variable regression with confounder balancing. In International Conference on Machine Learning, 24056–24075. PMLR.
- Stable estimation of heterogeneous treatment effects. In International Conference on Machine Learning, 37496–37510. PMLR.
- Learning decomposed representations for treatment effect estimation. IEEE Transactions on Knowledge and Data Engineering, 35(5): 4989–5001.
- Unsupervised feature learning via non-parametric instance discrimination. In Proceedings of the IEEE conference on computer vision and pattern recognition, 3733–3742.
- Contrastive Learning with Positive-Negative Frame Mask for Music Representation. In WWW ’22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25 - 29, 2022, 2906–2915. ACM.
- Tree structure-aware few-shot image classification via hierarchical aggregation. In European Conference on Computer Vision, 453–470. Springer.
- CauseRec: Counterfactual User Sequence Synthesis for Sequential Recommendation. In SIGIR ’21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual Event, Canada, July 11-15, 2021, 367–377. ACM.