Graph Neural Network with Two Uplift Estimators for Label-Scarcity Individual Uplift Modeling (2403.06489v1)
Abstract: Uplift modeling aims to measure the incremental effect, which we call uplift, of a strategy or action on the users from randomized experiments or observational data. Most existing uplift methods only use individual data, which are usually not informative enough to capture the unobserved and complex hidden factors regarding the uplift. Furthermore, uplift modeling scenario usually has scarce labeled data, especially for the treatment group, which also poses a great challenge for model training. Considering that the neighbors' features and the social relationships are very informative to characterize a user's uplift, we propose a graph neural network-based framework with two uplift estimators, called GNUM, to learn from the social graph for uplift estimation. Specifically, we design the first estimator based on a class-transformed target. The estimator is general for all types of outcomes, and is able to comprehensively model the treatment and control group data together to approach the uplift. When the outcome is discrete, we further design the other uplift estimator based on our defined partial labels, which is able to utilize more labeled data from both the treatment and control groups, to further alleviate the label scarcity problem. Comprehensive experiments on a public dataset and two industrial datasets show a superior performance of our proposed framework over state-of-the-art methods under various evaluation metrics. The proposed algorithms have been deployed online to serve real-world uplift estimation scenarios.
- A survey and benchmarking study of multitreatment uplift modeling. Data Mining and Knowledge Discovery, 34(2):273–308, 2020.
- Revenue uplift modeling. Machine Learning for Marketing Decision Support, 2017.
- Uplift modeling with survival data. In ACM SIGKDD Workshop on Health Informatics (HI-KDD–14), New York City, 2014.
- Uplift modeling for clinical trial data. In ICML Workshop on Clinical Data Analysis, 2012.
- Uplift modeling for preventing student dropout in higher education. Decision Support Systems, 134:113320, 2020.
- Counterfactual evaluation of treatment assignment functions with networked observational data. In Proceedings of the 2020 SIAM International Conference on Data Mining, pages 271–279. SIAM, 2020.
- Causal inference and uplift modelling: A review of the literature. In International Conference on Predictive Applications and APIs, pages 1–13, 2017.
- Data-driven variable decomposition for treatment effect estimation. IEEE Transactions on Knowledge and Data Engineering, 2020.
- Uplift random forests. Cybernetics and Systems, 46(3-4):230–248, 2015.
- Decision trees for uplift modeling with single and multiple treatments. Knowledge and Information Systems, 32(2):303–327, 2012.
- Leo Breiman. Random forests. Machine learning, 45(1):5–32, 2001.
- Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, pages 785–794, 2016.
- Using embeddings to correct for unobserved confounding in networks. Advances in Neural Information Processing Systems, 32, 2019.
- Learning individual causal effects from networked observational data. In Proceedings of the 13th International Conference on Web Search and Data Mining, pages 232–240, 2020.
- Nicholas J Radcliffe. Using control groups to target on predicted lift: Building and assessing uplift models. Direct Marketing Analytics Journal, 1(3):14–21, 2007.
- Uplift modeling with roc: An srl case study. In ILP (Late Breaking Papers), pages 40–45, 2013.
- Real-world uplift modelling with significance-based uplift trees. White Paper TR-2011-1, Stochastic Solutions, pages 1–33, 2011.
- Ensemble methods for uplift modeling. Data mining and knowledge discovery, 29(6):1531–1559, 2015.
- Machine learning methods for estimating heterogeneous causal effects. stat, 1050(5):1–26, 2015.
- Support vector machines for uplift modeling. In 2013 IEEE 13th International Conference on Data Mining Workshops, pages 131–138. IEEE, 2013.
- Optimal personalized treatment rules for marketing interventions: A review of methods, a new proposal, and an insurance case study. UB Riskcenter Working Paper Series, 2014/06, 2014.
- Implementing double-robust estimators of causal effects. The Stata Journal, 8(3):334–353, 2008.
- Doubly robust learning for estimating individualized treatment with censored data. Biometrika, 102(1):151–168, 2015.
- Estimating identifiable causal effects through double machine learning. In Proceedings of the 35th AAAI Conference on Artificial Intelligence, 2021.
- Double/debiased machine learning for treatment and structural parameters, 2018.
- Efficient estimation of average treatment effects using the estimated propensity score. Econometrica, 71(4):1161–1189, 2003.
- Who should be treated? empirical welfare maximization methods for treatment choice. Econometrica, 86(2):591–616, 2018.
- Causal effect inference with deep latent-variable models. Advances in neural information processing systems, 30, 2017.
- Estimating individual treatment effect: generalization bounds and algorithms. In International Conference on Machine Learning, pages 3076–3085. PMLR, 2017.
- Deconfounding with networked observational data in a dynamic environment. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining, pages 166–174, 2021.
- Estimating causal effects on networked observational data via representation learning. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management, pages 852–861, 2022.
- Learning causal effects on hypergraphs. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 1202–1212, 2022.
- E. Michael SOBEL. What do randomized studies of housing mobility demonstrate?: Causal inference in the face of interference. Journal of the American Statistical Association, 2006.
- Learning from ambiguously labeled examples. Intelligent Data Analysis, 10(5):419–439, 2006.
- A regularization approach for instance-based superset label learning. IEEE transactions on cybernetics, 48(3):967–978, 2017.
- Solving the partial label learning problem: An instance-based approach. In IJCAI, pages 4048–4054, 2015.
- Partial label learning via feature-aware disambiguation. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1335–1344, 2016.
- Semi-supervised multi-label learning with incomplete labels. IJCAI, pages 4062–4068, 2015.
- A light cnn for deep face representation with noisy labels. IEEE Transactions on Information Forensics and Security, pages 2884–2896, 2018.
- Geometric deep learning: going beyond euclidean data. IEEE Signal Processing Magazine, 34(4):18–42, 2017.
- Deep learning on graphs: A survey. IEEE Transactions on Knowledge and Data Engineering, 2020.
- Community preserving network embedding. In Proceedings of the AAAI conference on artificial intelligence, volume 31, 2017.
- A graph neural network framework for social recommendations. IEEE Transactions on Knowledge and Data Engineering, 34(5):2033–2047, 2020.
- Graph neural networks and their current applications in bioinformatics. Frontiers in genetics, 12:690049, 2021.
- Spectral networks and locally connected networks on graphs. In Proceedings of the 3rd International Conference on Learning Representations, 2014.
- Convolutional neural networks on graphs with fast localized spectral filtering. In Advances in Neural Information Processing Systems, pages 3844–3852, 2016.
- Semi-supervised classification with graph convolutional networks. In Proceedings of the 6th International Conference on Learning Representations, 2017.
- Neural message passing for quantum chemistry. In International Conference on Machine Learning, pages 1263–1272, 2017.
- Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems, pages 1024–1034, 2017.
- Should graph convolution trust neighbors? a simple causal inference method. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1208–1218, 2021.
- Causal attention for interpretable and generalizable graph classification. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 1696–1705, 2022.
- Causal incremental graph convolution for recommender system retraining. IEEE Transactions on Neural Networks and Learning Systems, 2022.
- A comprehensive survey on graph neural networks. IEEE transactions on neural networks and learning systems, 32(1):4–24, 2020.
- Geniepath: Graph neural networks with adaptive receptive paths. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 4424–4431, 2019.
- A survey of deep neural network architectures and their applications. Neurocomputing, 234:11–26, 2017.
- Understanding the difficulty of training deep feedforward neural networks. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pages 249–256, 2010.
- Agl: a scalable system for industrial-purpose graph machine learning. arXiv preprint arXiv:2003.02454, 2020.