Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
153 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multiple Instance Learning for Uplift Modeling (2312.09639v1)

Published 15 Dec 2023 in cs.LG and cs.AI

Abstract: Uplift modeling is widely used in performance marketing to estimate effects of promotion campaigns (e.g., increase of customer retention rate). Since it is impossible to observe outcomes of a recipient in treatment (e.g., receiving a certain promotion) and control (e.g., without promotion) groups simultaneously (i.e., counter-factual), uplift models are mainly trained on instances of treatment and control groups separately to form two models respectively, and uplifts are predicted by the difference of predictions from these two models (i.e., two-model method). When responses are noisy and the treatment effect is fractional, induced individual uplift predictions will be inaccurate, resulting in targeting undesirable customers. Though it is impossible to obtain the ideal ground-truth individual uplifts, known as Individual Treatment Effects (ITEs), alternatively, an average uplift of a group of users, called Average Treatment Effect (ATE), can be observed from experimental deliveries. Upon this, similar to Multiple Instance Learning (MIL) in which each training sample is a bag of instances, our framework sums up individual user uplift predictions for each bag of users as its bag-wise ATE prediction, and regularizes it to its ATE label, thus learning more accurate individual uplifts. Additionally, to amplify the fractional treatment effect, bags are composed of instances with adjacent individual uplift predictions, instead of random instances. Experiments conducted on two datasets show the effectiveness and universality of the proposed framework.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (43)
  1. Normal approximation for stochastic gradient descent via non-asymptotic rates of martingale CLT. In Conference on Learning Theory. PMLR, 115–137.
  2. Susan Athey and Guido W Imbens. 2015. Machine learning methods for estimating heterogeneous causal effects. stat 1050, 5 (2015), 1–26.
  3. MBSTAR: multiple instance learning for predicting specific functional binding sites in microRNA targets. Scientific reports 5, 1 (2015), 1–12.
  4. Uplift prediction with dependent feature representation in imbalanced treatment and control conditions. In International Conference on Neural Information Processing. Springer, 47–57.
  5. Treatment Targeting by AUUC Maximization with Generalization Guarantees. (2020).
  6. Uplift modeling with generalization guarantees. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 55–65.
  7. Compact Multiple-Instance Learning. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (Singapore, Singapore) (CIKM ’17). Association for Computing Machinery, New York, NY, USA, 2007–2010. https://doi.org/10.1145/3132847.3133070
  8. Statistical inference for model parameters in stochastic gradient descent. arXiv preprint arXiv:1610.08637 (2016).
  9. David Maxwell Chickering and David Heckerman. 2013. A decision theoretic approach to targeted advertising. arXiv preprint arXiv:1301.3842 (2013).
  10. Learning to Rank for Uplift Modeling. IEEE Transactions on Knowledge and Data Engineering (2020), 1–1. https://doi.org/10.1109/TKDE.2020.3048510
  11. A large scale benchmark for uplift modeling. In KDD.
  12. Solving the multiple instance problem with axis-parallel rectangles. Artificial intelligence 89, 1-2 (1997), 31–71.
  13. Ji Feng and Zhi-Hua Zhou. 2017. Deep MIML network. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 31.
  14. Multiple-Instance Learning from Similar and Dissimilar Bags. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 374–382.
  15. Random forests for uplift modeling: an insurance customer retention case. In International Conference on Modeling and Simulation in Engineering, Economics and Management. Springer, 123–133.
  16. Pierre Gutierrez and Jean-Yves Gérardy. 2017. Causal inference and uplift modelling: A review of the literature. In International Conference on Predictive Applications and APIs. PMLR, 1–13.
  17. Behram Hansotia and Brad Rukstales. 2002. Incremental value modeling. Journal of Interactive Marketing 16, 3 (2002), 35.
  18. Tobias Hatt and Stefan Feuerriegel. 2021. Estimating Average Treatment Effects via Orthogonal Regularization. Association for Computing Machinery, New York, NY, USA, 680–689. https://doi.org/10.1145/3459637.3482339
  19. Attention-based deep multiple instance learning. In International conference on machine learning. PMLR, 2127–2136.
  20. Maciej Jaskowski and Szymon Jaroszewicz. 2012. Uplift modeling for clinical trial data. In ICML Workshop on Clinical Data Analysis, Vol. 46.
  21. From group to individual labels using deep features. In Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining. 597–606.
  22. Key instance detection in multi-instance learning. In Asian Conference on Machine Learning. PMLR, 253–268.
  23. Victor S. Y. Lo. 2002. The True Lift Model: A Novel Data Mining Approach to Response Modeling in Database Marketing. SIGKDD Explor. Newsl. (2002).
  24. Causal Effect Inference with Deep Latent-Variable Models. In Proceedings of the 31st International Conference on Neural Information Processing Systems (Long Beach, California, USA) (NIPS’17). Curran Associates Inc., Red Hook, NY, USA, 6449–6459.
  25. Weakly-supervised action localization with expectation-maximization multi-instance learning. In European conference on computer vision. Springer, 729–745.
  26. Yangling Ma and Zhouwang Yang. 2021. Multi-Instance Learning by Utilizing Structural Relationship among Instances. arXiv preprint arXiv:2102.01889 (2021).
  27. Nikolaos Pappas and Andrei Popescu-Belis. 2014. Explaining the stars: Weighted multiple-instance learning for aspect-based sentiment analysis. In Proceedings of the 2014 Conference on Empirical Methods In Natural Language Processing (EMNLP). 455–466.
  28. A joint optimization of incrementality and revenue to satisfy both advertiser and publisher. In Proceedings of the 22nd International Conference on World Wide Web. ACM. https://doi.org/10.1145/2487788.2487846
  29. Minlong Peng and Qi Zhang. 2019. Address instance-level label prediction in multiple instance learning. arXiv preprint arXiv:1905.12226 (2019).
  30. Nicholas Radcliffe and Patrick Surry. 1999. Differential response analysis: Modeling true responses by isolating the effect of a single action. Credit Scoring and Credit Control IV (1999).
  31. Nicholas J Radcliffe and Patrick D Surry. 2011. Real-world uplift modelling with significance-based uplift trees. White Paper TR-2011-1, Stochastic Solutions (2011), 1–33.
  32. Piotr Rzepakowski and Szymon Jaroszewicz. 2010. Decision trees for uplift modeling. In 2010 IEEE International Conference on Data Mining. IEEE, 441–450.
  33. Estimating individual treatment effect: generalization bounds and algorithms. In International Conference on Machine Learning. PMLR, 3076–3085.
  34. AMIL: Adversarial multi-instance learning for human pose estimation. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM) 16, 1s (2020), 1–23.
  35. Adapting Neural Networks for the Estimation of Treatment Effects. Curran Associates Inc., Red Hook, NY, USA.
  36. Ensemble methods for uplift modeling. Data mining and knowledge discovery 29, 6 (2015), 1531–1559.
  37. Exploring Features for Complicated Objects: Cross-View Feature Selection for Multi-Instance Learning. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management (Shanghai, China) (CIKM ’14). Association for Computing Machinery, New York, NY, USA, 1699–1708. https://doi.org/10.1145/2661829.2662041
  38. Uplift modeling from separate labels. arXiv preprint arXiv:1803.05112 (2018).
  39. GANITE: Estimation of individualized treatment effects using generative adversarial nets. In International Conference on Learning Representations.
  40. Łukasz Zaniewicz and Szymon Jaroszewicz. 2013. Support vector machines for uplift modeling. In 2013 IEEE 13th International Conference on Data Mining Workshops. IEEE, 131–138.
  41. Qi Zhang and Sally A Goldman. 2001. EM-DD: An improved multiple-instance learning technique. In Advances in neural information processing systems. 1073–1080.
  42. A practically competitive and provably consistent algorithm for uplift modeling. In 2017 IEEE International Conference on Data Mining (ICDM). IEEE, 1171–1176.
  43. Deep multi-instance networks with sparse label assignment for whole mammogram classification. In International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 603–611.
Citations (2)

Summary

We haven't generated a summary for this paper yet.