Unbiased Filtering Of Accidental Clicks in Verizon Media Native Advertising (2312.05017v1)
Abstract: Verizon Media (VZM) native advertising is one of VZM largest and fastest growing businesses, reaching a run-rate of several hundred million USDs in the past year. Driving the VZM native models that are used to predict event probabilities, such as click and conversion probabilities, is OFFSET - a feature enhanced collaborative-filtering based event-prediction algorithm. In this work we focus on the challenge of predicting click-through rates (CTR) when we are aware that some of the clicks have short dwell-time and are defined as accidental clicks. An accidental click implies little affinity between the user and the ad, so predicting that similar users will click on the ad is inaccurate. Therefore, it may be beneficial to remove clicks with dwell-time lower than a predefined threshold from the training set. However, we cannot ignore these positive events, as filtering these will cause the model to under predict. Previous approaches have tried to apply filtering and then adding corrective biases to the CTR predictions, but did not yield revenue lifts and therefore were not adopted. In this work, we present a new approach where the positive weight of the accidental clicks is distributed among all of the negative events (skips), based on their likelihood of causing accidental clicks, as predicted by an auxiliary model. These likelihoods are taken as the correct labels of the negative events, shifting our training from using only binary labels and adopting a binary cross-entropy loss function in our training process. After showing offline performance improvements, the modified model was tested online serving VZM native users, and provided 1.18% revenue lift over the production model which is agnostic to accidental clicks.
- Off-set: one-pass factorization of feature sets for online recommendation in persistent cold start settings. In Proc. of RecSys’2013, pages 375–378, 2013.
- Adaptive online hyper-parameters tuning for ad event-prediction models. In Proc. of WWW’2017, pages 672–679, 2017.
- Soft frequency capping for improved ad click prediction in yahoo gemini native. In Proc. of CIKM’2019, pages 2793–2801, 2019.
- Carousel ads optimization in yahoo gemini native. In Proc. of KDD’2019, pages 1993–2001, 2019.
- Build your own music recommender by modeling internet radio streams. In Proc. of WWW’2012, pages 1–10, 2012.
- Feature enhancement via user similarities networks for improved click prediction in yahoo gemini native. In Proc. of CIKM’2019, pages 2557–2565, 2019.
- On the optimality of conditional expectation as a bregman predictor. IEEE Transactions on Information Theory, 51(7):2664–2669, 2005.
- Lessons from the netflix prize challenge. ACM SIGKDD Explorations Newsletter, 9(2):75–79, 2007.
- L.M. Bregman. The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming. USSR Computational Mathematics and Mathematical Physics, 7(3):200 – 217, 1967.
- Adaptive subgradient methods for online learning and stochastic optimization. Journal of machine learning research (JMLR), 12(7):2121–2159, 2011.
- Practical lessons from predicting clicks on ads at facebook. In Proc. of the 8th International Workshop on Data Mining for Online Advertising, pages 1–9, 2014.
- Field-aware factorization machines in a real-world online advertising system. In Proc. of WWW’2017, pages 680–688, 2017.
- Field-aware factorization machines for ctr prediction. In Proc. of RecSys’2016, pages 43–50, 2016.
- Matrix factorization techniques for recommender systems. Computer, 42(8):30–37, 2009.
- The cross entropy method for classification. In Proc. of the 22nd international conference on Machine learning, pages 561–568, 2005.
- Ad click prediction: a view from the trenches. In Proc. of KDD’2013, pages 1222–1230, 2013.
- Field-weighted factorization machines for click-through rate prediction in display advertising. In Proc. of WWW’2018, pages 1349–1357, 2018.
- Steffen Rendle. Factorization machines. In Proc. of IEEE International Conference on Data Mining (IDCM), pages 995–1000, 2010.
- Ad close mitigation for improved user experience in native advertisements. In Proc. of WSDM’2020, pages 546–554, 2020.
- Harald Steck. Evaluation of recommendations: rating-prediction and ranking. In Proc. of RecSys’2013, pages 213–220, 2013.
- You must have clicked on this ad by mistake! data-driven identification of accidental clicks on mobile ads with applications to advertiser cost discounting and click-through rate prediction. International Journal of Data Science and Analytics, 7(1):53–66, 2019.
- Doubly robust joint learning for recommendation on data missing not at random. In Proc. of ICML’2019, pages 6638–6647, 2019.
- Beyond clicks: dwell time for personalization. In Proc. of RecSys’2014, pages 113–120, 2014.
- Improving ad click prediction by considering non-displayed events. In Proc. of CIKM’2019, pages 329–338, 2019.