TF4CTR: Twin Focus Framework for CTR Prediction via Adaptive Sample Differentiation (2405.03167v2)
Abstract: Effective feature interaction modeling is critical for enhancing the accuracy of click-through rate (CTR) prediction in industrial recommender systems. Most of the current deep CTR models resort to building complex network architectures to better capture intricate feature interactions or user behaviors. However, we identify two limitations in these models: (1) the samples given to the model are undifferentiated, which may lead the model to learn a larger number of easy samples in a single-minded manner while ignoring a smaller number of hard samples, thus reducing the model's generalization ability; (2) differentiated feature interaction encoders are designed to capture different interactions information but receive consistent supervision signals, thereby limiting the effectiveness of the encoder. To bridge the identified gaps, this paper introduces a novel CTR prediction framework by integrating the plug-and-play Twin Focus (TF) Loss, Sample Selection Embedding Module (SSEM), and Dynamic Fusion Module (DFM), named the Twin Focus Framework for CTR (TF4CTR). Specifically, the framework employs the SSEM at the bottom of the model to differentiate between samples, thereby assigning a more suitable encoder for each sample. Meanwhile, the TF Loss provides tailored supervision signals to both simple and complex encoders. Moreover, the DFM dynamically fuses the feature interaction information captured by the encoders, resulting in more accurate predictions. Experiments on five real-world datasets confirm the effectiveness and compatibility of the framework, demonstrating its capacity to enhance various representative baselines in a model-agnostic manner. To facilitate reproducible research, our open-sourced code and detailed running logs will be made available at: https://github.com/salmon1802/TF4CTR.
- J. Zhu, J. Liu, S. Yang, Q. Zhang, and X. He, “Open benchmarking for click-through rate prediction,” in Proceedings of the 30th ACM International Conference on Information & Knowledge Management, 2021, pp. 2759–2769.
- J. Zhu, Q. Dai, L. Su, R. Ma, J. Liu, G. Cai, X. Xiao, and R. Zhang, “Bars: Towards open benchmarking for recommender systems,” in Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2022, pp. 2912–2923.
- R. Wang, R. Shivanna, D. Cheng, S. Jain, D. Lin, L. Hong, and E. Chi, “Dcnv2: Improved deep & cross network and practical lessons for web-scale learning to rank systems,” in Proceedings of the Web Conference 2021, 2021, pp. 1785–1797.
- H.-T. Cheng, L. Koc, J. Harmsen, T. Shaked, T. Chandra, H. Aradhye, G. Anderson, G. Corrado, W. Chai, M. Ispir et al., “Wide & deep learning for recommender systems,” in Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, 2016, pp. 7–10.
- B. Chen, Y. Wang, Z. Liu, R. Tang, W. Guo, H. Zheng, W. Yao, M. Zhang, and X. He, “Enhancing explicit and implicit feature interactions via information sharing for parallel deep ctr models,” in Proceedings of the 30th ACM International Conference on Information & Knowledge Management, 2021, pp. 3757–3766.
- H. Guo, R. Tang, Y. Ye, Z. Li, and X. He, “Deepfm: A factorization-machine based neural network for ctr prediction,” in Proceedings of the 26th International Joint Conference on Artificial Intelligence, ser. IJCAI’17. AAAI Press, 2017, p. 1725–1731.
- R. Wang, B. Fu, G. Fu, and M. Wang, “Deep & cross network for ad click predictions,” in Proceedings of the ADKDD’17, 2017, pp. 1–7.
- X. He and T.-S. Chua, “Neural factorization machines for sparse predictive analytics,” in Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017, pp. 355–364.
- W. Cheng, Y. Shen, and L. Huang, “Adaptive factorization network: Learning adaptive-order feature interactions,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 04, 2020, pp. 3609–3616.
- Y. Qu, H. Cai, K. Ren, W. Zhang, Y. Yu, Y. Wen, and J. Wang, “Product-based neural networks for user response prediction,” in 2016 IEEE 16th International Conference on Data Mining (ICDM). IEEE, 2016, pp. 1149–1154.
- Y. Qu, B. Fang, W. Zhang, R. Tang, M. Niu, H. Guo, Y. Yu, and X. He, “Product-based neural networks for user response prediction over multi-field categorical data,” ACM Transactions on Information Systems (TOIS), vol. 37, no. 1, pp. 1–35, 2018.
- G. Zhou, X. Zhu, C. Song, Y. Fan, H. Zhu, X. Ma, Y. Yan, J. Jin, H. Li, and K. Gai, “Deep interest network for click-through rate prediction,” in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018, pp. 1059–1068.
- Y. Feng, F. Lv, W. Shen, M. Wang, F. Sun, Y. Zhu, and K. Yang, “Deep session interest network for click-through rate prediction,” in Proceedings of the 28th International Joint Conference on Artificial Intelligence, 2019, pp. 2301–2307.
- F. Wang, H. Gu, D. Li, T. Lu, P. Zhang, and N. Gu, “Towards deeper, lighter and interpretable cross network for ctr prediction,” in Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023, pp. 2523–2533.
- F. Wang, Y. Wang, D. Li, H. Gu, T. Lu, P. Zhang, and N. Gu, “Cl4ctr: A contrastive learning framework for ctr prediction,” in Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023, pp. 805–813.
- K. Mao, J. Zhu, L. Su, G. Cai, Y. Li, and Z. Dong, “Finalmlp: An enhanced two-stream mlp model for ctr prediction,” Proceedings of the AAAI Conference on Artificial Intelligence, 37(4), 4552-4560., 2023.
- J. Zhu, Q. Jia, G. Cai, Q. Dai, J. Li, Z. Dong, R. Tang, and R. Zhang, “Final: Factorized interaction layer for ctr prediction,” in Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023, pp. 2006–2010.
- A. Shrivastava, A. Gupta, and R. Girshick, “Training region-based object detectors with online hard example mining,” in Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 2016, pp. 761–769.
- T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollar, “Focal loss for dense object detection,” in Proceedings of the IEEE International Conference on Computer Vision (ICCV), Oct 2017, pp. 2980–2988.
- W. Song, C. Shi, Z. Xiao, Z. Duan, Y. Xu, M. Zhang, and J. Tang, “Autoint: Automatic feature interaction learning via self-attentive neural networks,” in Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019, pp. 1161–1170.
- Q. Liu, X. Hou, D. Lian, Z. Wang, H. Jin, J. Cheng, and J. Lei, “At4ctr: Auxiliary match tasks for enhancing click-through rate prediction,” in Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 8, 2024, pp. 8787–8795.
- Z. Li, Z. Cui, S. Wu, X. Zhang, and L. Wang, “Fignn: Modeling feature interactions via graph neural networks for ctr prediction,” in Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019, pp. 539–548.
- H. Li, L. Sang, Y. Zhang, X. Zhang, and Y. Zhang, “Cetn: Contrast-enhanced through network for ctr prediction,” arXiv preprint arXiv:2312.09715, 2023.
- H. Fei, J. Zhang, X. Zhou, J. Zhao, X. Qi, and P. Li, “Gemnn: Gating-enhanced multi-task neural networks with feature interaction learning for ctr prediction,” in Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021, pp. 2166–2171.
- J. Ma, Z. Zhao, X. Yi, J. Chen, L. Hong, and E. H. Chi, “Modeling task relationships in multi-task learning with multi-gate mixture-of-experts,” in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018, pp. 1930–1939.
- J. Lian, X. Zhou, F. Zhang, Z. Chen, X. Xie, and G. Sun, “xdeepfm: Combining explicit and implicit feature interactions for recommender systems,” in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018, pp. 1754–1763.
- Z. Tian, T. Bai, W. X. Zhao, J.-R. Wen, and Z. Cao, “Eulernet: Adaptive feature interaction learning via euler’s formula for ctr prediction,” in Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023, p. 1376–1385.
- Barbara and Garcia-Molina, “The reliability of voting mechanisms,” IEEE Transactions on Computers, vol. C-36, no. 10, pp. 1197–1208, 1987.
- E. Jang, S. Gu, and B. Poole, “Categorical reparameterization with gumbel-softmax,” stat, vol. 1050, p. 5, 2017.
- L. Baltrunas, K. Church, A. Karatzoglou, and N. Oliver, “Frappe: Understanding the usage and perception of mobile app recommendations in-the-wild,” arXiv preprint arXiv:1505.03014, 2015.
- W. Wang, F. Feng, X. He, L. Nie, and T.-S. Chua, “Denoising implicit feedback for recommendation,” in Proceedings of the 14th ACM International Conference on Web Search and Data Mining, 2021, pp. 373–381.
- C. Zhu, P. Du, W. Zhang, Y. Yu, and Y. Cao, “Combo-fashion: Fashion clothes matching ctr prediction with item history,” in Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022, pp. 4621–4629.
- A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga et al., “Pytorch: An imperative style, high-performance deep learning library,” Advances in Neural Information Processing Systems, vol. 32, 2019.
- Huawei, “An open-source CTR prediction library,” https://fuxictr.github.io, 2021.
- D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.
- Z. Lin, J. Pan, S. Zhang, X. Wang, X. Xiao, S. Huang, L. Xiao, and J. Jiang, “Understanding the ranking loss for recommendation with sparse user feedback,” arXiv preprint arXiv:2403.14144, 2024.
- N. Mehrabi, F. Morstatter, N. Saxena, K. Lerman, and A. Galstyan, “A survey on bias and fairness in machine learning,” ACM Computing Surveys (CSUR), vol. 54, no. 6, pp. 1–35, 2021.
- W. Guo, C. Zhang, Z. He, J. Qin, H. Guo, B. Chen, R. Tang, X. He, and R. Zhang, “Miss: Multi-interest self-supervised learning framework for click-through rate prediction,” in 2022 IEEE 38th International Conference on Data Engineering (ICDE). IEEE, 2022, pp. 727–740.
- Y. Li, X. Guo, W. Lin, M. Zhong, Q. Li, Z. Liu, W. Zhong, and Z. Zhu, “Learning dynamic user interest sequence in knowledge graphs for click-through rate prediction,” IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 1, pp. 647–657, 2021.
- N. Xue, B. Liu, H. Guo, R. Tang, F. Zhou, S. Zafeiriou, Y. Zhang, J. Wang, and Z. Li, “Autohash: Learning higher-order feature interactions for deep ctr prediction,” IEEE Transactions on Knowledge and Data Engineering, vol. 34, no. 6, pp. 2653–2666, 2020.
- C. Zhu, B. Chen, W. Zhang, J. Lai, R. Tang, X. He, Z. Li, and Y. Yu, “Aim: Automatic interaction machine for click-through rate prediction,” IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 4, pp. 3389–3403, 2023.
- M. Gao, J.-Y. Li, C.-H. Chen, Y. Li, J. Zhang, and Z.-H. Zhan, “Enhanced multi-task learning and knowledge graph-based recommender system,” IEEE Transactions on Knowledge and Data Engineering, 2023.
- Z. Wang, Q. She, and J. Zhang, “Masknet: Introducing feature-wise multiplication to ctr ranking models by instance-guided mask,” arXiv preprint arXiv:2102.07619, 2021.
- J. Xiao, H. Ye, X. He, H. Zhang, F. Wu, and T.-S. Chua, “Attentional factorization machines: learning the weight of feature interactions via attention networks,” in Proceedings of the 26th International Joint Conference on Artificial Intelligence, 2017, pp. 3119–3125.
- F. Wang, Y. Wang, D. Li, H. Gu, T. Lu, P. Zhang, and N. Gu, “Enhancing ctr prediction with context-aware feature representation learning,” in Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2022, pp. 343–352.
- S. Rendle, W. Krichene, L. Zhang, and J. Anderson, “Neural collaborative filtering vs. matrix factorization revisited,” in Proceedings of the 14th ACM Conference on Recommender Systems, 2020, pp. 240–248.
- A. Bai, R. Jagerman, Z. Qin, L. Yan, P. Kar, B.-R. Lin, X. Wang, M. Bendersky, and M. Najork, “Regression compatible listwise objectives for calibrated ranking with binary relevance,” in Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023, pp. 4502–4508.
- X.-R. Sheng, J. Gao, Y. Cheng, S. Yang, S. Han, H. Deng, Y. Jiang, J. Xu, and B. Zheng, “Joint optimization of ranking and calibration with contextualized hybrid model,” in Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023, pp. 4813–4822.
- J. Yu, H. Yin, X. Xia, T. Chen, L. Cui, and Q. V. H. Nguyen, “Are graph augmentations necessary? simple graph contrastive learning for recommendation,” in Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2022, pp. 1294–1303.
- J. Yu, X. Xia, T. Chen, L. Cui, N. Q. V. Hung, and H. Yin, “Xsimgcl: Towards extremely simple graph contrastive learning for recommendation,” IEEE Transactions on Knowledge and Data Engineering, pp. 1–14, 2023.