Field-weighted Factorization Machines for Click-Through Rate Prediction in Display Advertising (1806.03514v2)

Published 9 Jun 2018 in cs.LG and stat.ML

Abstract: Click-through rate (CTR) prediction is a critical task in online display advertising. The data involved in CTR prediction are typically multi-field categorical data, i.e., every feature is categorical and belongs to one and only one field. One of the interesting characteristics of such data is that features from one field often interact differently with features from different other fields. Recently, Field-aware Factorization Machines (FFMs) have been among the best performing models for CTR prediction by explicitly modeling such difference. However, the number of parameters in FFMs is in the order of feature number times field number, which is unacceptable in the real-world production systems. In this paper, we propose Field-weighted Factorization Machines (FwFMs) to model the different feature interactions between different fields in a much more memory-efficient way. Our experimental evaluations show that FwFMs can achieve competitive prediction performance with only as few as 4% parameters of FFMs. When using the same number of parameters, FwFMs can bring 0.92% and 0.47% AUC lift over FFMs on two real CTR prediction data sets.

Citations (181)

View on Semantic Scholar

Summary

The paper presents FwFMs, a novel approach that uses a field pair weight matrix to model interactions efficiently for CTR prediction.
The method achieves high accuracy with only 4% of the parameters required by FFMs and offers up to a 0.9% AUC improvement.
Experiments on Criteo and Oath datasets demonstrate FwFMs’ superior performance over traditional models while maintaining practical memory efficiency.

Field-weighted Factorization Machines for Click-Through Rate Prediction

The paper presents a novel approach, Field-weighted Factorization Machines (FwFMs), for predicting click-through rates (CTR) in online display advertising. This task is critical due to its impact on optimizing advertising strategies and improving user engagement.

CTR Prediction and Data Challenges

CTR prediction relies heavily on analyzing multi-field categorical data, which poses several challenges. These include managing sparse data due to a large number of features, understanding complex feature interactions, and the need to efficiently handle potentially high model complexity in real-world applications. Traditionally, models like Logistic Regression (LR) fail to account for feature interaction nuances, while Polynomial-2 models provide a broader but less precise framework for interaction modeling. Factorization Machines (FMs) improve interaction modeling through embedding features, but they do not account for field-specific interactions.

Existing Models and Limitations

Field-aware Factorization Machines (FFMs) were developed to overcome the limitations of FMs by using field-specific embedding vectors to better capture interaction differences across fields. Despite their improved performance, FFMs suffer from excessive model complexity due to the high number of parameters, making them less viable for production systems where memory efficiency is paramount.

Introduction to FwFMs

Field-weighted Factorization Machines address these challenges by introducing a field pair weight matrix, allowing for the explicit modeling of varying interaction strengths between different fields in a memory-efficient manner. This weight matrix significantly reduces the number of parameters required, making FwFMs usable in production environments while offering competitive predictive performance. Notably, FwFMs achieve high accuracy with only 4% of the parameters needed by FFMs.

Experimental Findings

The experimental analysis provided in the paper compares FwFMs with existing models such as FMs, FFMs, LR, and Poly2 across two real-world CTR datasets—Criteo and Oath CTR datasets. When evaluated, FwFMs demonstrated superior performance over LR, Poly2, and FMs, and competitive results compared to FFMs, especially when considering model complexity and generalization capabilities. Specifically, FwFMs offered up to a 0.9% AUC improvement over FFMs when parameter counts were equalized.

Implications and Future Work

FwFMs represent a significant development in CTR prediction by efficiently capturing field-specific interactions, suggesting practical improvements for deployment in real-world advertising systems. The introduction of the field pair weight matrix highlights the importance of field interaction heterogeneity in machine learning models, providing a pathway for future algorithms to incorporate similarly specialized modeling approaches.

Furthermore, the paper suggests future extensions, such as integrating FwFMs with deep learning frameworks, which have shown promise in other prediction tasks. Exploring these directions could yield models that further enhance prediction accuracy and system responsiveness while maintaining efficient resource usage.

Overall, the contribution of FwFMs to the domain of computational advertising establishes a foundation for improved CTR prediction models that balance performance with operational feasibility.

PDF Markdown