Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
126 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

E-commerce in Your Inbox: Product Recommendations at Scale (1606.07154v1)

Published 23 Jun 2016 in cs.AI, cs.IR, and cs.SI

Abstract: In recent years online advertising has become increasingly ubiquitous and effective. Advertisements shown to visitors fund sites and apps that publish digital content, manage social networks, and operate e-mail services. Given such large variety of internet resources, determining an appropriate type of advertising for a given platform has become critical to financial success. Native advertisements, namely ads that are similar in look and feel to content, have had great success in news and social feeds. However, to date there has not been a winning formula for ads in e-mail clients. In this paper we describe a system that leverages user purchase history determined from e-mail receipts to deliver highly personalized product ads to Yahoo Mail users. We propose to use a novel neural language-based algorithm specifically tailored for delivering effective product recommendations, which was evaluated against baselines that included showing popular products and products predicted based on co-occurrence. We conducted rigorous offline testing using a large-scale product purchase data set, covering purchases of more than 29 million users from 172 e-commerce websites. Ads in the form of product recommendations were successfully tested on online traffic, where we observed a steady 9% lift in click-through rates over other ad formats in mail, as well as comparable lift in conversion rates. Following successful tests, the system was launched into production during the holiday season of 2014.

Citations (304)

Summary

  • The paper introduces a neural language model that transforms email receipt data into personalized product recommendations, achieving a 9% increase in CTR and similar conversion lifts.
  • The paper details a robust machine learning pipeline processing data from over 29 million users and 2.1 million products, delivering real-time recommendations in under 200 milliseconds.
  • The paper tackles the cold-start challenge with a hybrid approach that combines personalized user data and demographic trends to enhance recommendation relevance across diverse user groups.

System for Scalable Product Recommendations in Yahoo Mail

The paper "E-commerce in Your Inbox: Product Recommendations at Scale" presents a structured approach to delivering personalized product recommendations by leveraging user purchase history extracted from e-mail receipts. This method was developed and implemented by researchers from Yahoo Labs and Yahoo Inc. in response to the need for enhanced advertisement formats within Yahoo Mail, a platform dealing with significant daily user traffic. Given the vast amounts of user engagement, their primary challenge was to incentivize user interaction with advertisements, specifically within the focused action of navigating an inbox, which inherently limits attention to ad placements.

The authors have proposed an innovative system that deploys a neural language-based algorithm for product recommendation. This method diverges from traditional approaches that either rely on popular product advertisements or merely suggest products based on co-occurrence in historical data. The paper details a robust machine learning pipeline applied to an extensive dataset encompassing over 29 million users and 2.1 million unique products, sourced from 172 e-commerce sites. These scalable solutions yielded a notable 9% improvement in click-through rates (CTR) and a comparable lift in conversion rates upon deployment in live traffic tests, highlighting the practical effectiveness of the neural network-generated recommendations.

The heart of the methodology revolves around adapting neural LLMs traditionally used in natural language processing tasks to process user purchase sequences in e-mail receipts. This is conceptualized through models termed as prod2vec and bagged-prod2vec. These models render product purchase sequences into low-dimensional vector spaces where similar purchase contexts are naturally clustered together. The clustering of such product vectors allows for transition probability modeling, further enhancing the diversity of recommendations by allowing cross-vendor product suggestions, subsequently improving predictive capabilities across vendors. Moreover, the proposed approach was benchmarked against a series of standard baselines, confirming its superior predictive performance through meticulous offline testing and live A/B testing.

Additionally, the paper addresses the cold-start problem inherent in product recommendation systems by deploying a hybrid approach that supplements user-specific suggestions with the global popularity of products within demographic cohorts. This dual approach effectively accommodates users with varied levels of historical purchase activity.

In terms of implications, the presented work offers a significant contribution to computational advertising and user engagement within e-mail services. It not only demonstrates an economically viable advertisement strategy for digital platforms but also sets a precedent for how user interaction data can be harnessed to align promotional content with user interests more effectively. Crucially, the infrastructure of deploying such models in near real-time conditions, with processing times under 200 milliseconds, showcases an advanced level of system optimization—paving the way for more sophisticated real-time recommender systems in the industry.

Looking forward, the system's potential can be further explored by integrating more refined user behavior signals, possibly incorporating additional contextual features and feedback mechanisms. The user and product interactions derived from other domains—social media, search habits, or even offline behavior—could present new layers of personalization and engagement. Moreover, future work might involve enhancing the dynamism of user profiles, enabling adaptive learning that accounts for evolving consumer trends and interests beyond historical purchases alone.

In conclusion, this paper elucidates a formalized and technically adept method for scaling personalized product recommendations in an e-mail client, achieving substantial gains in advertisement engagement and providing a substantial contribution to the domain of personalized advertising systems.

Youtube Logo Streamline Icon: https://streamlinehq.com