- The paper introduces DPSR, a deep learning system that improves e-commerce search by effectively modeling user intent and semantic relevance.
- It employs a two-tower neural network with an attention-based loss function and leverages extensive click log data for robust candidate retrieval and ranking.
- Online evaluations at JD.com demonstrate significant improvements in conversion rates for long-tail queries, validating the system’s practical impact.
Overview of DPSR System
The JD.com research team has developed an innovative solution, designated as DPSR (Deep Personalized and Semantic Retrieval), which addresses significant e-commerce search challenges. The main hurdles faced in current search environments are twofold: retrieving items that may not exactly match the search query terms but are semantically relevant and tailoring search results to the unique profiles of individual users.
System Design
DPSR is architected to effectively serve large-scale e-commerce traffic and is trained to learn item and query semantics with high accuracy. The system comprises three fundamental components: a query processing unit that interprets user queries, a candidate retrieval mechanism using sophisticated indexing, and a ranking system sorting these candidates based on various metrics.
The focus of the team's effort lies in enhancing the candidate retrieval stage, with the aim of delivering more personalized and semantically relevant search results. This is particularly crucial for e-commerce platforms like JD.com, which need precise and individualized search functionalities to serve millions of active users and process transactions amounting to billions of dollars.
Methodology and Innovations
The researchers developed a neural network model utilizing a two-tower architecture, multi-head query tower design, attention-based loss function, and a negative sampling approach. This model is trained using significant human supervision and large-scale click log data from the platform. For implementation, the team significantly customized TensorFlow, a popular machine learning framework, to ensure consistency in online and offline scenarios, optimize input data storage, and facilitate scalable distributed training.
Results and Deployment
The DPSR system has been methodically evaluated both offline and online. In offline experiments, the system demonstrated superior capabilities to retrieve semantically relevant items. It particularly excelled in handling long-tail queries, which pose a challenge for traditional retrieval systems. Online A/B tests conducted in live environments showcased that DPSR could improve the e-commerce platform's search experience, leading to a marked increase in conversion rates for long-tail queries.
Given the successful deployment of DPSR within JD.com’s search functionality since 2019, the solution presents an advanced example of neural network-based retrieval in action. It stands as an embodiment of cutting-edge e-commerce search technology, capable of nuanced semantic understanding and individual user catering.