
Hi-Gen: Generative Retrieval For Large-Scale Personalized E-commerce Search (2404.15675v2)

Published 24 Apr 2024 in cs.IR

Abstract: Leveraging generative retrieval (GR) techniques to enhance search systems is an emerging methodology that has shown promising results in recent years. In GR, a text-to-text model maps string queries directly to relevant document identifiers (docIDs), dramatically simplifying the retrieval process. However, when applying most GR models in large-scale E-commerce for personalized item search, we must face two key problems in encoding and decoding. (1) Existing docID generation methods ignore the encoding of efficiency information, which is critical in E-commerce. (2) The positional information is important in decoding docIDs, while prior studies have not adequately discriminated the significance of positional information or well exploited the inherent interrelation among these positions. To overcome these problems, we introduce an efficient Hierarchical encoding-decoding Generative retrieval method (Hi-Gen) for large-scale personalized E-commerce search systems. Specifically, we first design a representation learning model using metric learning to learn discriminative feature representations of items to capture semantic relevance and efficiency information. Then, we propose a category-guided hierarchical clustering scheme that makes full use of the semantic and efficiency information of items to facilitate docID generation. Finally, we design a position-aware loss to discriminate the importance of positions and mine the inherent interrelation between different tokens at the same position. This loss boosts the performance of the LLM used in the decoding stage. Besides, we propose two variants of Hi-Gen (Hi-Gen-I2I and Hi-Gen-Cluster) to support online real-time large-scale recall in the online serving process. Hi-Gen gets 3.30% and 4.62% improvements over SOTA for Recall@1 on the public and industry datasets, respectively.

Enhancing Large-Scale Personalized E-commerce Search with Hierarchical Generative Retrieval

Introduction

Recent advancements in Generative Retrieval (GR) techniques have shown promising directions for enhancing search systems, but applying these methods to large-scale E-commerce search systems introduces unique challenges in encoding and decoding document identifiers (docIDs). The paper presents a novel method, Hierarchical encoding-decoding Generative retrieval (Hi-Gen), which addresses several shortcomings of existing generative retrieval models in the context of large-scale personalized E-commerce searches.

Key Methodological Innovations

Representation Learning and Metric Learning:

Hi-Gen introduces a dual-component model to capture both semantic relevance and efficiency information of items:

  • Representation Learning: This component generates discriminative embeddings that integrate semantic, common, and efficiency-based features of items within a multi-task learning framework.
  • Metric Learning: After the initial embeddings are produced, a metric learning module refines them so that similar items cluster closely in the embedding space, using a triplet margin loss that accounts for both user interactions and item efficiency (see the sketch after this list).
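
A minimal PyTorch sketch of the kind of triplet margin objective described above. The function name, the margin value, and the way efficiency enters the choice of positives and negatives are illustrative assumptions, not the paper's exact formulation (PyTorch also provides nn.TripletMarginLoss for the standard case).

```python
# Illustrative triplet margin loss; mining of positives/negatives by user
# interactions and efficiency (e.g. CTR/CVR bucket) is an assumption here.
import torch
import torch.nn.functional as F

def triplet_margin_loss(anchor, positive, negative, margin=0.2):
    """anchor/positive/negative: [batch, dim] item (or query) embeddings.
    Positives are interacted / high-efficiency items; negatives are
    non-interacted or low-efficiency items."""
    d_pos = F.pairwise_distance(anchor, positive)  # distance to relevant item
    d_neg = F.pairwise_distance(anchor, negative)  # distance to irrelevant item
    # Pull positives closer than negatives by at least `margin`.
    return torch.clamp(d_pos - d_neg + margin, min=0.0).mean()

# Usage: embeddings come from the multi-task representation model, e.g.
# loss = triplet_margin_loss(emb_query, emb_clicked_item, emb_sampled_negative)
```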

Category-Guided Hierarchical Clustering for docID Generation:

Rather than relying on traditional methods, Hi-Gen uses a unique category-guided hierarchical clustering to generate docIDs. This method leverages the hierarchical category structure inherent to E-commerce platforms to generate meaningful and structured identifiers encompassing both item category and efficiency metrics.
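
The following is a rough sketch, under stated assumptions, of how such category-guided hierarchical docIDs could be constructed: items are grouped by their leaf category, their embeddings are clustered recursively within each category, and the docID is the category prefix followed by the cluster path. The helper names, the use of scikit-learn KMeans, the branching factor, and the leaf size are hypothetical; the paper's exact scheme (including how efficiency information shapes the clusters) may differ.

```python
# Hypothetical category-guided hierarchical clustering for docID generation.
import numpy as np
from sklearn.cluster import KMeans

def assign_docids(embeddings, k=10, max_leaf=100, prefix=()):
    """embeddings: np.ndarray [n_items, dim] for items of ONE leaf category.
    Recursively cluster; the path of cluster indices (plus a within-leaf
    index) becomes each item's docID suffix."""
    ids = {}
    if len(embeddings) <= max_leaf:
        for local_idx in range(len(embeddings)):
            ids[local_idx] = prefix + (local_idx,)
        return ids
    labels = KMeans(n_clusters=k, n_init=10).fit_predict(embeddings)
    for c in range(k):
        member_idx = np.where(labels == c)[0]
        sub = assign_docids(embeddings[member_idx], k, max_leaf, prefix + (c,))
        for local_idx, path in sub.items():
            ids[member_idx[local_idx]] = path
    return ids

def build_docid(category_path, cluster_path):
    # docID = category prefix tokens + cluster path tokens, e.g. "23-7-4-1-0-56".
    return "-".join(str(t) for t in (*category_path, *cluster_path))
```

Because the category prefix comes from the platform's existing taxonomy, items sharing a docID prefix are guaranteed to be semantically related, which is what makes prefix-constrained decoding meaningful.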

Decoding with Position-Aware Loss:

A position-aware loss function is employed to fine-tune the LLM that decodes user queries into docIDs. This loss function emphasizes the positional significance of tokens in the docIDs, aiming to capture the hierarchical importance and inter-token relationships more accurately.
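
A minimal sketch, assuming a simple monotonic position weighting, of what a position-aware token loss can look like in PyTorch. Earlier (coarser) positions of the hierarchical docID receive higher weight; Hi-Gen's actual loss additionally models the interrelation among tokens at the same position, which this sketch does not attempt to reproduce.

```python
# Position-weighted cross-entropy over docID tokens (illustrative only).
import torch
import torch.nn.functional as F

def position_aware_ce(logits, targets, decay=0.8):
    """logits: [batch, seq_len, vocab]; targets: [batch, seq_len] docID tokens."""
    batch, seq_len, vocab = logits.shape
    # Per-token cross-entropy, kept per position rather than averaged away.
    ce = F.cross_entropy(
        logits.reshape(-1, vocab), targets.reshape(-1), reduction="none"
    ).view(batch, seq_len)
    # Higher weight on earlier positions of the hierarchical docID.
    weights = decay ** torch.arange(seq_len, dtype=ce.dtype, device=ce.device)
    return (ce * weights).sum(dim=1).mean()
```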

Experimental Validation

Quantitative Improvements:

Hi-Gen demonstrates significant improvements over state-of-the-art models on standard retrieval metrics. Specifically, the paper reports 3.30% and 4.62% gains in Recall@1 over the best existing models on the public and industry datasets, respectively. Larger gains are reported at deeper cutoffs such as Recall@10, showcasing Hi-Gen's robustness in retrieving relevant documents.
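
For reference, Recall@K in this setting is the fraction of queries for which the relevant item's docID appears among the top-K generated candidates; a toy computation:

```python
# Recall@K over a set of queries, as typically reported for generative retrieval.
def recall_at_k(ranked_docids, relevant_docid, k):
    """ranked_docids: top candidates generated for one query, best first."""
    return 1.0 if relevant_docid in ranked_docids[:k] else 0.0

def mean_recall_at_k(per_query_rankings, relevant_docids, k):
    hits = [recall_at_k(r, g, k) for r, g in zip(per_query_rankings, relevant_docids)]
    return sum(hits) / len(hits)
```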

Ablation Study:

Removing any single component from Hi-Gen (such as the position-aware loss or metric learning) leads to notable performance dips, underlining the significance of each component in the integrated model. The category-guided clustering emerges as particularly critical: its removal results in the most substantial performance decrease.

Zero-Shot and Scalability Tests:

Hi-Gen's generalization capabilities are compelling, particularly in zero-shot learning scenarios where it significantly outperforms other models. The adaptability of Hi-Gen is further validated through scalable online A/B testing on a large-scale E-commerce platform, demonstrating improvements in key business metrics such as Recall, GMV, and conversion rates without incurring additional latency overheads.

Implications and Future Work

The innovation of Hi-Gen lies in its holistic approach, covering feature representation, structured docID generation, and retrieval fine-tuning through position-aware mechanisms. This methodology could set a new standard for personalized search over the massive, dynamic inventories typical of modern E-commerce platforms. Future work could refine the encoding strategies, adapt the model to other personalized recommendation settings, or integrate it with complementary machine learning models to further support decision-making. A deeper analysis of individual features and embedding layers would also clarify how much each model component contributes.

Authors: Yanjing Wu, Yinfu Feng, Jian Wang, Wenji Zhou, Yunan Ye, Rong Xiao, Jun Xiao