Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Generative Retrieval with Preference Optimization for E-commerce Search (2407.19829v2)

Published 29 Jul 2024 in cs.IR and cs.AI

Abstract: Generative retrieval introduces a groundbreaking paradigm to document retrieval by directly generating the identifier of a pertinent document in response to a specific query. This paradigm has demonstrated considerable benefits and potential, particularly in representation and generalization capabilities, within the context of LLMs. However, it faces significant challenges in E-commerce search scenarios, including the complexity of generating detailed item titles from brief queries, the presence of noise in item titles with weak language order, issues with long-tail queries, and the interpretability of results. To address these challenges, we have developed an innovative framework for E-commerce search, called generative retrieval with preference optimization. This framework is designed to effectively learn and align an autoregressive model with target data, subsequently generating the final item through constraint-based beam search. By employing multi-span identifiers to represent raw item titles and transforming the task of generating titles from queries into the task of generating multi-span identifiers from queries, we aim to simplify the generation process. The framework further aligns with human preferences using click data and employs a constrained search method to identify key spans for retrieving the final item, thereby enhancing result interpretability. Our extensive experiments show that this framework achieves competitive performance on a real-world dataset, and online A/B tests demonstrate the superiority and effectiveness in improving conversion gains.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Mingming Li (17 papers)
  2. Huimu Wang (6 papers)
  3. Zuxu Chen (2 papers)
  4. Guangtao Nie (3 papers)
  5. Yiming Qiu (37 papers)
  6. Guoyu Tang (12 papers)
  7. Lin Liu (190 papers)
  8. Jingwei Zhuo (12 papers)
Citations (2)
X Twitter Logo Streamline Icon: https://streamlinehq.com