
Sequential Search with Off-Policy Reinforcement Learning (2202.00245v1)

Published 1 Feb 2022 in cs.IR and cs.LG

Abstract: Recent years have seen a significant amount of interest in Sequential Recommendation (SR), which aims to understand and model sequential user behaviors and the interactions between users and items over time. Surprisingly, despite the huge success Sequential Recommendation has achieved, there is little study of Sequential Search (SS), a twin learning task that takes into account a user's current and past search queries, in addition to behavior in historical query sessions. The SS learning task is even more important than its SR counterpart for most e-commerce companies because of its much larger online serving demands and traffic volume. To this end, we propose a highly scalable hybrid learning model that consists of an RNN learning framework leveraging all features in short-term user-item interactions, and an attention model utilizing selected item-only features from long-term interactions. As a novel optimization step, we fit multiple short user sequences into a single RNN pass within a training batch by solving a greedy knapsack problem on the fly. Moreover, we explore the use of off-policy reinforcement learning in multi-session personalized search ranking. Specifically, we design a pairwise Deep Deterministic Policy Gradient model that efficiently captures users' long-term reward in terms of pairwise classification error. Extensive ablation experiments demonstrate the significant improvement each component brings over its state-of-the-art baseline on a variety of offline and online metrics.
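The packing step mentioned in the abstract can be illustrated with a small sketch. The abstract does not spell out the procedure, so everything here is an assumption: the function name `pack_sequences`, the `capacity` parameter (the number of time steps in one RNN pass), and the choice of a first-fit-decreasing heuristic as the greedy rule are all hypothetical and are not taken from the paper.

```python
from typing import List


def pack_sequences(lengths: List[int], capacity: int) -> List[List[int]]:
    """Greedily group sequence indices into rows of at most `capacity` steps.

    Hypothetical illustration only: longest sequences are placed first, and
    each one goes into the first row that still has room (first-fit-decreasing).
    """
    if capacity <= 0:
        raise ValueError("capacity must be positive")

    # Visit sequences from longest to shortest.
    order = sorted(range(len(lengths)), key=lambda i: lengths[i], reverse=True)

    rows: List[List[int]] = []   # sequence indices assigned to each row
    remaining: List[int] = []    # free time steps left in each row

    for i in order:
        # Assumption: a sequence longer than `capacity` is truncated to fit.
        needed = min(lengths[i], capacity)
        for r, free in enumerate(remaining):
            if needed <= free:
                rows[r].append(i)
                remaining[r] -= needed
                break
        else:
            # No existing row has room, so open a new one.
            rows.append([i])
            remaining.append(capacity - needed)
    return rows


if __name__ == "__main__":
    # Five user sequences of lengths 5, 2, 3, 1, 4 packed into rows of 6 steps.
    print(pack_sequences([5, 2, 3, 1, 4], capacity=6))
```

With this heuristic, five sequences occupy three rows instead of five, which is the kind of saving that packing short sequences into one pass is meant to deliver. A real training pipeline would also need to reset or mask the RNN hidden state at each sequence boundary inside a row; that part is omitted here.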

Authors (9)
  1. Dadong Miao (5 papers)
  2. Yanan Wang (68 papers)
  3. Guoyu Tang (12 papers)
  4. Lin Liu (190 papers)
  5. Sulong Xu (23 papers)
  6. Bo Long (60 papers)
  7. Yun Xiao (33 papers)
  8. Lingfei Wu (135 papers)
  9. Yunjiang Jiang (22 papers)
Citations (1)
