
JDRec: Practical Actor-Critic Framework for Online Combinatorial Recommender System (2207.13311v1)

Published 27 Jul 2022 in cs.IR, cs.AI, and cs.LG

Abstract: A combinatorial recommender (CR) system presents a list of items to a user at once on the result page, where user behavior is affected by both contextual information and the items themselves. CR is formulated as a combinatorial optimization problem whose objective is to maximize the recommendation reward of the whole list. Despite its importance, building a practical CR system remains a challenge because of the efficiency, dynamics, and personalization requirements of an online environment. In particular, we split the problem into two sub-problems: list generation and list evaluation. Novel and practical model architectures are designed for these sub-problems, aiming to jointly optimize effectiveness and efficiency. To adapt to the online case, a bootstrap algorithm forming an actor-critic reinforcement framework is given to explore better recommendation modes in long-term user interaction. Offline and online experimental results demonstrate the efficacy of the proposed JDRec framework. JDRec has been applied in online JD recommendation, improving click-through rate by 2.6% and synthetical value for the platform by 5.03%. We will publish the large-scale dataset used in this study to contribute to the research community.
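
The split into list generation and list evaluation can be illustrated with a toy sketch. This is not the JDRec architecture: the function names, the exhaustive enumeration standing in for the generator (actor), and the position-weighted sum standing in for the evaluator (critic) are all assumptions made for illustration, and the actor-critic training loop is omitted entirely.

```python
from itertools import permutations
from typing import Dict, List, Sequence


def generate_lists(candidates: Sequence[str], list_size: int) -> List[List[str]]:
    """Hypothetical generator: enumerate every ordered list of `list_size` items.

    Exhaustive enumeration is only feasible for tiny inputs; a practical
    system would use a learned model to propose a few candidate lists.
    """
    return [list(p) for p in permutations(candidates, list_size)]


def evaluate_list(
    items: Sequence[str],
    item_score: Dict[str, float],
    position_weight: Sequence[float],
) -> float:
    """Hypothetical evaluator: score a whole list as a position-weighted sum."""
    return sum(position_weight[i] * item_score[item] for i, item in enumerate(items))


def recommend(
    candidates: Sequence[str],
    item_score: Dict[str, float],
    position_weight: Sequence[float],
) -> List[str]:
    """Generate candidate lists, then return the one the evaluator scores highest."""
    lists = generate_lists(candidates, len(position_weight))
    return max(lists, key=lambda lst: evaluate_list(lst, item_score, position_weight))


if __name__ == "__main__":
    scores = {"a": 0.1, "b": 0.5, "c": 0.3}  # made-up per-item scores
    weights = [1.0, 0.5]                     # made-up position weights
    print(recommend(list(scores), scores, weights))
```

The point of the sketch is only that the reward is assigned to the list as a whole, so the selection step compares complete lists rather than ranking items independently.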

Authors (6)
  1. Xin Zhao (160 papers)
  2. Zhiwei Fang (50 papers)
  3. Yuchen Guo (70 papers)
  4. Jie He (50 papers)
  5. Wenlong Chen (15 papers)
  6. Changping Peng (18 papers)
