Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MoTiAC: Multi-Objective Actor-Critics for Real-Time Bidding (2002.07408v2)

Published 18 Feb 2020 in cs.AI and cs.LG

Abstract: Online Real-Time Bidding (RTB) is a complex auction game among which advertisers struggle to bid for ad impressions when a user request occurs. Considering display cost, Return on Investment (ROI), and other influential Key Performance Indicators (KPIs), large ad platforms try to balance the trade-off among various goals in dynamics. To address the challenge, we propose a Multi-ObjecTive Actor-Critics algorithm based on reinforcement learning (RL), named MoTiAC, for the problem of bidding optimization with various goals. In MoTiAC, objective-specific agents update the global network asynchronously with different goals and perspectives, leading to a robust bidding policy. Unlike previous RL models, the proposed MoTiAC can simultaneously fulfill multi-objective tasks in complicated bidding environments. In addition, we mathematically prove that our model will converge to Pareto optimality. Finally, experiments on a large-scale real-world commercial dataset from Tencent verify the effectiveness of MoTiAC versus a set of recent approaches

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Haolin Zhou (3 papers)
  2. Chaoqi Yang (17 papers)
  3. Xiaofeng Gao (53 papers)
  4. Qiong Chen (21 papers)
  5. Gongshen Liu (37 papers)
  6. Guihai Chen (74 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.