Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with Partially Observable Opponents (1902.11212v1)

Published 28 Feb 2019 in cs.GT and cs.MA

Abstract: Real-time bidding, as one of the most popular mechanisms for selling online ad slots, facilitates advertisers to reach their potential customers. The goal of bidding optimization is to maximize the advertisers' return on investment (ROI) under a certain budget setting. A straightforward solution is to model the bidding function in an explicit form. However, the static functional solutions lack generality in practice and are insensitive to the stochastic behaviour of other bidders in the environment. In this paper, we propose a general multi-agent framework with actor-critic solutions facing against playing imperfect information games. We firstly introduce a novel Deep Attentive Survival Analysis (DASA) model to infer the censored data in the second price auctions which outperforms start-of-the-art survival analysis. Furthermore, our approach introduces the DASA model as the opponent model into the policy learning process for each agent and develop a mean field equilibrium analysis of the second price auctions. The experiments have shown that with the inference of the market, the market converges to the equilibrium much faster while playing against both fixed strategy agents and dynamic learning agents.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Manxing Du (2 papers)
  2. Alexander I. Cowen-Rivers (13 papers)
  3. Ying Wen (75 papers)
  4. Phu Sakulwongtana (2 papers)
  5. Jun Wang (991 papers)
  6. Mats Brorsson (12 papers)
  7. Radu State (44 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.