Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with Partially Observable Opponents

02/28/2019
by Manxing Du, et al.

Real-time bidding, one of the most popular mechanisms for selling online ad slots, helps advertisers reach their potential customers. The goal of bidding optimization is to maximize the advertisers' return on investment (ROI) under a given budget. A straightforward solution is to model the bidding function in an explicit form. However, such static functional solutions lack generality in practice and are insensitive to the stochastic behaviour of other bidders in the environment. In this paper, we propose a general multi-agent framework with actor-critic solutions for playing imperfect-information games. We first introduce a novel Deep Attentive Survival Analysis (DASA) model to infer the censored data in second-price auctions, which outperforms state-of-the-art survival analysis. Furthermore, our approach introduces the DASA model as the opponent model into the policy learning process for each agent and develops a mean field equilibrium analysis of second-price auctions. The experiments show that, with inference of the market, the market converges to the equilibrium much faster when playing against both fixed-strategy agents and dynamic learning agents.
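To make the censoring problem mentioned in the abstract concrete, the following is a minimal sketch and not the DASA model itself. It assumes a single bidder facing one "market price" (the highest competing bid) and assumes that ties are lost. It uses the classical Kaplan-Meier estimator as a stand-in for a survival-analysis model. The function names `second_price_auction` and `kaplan_meier_win_prob` are hypothetical and chosen only for illustration.

```python
def second_price_auction(my_bid, market_price):
    """Return (won, cost, observation) for one bidder in a second-price auction.

    market_price is the highest competing bid (an assumption of this sketch).
    """
    if my_bid > market_price:
        # The winner pays the second-highest bid, so the market price is observed.
        return True, market_price, ("observed", market_price)
    # The loser only learns that the market price was at least its own bid,
    # i.e. the market price is right-censored at my_bid.
    return False, 0.0, ("censored", my_bid)


def kaplan_meier_win_prob(records, bid):
    """Estimate P(market price < bid) from (value, is_observed) records.

    Observed records carry the true market price; censored records carry
    only a lower bound on it (the losing bid).
    """
    survival = 1.0  # running estimate of P(market price >= bid)
    event_prices = sorted({v for v, obs in records if obs and v < bid})
    for price in event_prices:
        # Auctions whose market price is known to be at least `price`.
        at_risk = sum(1 for v, _ in records if v >= price)
        # Auctions whose market price was observed to be exactly `price`.
        events = sum(1 for v, obs in records if obs and v == price)
        survival *= 1.0 - events / at_risk
    return 1.0 - survival


# Toy log: three won auctions (prices 1, 2, 4) and one lost auction with bid 3.
records = [(1, True), (2, True), (3, False), (4, True)]
win_prob_at_3 = kaplan_meier_win_prob(records, 3)
```

Dropping the censored record and using only the winning prices would estimate the win probability at a bid of 3 as 2/3, because the lost auction, whose market price was at least 3, is ignored. Estimators that account for censoring correct this kind of bias.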


