Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising

02/27/2018
by   Junqi Jin, et al.
0

Real-time advertising allows advertisers to bid for each impression for a visiting user. To optimize a specific goal such as maximizing the revenue led by ad placements, advertisers not only need to estimate the relevance between the ads and user's interests, but most importantly require a strategic response with respect to other advertisers bidding in the market. In this paper, we formulate bidding optimization with multi-agent reinforcement learning. To deal with a large number of advertisers, we propose a clustering method and assign each cluster with a strategic bidding agent. A practical Distributed Coordinated Multi-Agent Bidding (DCMAB) has been proposed and implemented to balance the tradeoff between the competition and cooperation among advertisers. The empirical study on our industry-scaled real-world data has demonstrated the effectiveness of our modeling methods. Our results show that a cluster based bidding would largely outperform single-agent and bandit approaches, and the coordinated bidding achieves better overall objectives than the purely self-interested bidding agents.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2021

A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising

In online advertising, auto-bidding has become an essential tool for adv...
research
09/10/2022

Cooperation and Competition: Flocking with Evolutionary Multi-Agent Reinforcement Learning

Flocking is a very challenging problem in a multi-agent system; traditio...
research
09/10/2018

A Multi-Agent Reinforcement Learning Method for Impression Allocation in Online Display Advertising

In online display advertising, guaranteed contracts and real-time biddin...
research
09/29/2018

M^3RL: Mind-aware Multi-agent Management Reinforcement Learning

Most of the prior work on multi-agent reinforcement learning (MARL) achi...
research
03/08/2023

Strategic Planning for Flexible Agent Availability in Large Taxi Fleets

In large-scale multi-agent systems like taxi fleets, individual agents (...
research
03/11/2022

Impression Allocation and Policy Search in Display Advertising

In online display advertising, guaranteed contracts and real-time biddin...
research
08/18/2017

LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions

We present LADDER, the first deep reinforcement learning agent that can ...

Please sign up or login with your details

Forgot password? Click here to reset