LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions

08/18/2017
by   Yu Wang, et al.
0

We present LADDER, the first deep reinforcement learning agent that can successfully learn control policies for large-scale real-world problems directly from raw inputs composed of high-level semantic information. The agent is based on an asynchronous stochastic variant of DQN (Deep Q Network) named DASQN. The inputs of the agent are plain-text descriptions of states of a game of incomplete information, i.e. real-time large scale online auctions, and the rewards are auction profits of very large scale. We apply the agent to an essential portion of JD's online RTB (real-time bidding) advertising business and find that it easily beats the former state-of-the-art bidding policy that had been carefully engineered and calibrated by human experts: during JD.com's June 18th anniversary sale, the agent increased the company's ads revenue from the portion by more than 50 also improved significantly.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/18/2015

Distributed Deep Q-Learning

We propose a distributed deep learning model to successfully learn contr...
research
03/01/2018

Deep Reinforcement Learning for Sponsored Search Real-time Bidding

Bidding optimization is one of the most critical problems in online adve...
research
11/01/2021

Human-Level Control without Server-Grade Hardware

Deep Q-Network (DQN) marked a major milestone for reinforcement learning...
research
09/10/2018

A Multi-Agent Reinforcement Learning Method for Impression Allocation in Online Display Advertising

In online display advertising, guaranteed contracts and real-time biddin...
research
10/13/2020

Deep Reinforcement Learning for Real-Time Optimization of Pumps in Water Distribution Systems

Real-time control of pumps can be an infeasible task in water distributi...
research
02/27/2018

Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising

Real-time advertising allows advertisers to bid for each impression for ...
research
06/02/2021

Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play

Securing networked infrastructures is important in the real world. The p...

Please sign up or login with your details

Forgot password? Click here to reset