Reinforcement Mechanism Design, with Applications to Dynamic Pricing in Sponsored Search Auctions

11/28/2017
by   Weiran Shen, et al.
0

In this study, we apply reinforcement learning techniques and propose what we call reinforcement mechanism design to tackle the dynamic pricing problem in sponsored search auctions. In contrast to previous game-theoretical approaches that heavily rely on rationality and common knowledge among the bidders, we take a data-driven approach, and try to learn, over repeated interactions, the set of optimal reserve prices. We implement our approach within the current sponsored search framework of a major search engine: we first train a buyer behavior model, via a real bidding data set, that accurately predicts bids given information that bidders are aware of, including the game parameters disclosed by the search engine, as well as the bidders' KPI data from previous rounds. We then put forward a reinforcement/MDP (Markov Decision Process) based algorithm that optimizes reserve prices over time, in a GSP-like auction. Our simulations demonstrate that our framework outperforms static optimization strategies including the ones that are currently in use, as well as several other dynamic ones.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2014

A Game-theoretic Machine Learning Approach for Revenue Maximization in Sponsored Search

Sponsored search is an important monetization channel for search engines...
research
05/05/2022

Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning

Dynamic mechanism design has garnered significant attention from both co...
research
11/22/2019

Analysis of Evolutionary Behavior in Self-Learning Media Search Engines

The diversity of intrinsic qualities of multimedia entities tends to imp...
research
03/06/2018

An Online Algorithm for Learning Buyer Behavior under Realistic Pricing Restrictions

We propose a new efficient online algorithm to learn the parameters gove...
research
09/09/2021

Deep Reinforcement Learning for Equal Risk Pricing and Hedging under Dynamic Expectile Risk Measures

Recently equal risk pricing, a framework for fair derivative pricing, wa...
research
12/06/2022

Learning with Opponent Modeling in Repeated Auctions

We design an algorithm to learn bidding strategies in repeated auctions....
research
06/27/2019

Playing Adaptively Against Stealthy Opponents: A Reinforcement Learning Strategy for the FlipIt Security Game

A rise in Advanced Persistant Threats (APTs) has introduced a need for r...

Please sign up or login with your details

Forgot password? Click here to reset