Fast Rate Learning in Stochastic First Price Bidding

07/05/2021
by   Juliette Achddou, et al.
0

First-price auctions have largely replaced traditional bidding approaches based on Vickrey auctions in programmatic advertising. As far as learning is concerned, first-price auctions are more challenging because the optimal bidding strategy does not only depend on the value of the item but also requires some knowledge of the other bids. They have already given rise to several works in sequential learning, many of which consider models for which the value of the buyer or the opponents' maximal bid is chosen in an adversarial manner. Even in the simplest settings, this gives rise to algorithms whose regret grows as √(T) with respect to the time horizon T. Focusing on the case where the buyer plays against a stationary stochastic environment, we show how to achieve significantly lower regret: when the opponents' maximal bid distribution is known we provide an algorithm whose regret can be as low as log^2(T); in the case where the distribution must be learnt sequentially, a generalization of this algorithm can achieve T^1/3+ ϵ regret, for any ϵ>0. To obtain these results, we introduce two novel ideas that can be of interest in their own right. First, by transposing results obtained in the posted price setting, we provide conditions under which the first-price biding utility is locally quadratic around its optimum. Second, we leverage the observation that, on small sub-intervals, the concentration of the variations of the empirical distribution function may be controlled more accurately than by using the classical Dvoretzky-Kiefer-Wolfowitz inequality. Numerical simulations confirm that our algorithms converge much faster than alternatives proposed in the literature for various bid distributions, including for bids collected on an actual programmatic advertising platform.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/05/2022

Leveraging the Hints: Adaptive Bidding in Repeated First-Price Auctions

With the advent and increasing consolidation of e-commerce, digital adve...
research
05/29/2022

No-regret Learning in Repeated First-Price Auctions with Budget Constraints

Recently the online advertising market has exhibited a gradual shift fro...
research
11/10/2020

Efficient Algorithms for Stochastic Repeated Second-price Auctions

Developing efficient sequential bidding strategies for repeated auctions...
research
02/22/2022

No-Regret Learning in Partially-Informed Auctions

Auctions with partially-revealed information about items are broadly emp...
research
11/03/2022

Phase Transitions in Learning and Earning under Price Protection Guarantee

Motivated by the prevalence of “price protection guarantee", which allow...
research
05/04/2022

Estimation of Standard Auction Models

We provide efficient estimation methods for first- and second-price auct...
research
09/28/2021

The Fragility of Optimized Bandit Algorithms

Much of the literature on optimal design of bandit algorithms is based o...

Please sign up or login with your details

Forgot password? Click here to reset