Online Second Price Auction with Semi-bandit Feedback Under the Non-Stationary Setting

11/14/2019
by   Haoyu Zhao, et al.
0

In this paper, we study the non-stationary online second price auction problem. We assume that the seller is selling the same type of items in T rounds by the second price auction, and she can set the reserve price in each round. In each round, the bidders draw their private values from a joint distribution unknown to the seller. Then, the seller announced the reserve price in this round. Next, bidders with private values higher than the announced reserve price in that round will report their values to the seller as their bids. The bidder with the highest bid larger than the reserved price would win the item and she will pay to the seller the price equal to the second-highest bid or the reserve price, whichever is larger. The seller wants to maximize her total revenue during the time horizon T while learning the distribution of private values over time. The problem is more challenging than the standard online learning scenario since the private value distribution is non-stationary, meaning that the distribution of bidders' private values may change over time, and we need to use the non-stationary regret to measure the performance of our algorithm. To our knowledge, this paper is the first to study the repeated auction in the non-stationary setting theoretically. Our algorithm achieves the non-stationary regret upper bound Õ(min{√( S T), V̅^1/3T^2/3}), where S is the number of switches in the distribution, and V̅ is the sum of total variation, and S and V̅ are not needed to be known by the algorithm. We also prove regret lower bounds Ω(√( S T)) in the switching case and Ω(V̅^1/3T^2/3) in the dynamic case, showing that our algorithm has nearly optimal non-stationary regret.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/04/2020

Distributionally Robust Pricing in Independent Private Value Auctions

A seller chooses a reserve price in a second-price auction to maximize w...
research
02/20/2023

Leveraging Reviews: Learning to Price with Buyer and Seller Uncertainty

In online marketplaces, customers have access to hundreds of reviews for...
research
06/08/2021

Learning to Price Against a Moving Target

In the Learning to Price setting, a seller posts prices over time with t...
research
03/22/2020

Optimal No-regret Learning in Repeated First-price Auctions

We study online learning in repeated first-price auctions with censored ...
research
05/27/2023

Online Learning in Multi-unit Auctions

We consider repeated multi-unit auctions with uniform pricing, which are...
research
07/09/2020

Learning to Bid Optimally and Efficiently in Adversarial First-price Auctions

First-price auctions have very recently swept the online advertising ind...
research
10/19/2022

A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

We study reserve price optimization in multi-phase second price auctions...

Please sign up or login with your details

Forgot password? Click here to reset