Incentive-aware Contextual Pricing with Non-parametric Market Noise

11/08/2019
by   Negin Golrezaei, et al.
7

We consider a dynamic pricing problem for repeated contextual second-price auctions with strategic buyers whose goals are to maximize their long-term time discounted utility. The seller has very limited information about buyers' overall demand curves, which depends on d-dimensional context vectors characterizing auctioned items, and a non-parametric market noise distribution that captures buyers' idiosyncratic tastes. The noise distribution and the relationship between the context vectors and buyers' demand curves are both unknown to the seller. We focus on designing the seller's learning policy to set contextual reserve prices where the seller's goal is to minimize his regret for revenue. We first propose a pricing policy when buyers are truthful and show that it achieves a T-period regret bound of Õ(√(dT)) against a clairvoyant policy that has full information of the buyers' demand. Next, under the setting where buyers bid strategically to maximize their long-term discounted utility, we develop a variant of our first policy that is robust to strategic (corrupted) bids. This policy incorporates randomized "isolation" periods, during which a buyer is randomly chosen to solely participate in the auction. We show that this design allows the seller to control the number of periods in which buyers significantly corrupt their bids. Because of this nice property, our robust policy enjoys a T-period regret of Õ(√(dT)), matching that under the truthful setting up to a constant factor that depends on the utility discount factor.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/25/2020

Dynamic Incentive-aware Learning: Robust Pricing in Contextual Auctions

Motivated by pricing in ad exchange markets, we consider the problem of ...
research
10/19/2021

Dynamic pricing and assortment under a contextual MNL demand

We consider dynamic multi-product pricing and assortment problems under ...
research
10/22/2015

Inventory Control Involving Unknown Demand of Discrete Nonperishable Items - Analysis of a Newsvendor-based Policy

Inventory control with unknown demand distribution is considered, with e...
research
10/19/2022

A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

We study reserve price optimization in multi-phase second price auctions...
research
09/14/2017

Dynamic Pricing in Competitive Markets

Dynamic pricing of goods in a competitive environment to maximize revenu...
research
09/13/2021

Policy Optimization Using Semiparametric Models for Dynamic Pricing

In this paper, we study the contextual dynamic pricing problem where the...
research
07/17/2017

On consistency of optimal pricing algorithms in repeated posted-price auctions with strategic buyer

We study revenue optimization learning algorithms for repeated posted-pr...

Please sign up or login with your details

Forgot password? Click here to reset