Learning to Persuade on the Fly: Robustness Against Ignorance

02/19/2021
by You Zu, et al.

We study a repeated persuasion setting between a sender and a receiver, where at each time t, the sender observes a payoff-relevant state drawn independently and identically from an unknown prior distribution, and shares state information with the receiver, who then myopically chooses an action. As in the standard setting, the sender seeks to persuade the receiver to choose actions that are aligned with the sender's preference by selectively sharing information about the state. However, in contrast to the standard models, the sender does not know the prior and has to persuade while learning it on the fly. We study the sender's learning problem of making persuasive action recommendations so as to achieve low regret against the optimal persuasion mechanism that knows the prior distribution. Our main positive result is an algorithm that, with high probability, is persuasive across all rounds and achieves O(√(T log T)) regret, where T is the horizon length. The core philosophy behind the design of our algorithm is to leverage robustness against the sender's ignorance of the prior. Intuitively, at each time our algorithm maintains a set of candidate priors and chooses a persuasion scheme that is simultaneously persuasive for all of them. To demonstrate the effectiveness of our algorithm, we further prove that no algorithm can achieve regret better than Ω(√T), even if the persuasiveness requirements were significantly relaxed. Therefore, our algorithm achieves optimal regret for the sender's learning problem up to terms logarithmic in T.
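The idea of keeping a set of candidate priors and choosing a scheme that is persuasive for all of them can be illustrated with a toy sketch. This is not the paper's algorithm. It assumes a binary state (good or bad), a receiver who gains +1 from adopting in the good state, loses 1 from adopting in the bad state, and gets 0 from rejecting, and a sender who always prefers adoption. The candidate set is assumed to be a Hoeffding confidence interval around the empirical frequency of the good state. All function names and the choice of confidence radius are hypothetical.

```python
import math
import random


def confidence_interval(good_count, n, delta):
    """Hoeffding interval for the unknown prior mu = P(state is good).

    Assumption: a fixed per-round confidence level delta; a full analysis
    would tune delta to the horizon.
    """
    if n == 0:
        return 0.0, 1.0
    mean = good_count / n
    radius = math.sqrt(math.log(2.0 / delta) / (2.0 * n))
    return max(0.0, mean - radius), min(1.0, mean + radius)


def robust_adopt_prob(lo):
    """Probability q of recommending 'adopt' in the bad state.

    'Adopt' is always recommended in the good state. The recommendation is
    persuasive under prior mu iff mu - (1 - mu) * q >= 0, i.e.
    q <= mu / (1 - mu). That bound is increasing in mu, so using the lower
    end of the interval is persuasive for every candidate prior.
    """
    if lo >= 0.5:
        return 1.0
    return lo / (1.0 - lo)


def is_persuasive(q, mu):
    """Check the receiver's obedience constraint for 'adopt' under prior mu."""
    return mu - (1.0 - mu) * q >= -1e-12


def simulate(true_mu, horizon, delta, seed=0):
    """Run the toy sender; return (persuasive in all rounds?, expected regret)."""
    rng = random.Random(seed)
    good_count = 0
    all_persuasive = True
    regret = 0.0
    q_star = robust_adopt_prob(true_mu)  # benchmark that knows the prior
    for t in range(horizon):
        lo, _hi = confidence_interval(good_count, t, delta)
        q = robust_adopt_prob(lo)
        all_persuasive = all_persuasive and is_persuasive(q, true_mu)
        # Sender's expected payoff is P(adopt) = mu + (1 - mu) * q.
        regret += (1.0 - true_mu) * (q_star - q)
        state_good = rng.random() < true_mu
        good_count += state_good
    return all_persuasive, regret
```

Calling `simulate(0.3, 2000, 0.001)` returns whether every round's scheme was persuasive under the true prior, together with the cumulative expected regret against the benchmark that knows the prior. Choosing the lower end of the interval trades some sender payoff for persuasiveness, which is the robustness idea described in the abstract.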

research
09/08/2022

Sequential Information Design: Learning to Persuade in the Dark

We study a repeated information design problem faced by an informed send...
research
11/30/2021

Online Learning for Receding Horizon Control with Provable Regret Guarantees

We address the problem of learning to control an unknown linear dynamica...
research
10/28/2020

Provably Efficient Online Agnostic Learning in Markov Games

We study online agnostic learning, a problem that arises in episodic mul...
research
02/06/2023

Online Mechanism Design for Information Acquisition

We study the problem of designing mechanisms for information acquisition...
research
05/28/2021

Regret-Minimizing Bayesian Persuasion

We study a Bayesian persuasion setting with binary actions (adopt and re...
research
06/22/2020

Bandit algorithms: Letting go of logarithmic regret for statistical robustness

We study regret minimization in a stochastic multi-armed bandit setting ...
