Learning with Abandonment

02/23/2018
by   Ramesh Johari, et al.
0

Consider a platform that wants to learn a personalized policy for each user, but the platform faces the risk of a user abandoning the platform if she is dissatisfied with the actions of the platform. For example, a platform is interested in personalizing the number of newsletters it sends, but faces the risk that the user unsubscribes forever. We propose a general thresholded learning model for scenarios like this, and discuss the structure of optimal policies. We describe salient features of optimal personalization algorithms and how feedback the platform receives impacts the results. Furthermore, we investigate how the platform can efficiently learn the heterogeneity across users by interacting with a population and provide performance guarantees.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/05/2021

Sequential Choice Bandits with Feedback for Personalizing users' experience

In this work, we study sequential choice bandits with feedback. We propo...
research
05/26/2023

Reputation-based Persuasion Platforms

In this paper, we introduce a two-stage Bayesian persuasion model in whi...
research
02/12/2022

Online Bayesian Recommendation with No Regret

We introduce and study the online Bayesian recommendation problem for a ...
research
08/22/2020

Fatigue-aware Bandits for Dependent Click Models

As recommender systems send a massive amount of content to keep users en...
research
07/24/2019

Counterfactual Learning from Logs for Improved Ranking of E-Commerce Products

Improved search quality enhances users' satisfaction, which directly imp...
research
01/07/2018

Authorization Policies and Co-Operating Strategies of DSCloud Platform

DSCloud Platform provides the global directory service to solve the prob...
research
03/22/2018

The Roots of Bias on Uber

In the last decade, there has been a growth in, what we call, digitally ...

Please sign up or login with your details

Forgot password? Click here to reset