Incentivising Exploration and Recommendations for Contextual Bandits with Payments

01/22/2020
by   Priyank Agrawal, et al.
0

We propose a contextual bandit based model to capture the learning and social welfare goals of a web platform in the presence of myopic users. By using payments to incentivize these agents to explore different items/recommendations, we show how the platform can learn the inherent attributes of items and achieve a sublinear regret while maximizing cumulative social welfare. We also calculate theoretical bounds on the cumulative costs of incentivization to the platform. Unlike previous works in this domain, we consider contexts to be completely adversarial, and the behavior of the adversary is unknown to the platform. Our approach can improve various engagement metrics of users on e-commerce stores, recommendation engines and matching platforms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/21/2020

Contextual User Browsing Bandits for Large-Scale Online Mobile Recommendation

Online recommendation services recommend multiple commodities to users. ...
research
01/23/2019

Thompson Sampling for a Fatigue-aware Online Recommendation System

In this paper we consider an online recommendation setting, where a plat...
research
03/05/2020

Stochastic Linear Contextual Bandits with Diverse Contexts

In this paper, we investigate the impact of context diversity on stochas...
research
06/02/2020

Maximizing Cumulative User Engagement in Sequential Recommendation: An Online Optimization Perspective

To maximize cumulative user engagement (e.g. cumulative clicks) in seque...
research
04/15/2021

Variational Inference for Category Recommendation in E-Commerce platforms

Category recommendation for users on an e-Commerce platform is an import...
research
02/03/2023

How Bad is Top-K Recommendation under Competing Content Creators?

Content creators compete for exposure on recommendation platforms, and s...
research
07/06/2018

Combinatorial Bandits for Incentivizing Agents with Dynamic Preferences

The design of personalized incentives or recommendations to improve user...

Please sign up or login with your details

Forgot password? Click here to reset