Incentive-Aware Recommender Systems in Two-Sided Markets

11/23/2022
by   Xiaowu Dai, et al.
0

Online platforms in the Internet Economy commonly incorporate recommender systems that recommend arms (e.g., products) to agents (e.g., users). In such platforms, a myopic agent has a natural incentive to exploit, by choosing the best product given the current information rather than to explore various alternatives to collect information that will be used for other agents. We propose a novel recommender system that respects agents' incentives and enjoys asymptotically optimal performances expressed by the regret in repeated games. We model such an incentive-aware recommender system as a multi-agent bandit problem in a two-sided market which is equipped with an incentive constraint induced by agents' opportunity costs. If the opportunity costs are known to the principal, we show that there exists an incentive-compatible recommendation policy, which pools recommendations across a genuinely good arm and an unknown arm via a randomized and adaptive approach. On the other hand, if the opportunity costs are unknown to the principal, we propose a policy that randomly pools recommendations across all arms and uses each arm's cumulative loss as feedback for exploration. We show that both policies also satisfy an ex-post fairness criterion, which protects agents from over-exploitation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/14/2023

Repeated Principal-Agent Games with Unobserved Agent Rewards and Perfect-Knowledge Agents

Motivated by a number of real-world applications from domains like healt...
research
02/11/2021

Regret, stability, and fairness in matching markets with bandit learners

We consider the two-sided matching market with bandit learners. In the s...
research
07/07/2020

Robust Multi-Agent Multi-Armed Bandits

There has been recent interest in collaborative multi-agent bandits, whe...
research
09/08/2022

Who Pays? Personalization, Bossiness and the Cost of Fairness

Fairness-aware recommender systems that have a provider-side fairness co...
research
05/19/2021

Incentivized Bandit Learning with Self-Reinforcing User Preferences

In this paper, we investigate a new multi-armed bandit (MAB) online lear...
research
09/07/2018

Cost Sharing in Two-Sided Markets

Motivated by the emergence of popular service-based two-sided markets wh...
research
01/29/2023

Recommender system as an exploration coordinator: a bounded O(1) regret algorithm for large platforms

On typical modern platforms, users are only able to try a small fraction...

Please sign up or login with your details

Forgot password? Click here to reset