Learning from a Learning User for Optimal Recommendations

02/03/2022
by   Fan Yao, et al.
0

In real-world recommendation problems, especially those with a formidably large item space, users have to gradually learn to estimate the utility of any fresh recommendations from their experience about previously consumed items. This in turn affects their interaction dynamics with the system and can invalidate previous algorithms built on the omniscient user assumption. In this paper, we formalize a model to capture such "learning users" and design an efficient system-side learning solution, coined Noise-Robust Active Ellipsoid Search (RAES), to confront the challenges brought by the non-stationary feedback from such a learning user. Interestingly, we prove that the regret of RAES deteriorates gracefully as the convergence rate of user learning becomes worse, until reaching linear regret when the user's learning fails to converge. Experiments on synthetic datasets demonstrate the strength of RAES for such a contemporaneous system-user learning problem. Our study provides a novel perspective on modeling the feedback loop in recommendation problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/23/2020

Regret in Online Recommendation Systems

This paper proposes a theoretical analysis of recommendation systems in ...
research
07/25/2020

Feedback Loop and Bias Amplification in Recommender Systems

Recommendation algorithms are known to suffer from popularity bias; a fe...
research
11/06/2017

Regret Bounds and Regimes of Optimality for User-User and Item-Item Collaborative Filtering

We consider an online model for recommendation systems, with each user b...
research
05/18/2012

Online Structured Prediction via Coactive Learning

We propose Coactive Learning as a model of interaction between a learnin...
research
08/15/2023

Dynamic Embedding Size Search with Minimum Regret for Streaming Recommender System

With the continuous increase of users and items, conventional recommende...
research
02/15/2021

ELIXIR: Learning from User Feedback on Explanations to Improve Recommender Models

System-provided explanations for recommendations are an important compon...
research
11/03/2015

TribeFlow: Mining & Predicting User Trajectories

Which song will Smith listen to next? Which restaurant will Alice go to ...

Please sign up or login with your details

Forgot password? Click here to reset