Online Low Rank Matrix Completion

09/08/2022
by   Prateek Jain, et al.
3

We study the problem of online low-rank matrix completion with 𝖬 users, 𝖭 items and 𝖳 rounds. In each round, we recommend one item per user. For each recommendation, we obtain a (noisy) reward sampled from a low-rank user-item reward matrix. The goal is to design an online method with sub-linear regret (in 𝖳). While the problem can be mapped to the standard multi-armed bandit problem where each item is an independent arm, it leads to poor regret as the correlation between arms and users is not exploited. In contrast, exploiting the low-rank structure of reward matrix is challenging due to non-convexity of low-rank manifold. We overcome this challenge using an explore-then-commit (ETC) approach that ensures a regret of O(𝗉𝗈𝗅𝗒𝗅𝗈𝗀 (𝖬+𝖭) 𝖳^2/3). That is, roughly only 𝗉𝗈𝗅𝗒𝗅𝗈𝗀 (𝖬+𝖭) item recommendations are required per user to get non-trivial solution. We further improve our result for the rank-1 setting. Here, we propose a novel algorithm OCTAL (Online Collaborative filTering using iterAtive user cLustering) that ensures nearly optimal regret bound of O(𝗉𝗈𝗅𝗒𝗅𝗈𝗀 (𝖬+𝖭) 𝖳^1/2). Our algorithm uses a novel technique of clustering users and eliminating items jointly and iteratively, which allows us to obtain nearly minimax optimal rate in 𝖳.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/22/2019

Adaptive Matrix Completion for the Users and the Items in Tail

Recommender systems are widely used to recommend the most appealing item...
research
01/17/2023

Optimal Algorithms for Latent Bandits with Cluster Structure

We consider the problem of latent bandits with cluster structure where t...
research
07/08/2022

Interactive Recommendations for Optimal Allocations in Markets with Constraints

Recommendation systems when employed in markets play a dual role: they a...
research
05/26/2018

Nonlinear Inductive Matrix Completion based on One-layer Neural Networks

The goal of a recommendation system is to predict the interest of a user...
research
12/01/2013

Stochastic continuum armed bandit problem of few linear parameters in high dimensions

We consider a stochastic continuum armed bandit problem where the arms a...
research
02/11/2023

Regret Guarantees for Adversarial Online Collaborative Filtering

We investigate the problem of online collaborative filtering under no-re...
research
01/08/2019

Bilinear Bandits with Low-rank Structure

We introduce the bilinear bandit problem with low-rank structure where a...

Please sign up or login with your details

Forgot password? Click here to reset