Learning from Comparisons and Choices

04/24/2017
by   Sahand Negahban, et al.
0

When tracking user-specific online activities, each user's preference is revealed in the form of choices and comparisons. For example, a user's purchase history tracks her choices, i.e. which item was chosen among a subset of offerings. A user's comparisons are observed either explicitly as in movie ratings or implicitly as in viewing times of news articles. Given such individualized ordinal data, we address the problem of collaboratively learning representations of the users and the items. The learned features can be used to predict a user's preference of an unseen item to be used in recommendation systems. This also allows one to compute similarities among users and items to be used for categorization and search. Motivated by the empirical successes of the MultiNomial Logit (MNL) model in marketing and transportation, and also more recent successes in word embedding and crowdsourced image embedding, we pose this problem as learning the MNL model parameters that best explains the data. We propose a convex optimization for learning the MNL model, and show that it is minimax optimal up to a logarithmic factor by comparing its performance to a fundamental lower bound. This characterizes the minimax sample complexity of the problem, and proves that the proposed estimator cannot be improved upon other than by a logarithmic factor. Further, the analysis identifies how the accuracy depends on the topology of sampling via the spectrum of the sampling graph. This provides a guideline for designing surveys when one can choose which items are to be compared. This is accompanies by numerical simulations on synthetic and real datasets confirming our theoretical predictions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/21/2014

Minimax-optimal Inference from Partial Rankings

This paper studies the problem of inferring a global preference based on...
research
07/06/2020

The Sample Complexity of Best-k Items Selection from Pairwise Comparisons

This paper studies the sample complexity (aka number of comparisons) bou...
research
08/12/2018

PAC-Battling Bandits with Plackett-Luce: Tradeoff between Sample Complexity and Subset Size

We introduce the probably approximately correct (PAC) version of the pro...
research
04/27/2015

Spectral MLE: Top-K Rank Aggregation from Pairwise Comparisons

This paper explores the preference-based top-K rank aggregation problem....
research
05/10/2019

Active Embedding Search via Noisy Paired Comparisons

Suppose that we wish to estimate a user's preference vector w from paire...
research
06/04/2018

Online Reciprocal Recommendation with Theoretical Performance Guarantees

A reciprocal recommendation problem is one where the goal of learning is...
research
01/12/2011

Extracting Features from Ratings: The Role of Factor Models

Performing effective preference-based data retrieval requires detailed a...

Please sign up or login with your details

Forgot password? Click here to reset