Efficient Online Learning of Optimal Rankings: Dimensionality Reduction via Gradient Descent

11/05/2020
by Dimitris Fotakis, et al.
We consider a natural model of online preference aggregation, where sets of preferred items R_1, R_2, …, R_t, each with a demand for k_t items from R_t, appear online. Without prior knowledge of (R_t, k_t), the learner maintains a ranking π_t, aiming for at least k_t items from R_t to appear high in π_t. This is a fundamental problem in preference aggregation, with applications to, e.g., ordering product or news items on web pages based on user scrolling and click patterns. The widely studied Generalized Min-Sum-Set-Cover (GMSSC) problem serves as a formal model for this setting. GMSSC is NP-hard, and the standard application of no-regret online learning algorithms is computationally inefficient because these algorithms operate in the space of rankings, whose size is factorial in the number of items. In this work, we show how to achieve low regret for GMSSC in polynomial time. We employ dimensionality reduction from rankings to the space of doubly stochastic matrices, where we apply Online Gradient Descent. A key step is to show how subgradients can be computed efficiently by solving the dual of a configuration LP. Using oblivious deterministic and randomized rounding schemes, we map doubly stochastic matrices back to rankings with a small loss in the GMSSC objective.
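To make the objective concrete, here is a minimal sketch of how the cost of one ranking could be evaluated on a sequence of requests. It assumes the usual GMSSC cover-time cost, i.e. the 1-based position in π_t at which the k_t-th item of R_t appears; the abstract does not spell this out. The function names gmssc_cost and total_cost are hypothetical and chosen only for illustration. This is not the paper's learning algorithm, only the loss that such an algorithm would be measured against.

```python
from typing import Hashable, Iterable, Sequence, Set, Tuple


def gmssc_cost(ranking: Sequence[Hashable], preferred: Set[Hashable], k: int) -> int:
    """Return the 1-based position at which the k-th preferred item appears.

    Assumption: this is the standard GMSSC cover-time cost for a request (R, k).
    """
    if k < 1 or k > len(preferred):
        raise ValueError("k must satisfy 1 <= k <= |preferred|")
    covered = 0
    for position, item in enumerate(ranking, start=1):
        if item in preferred:
            covered += 1
            if covered == k:
                return position
    raise ValueError("ranking contains fewer than k preferred items")


def total_cost(
    ranking: Sequence[Hashable],
    requests: Iterable[Tuple[Set[Hashable], int]],
) -> int:
    """Sum the cover-time cost of a single fixed ranking over all requests."""
    return sum(gmssc_cost(ranking, preferred, k) for preferred, k in requests)
```

For instance, with the ranking [3, 1, 4, 2] and the request ({1, 2}, 2), both preferred items are covered only once position 4 is reached, so the cost is 4. Regret then compares the learner's accumulated cost against the total cost of the best fixed ranking in hindsight.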

research
07/12/2019

Online-Learning for min-max discrete problems

We study various discrete nonlinear combinatorial optimization problems ...
research
06/06/2018

TopRank: A practical algorithm for online stochastic ranking

Online learning to rank is a sequential decision-making problem where in...
research
12/09/2020

Learning from User Interactions with Rankings: A Unification of the Field

Ranking systems form the basis for online search engines and recommendat...
research
09/11/2022

An Improved Algorithm For Online Reranking

We study a fundamental model of online preference aggregation, where an ...
research
10/23/2022

No-Regret Learning in Two-Echelon Supply Chain with Unknown Demand Distribution

Supply chain management (SCM) has been recognized as an important discip...
research
06/13/2011

Efficient Transductive Online Learning via Randomized Rounding

Most traditional online learning algorithms are based on variants of mir...
research
11/26/2017

Balancing Speed and Quality in Online Learning to Rank for Information Retrieval

In Online Learning to Rank (OLTR) the aim is to find an optimal ranking ...
