Sequential Relevance Maximization with Binary Feedback

03/06/2015
by   Vijay Kamble, et al.
0

Motivated by online settings where users can provide explicit feedback about the relevance of products that are sequentially presented to them, we look at the recommendation process as a problem of dynamically optimizing this relevance feedback. Such an algorithm optimizes the fine tradeoff between presenting the products that are most likely to be relevant, and learning the preferences of the user so that more relevant recommendations can be made in the future. We assume a standard predictive model inspired by collaborative filtering, in which a user is sampled from a distribution over a set of possible types. For every product category, each type has an associated relevance feedback that is assumed to be binary: the category is either relevant or irrelevant. Assuming that the user stays for each additional recommendation opportunity with probability β independent of the past, the problem is to find a policy that maximizes the expected number of recommendations that are deemed relevant in a session. We analyze this problem and prove key structural properties of the optimal policy. Based on these properties, we first present an algorithm that strikes a balance between recursion and dynamic programming to compute this policy. We further propose and analyze two heuristic policies: a `farsighted' greedy policy that attains at least 1-β factor of the optimal payoff, and a naive greedy policy that attains at least 1-β/1+β factor of the optimal payoff in the worst case. Extensive simulations show that these heuristics are very close to optimal in practice.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/26/2022

Sparsity Regularization For Cold-Start Recommendation

Recently, Generative Adversarial Networks (GANs) have been applied to th...
research
04/29/2019

Dynamic Learning with Frequent New Product Launches: A Sequential Multinomial Logit Bandit Problem

Motivated by the phenomenon that companies introduce new products to kee...
research
10/19/2012

Active Collaborative Filtering

Collaborative filtering (CF) allows the preferences of multiple users to...
research
07/14/2016

Fifty Shades of Ratings: How to Benefit from a Negative Feedback in Top-N Recommendations Tasks

Conventional collaborative filtering techniques treat a top-n recommenda...
research
11/28/2022

Learning Recommendations from User Actions in the Item-poor Insurance Domain

While personalised recommendations are successful in domains like retail...
research
08/01/2023

Adaptive Collaborative Filtering with Personalized Time Decay Functions for Financial Product Recommendation

Classical recommender systems often assume that historical data are stat...
research
01/30/2022

Coordinated Attacks against Contextual Bandits: Fundamental Limits and Defense Mechanisms

Motivated by online recommendation systems, we propose the problem of fi...

Please sign up or login with your details

Forgot password? Click here to reset