Adversarial Counterfactual Learning and Evaluation for Recommender System

11/08/2020
by   Da Xu, et al.
0

The feedback data of recommender systems are often subject to what was exposed to the users; however, most learning and evaluation methods do not account for the underlying exposure mechanism. We first show in theory that applying supervised learning to detect user preferences may end up with inconsistent results in the absence of exposure information. The counterfactual propensity-weighting approach from causal inference can account for the exposure mechanism; nevertheless, the partial-observation nature of the feedback data can cause identifiability issues. We propose a principled solution by introducing a minimax empirical risk formulation. We show that the relaxation of the dual problem can be converted to an adversarial game between two recommendation models, where the opponent of the candidate model characterizes the underlying exposure mechanism. We provide learning bounds and conduct extensive simulation studies to illustrate and justify the proposed approach over a broad range of recommendation settings, which shed insights on the various benefits of the proposed approach.

READ FULL TEXT
research
10/16/2022

On the User Behavior Leakage from Recommender System Exposure

Modern recommender systems are trained to predict users potential future...
research
01/01/2020

Modeling and Counteracting Exposure Bias in Recommender Systems

What we discover and see online, and consequently our opinions and decis...
research
08/15/2022

Debiased Recommendation with Neural Stratification

Debiased recommender models have recently attracted increasing attention...
research
04/12/2018

Attention-based Group Recommendation

Recommender systems are widely used in big information-based companies s...
research
12/20/2020

Towards Fair Personalization by Avoiding Feedback Loops

Self-reinforcing feedback loops are both cause and effect of over and/or...
research
09/06/2020

Information Theoretic Counterfactual Learning from Missing-Not-At-Random Feedback

Counterfactual learning for dealing with missing-not-at-random data (MNA...
research
11/14/2018

A causal inference framework for cancer cluster investigations using publicly available data

Often, a community becomes alarmed when high rates of cancer are noticed...

Please sign up or login with your details

Forgot password? Click here to reset