Learning Classifiers under Delayed Feedback with a Time Window Assumption

09/28/2020
by   Masahiro Kato, et al.
0

We consider training a binary classifier under delayed feedback (DF Learning). In DF Learning, we first receive negative samples; subsequently, some samples turn positive. This problem is conceivable in various real-world applications such as online advertisements, where the user action takes place long after the first click. Owing to the delayed feedback, simply separating the positive and negative data causes a sample selection bias. One solution is to assume that a long time window after first observing a sample reduces the sample selection bias. However, existing studies report that only using a portion of all samples based on the time window assumption yields suboptimal performance, and the use of all samples along with the time window assumption improves empirical performance. Extending these existing studies, we propose a method with an unbiased and convex empirical risk constructed from the whole samples under the time window assumption. We provide experimental results to demonstrate the effectiveness of the proposed method using a real traffic log dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2021

Real Negatives Matter: Continuous Training with Real Negatives for Delayed Feedback Modeling

One of the difficulties of conversion rate (CVR) prediction is that the ...
research
03/18/2018

A Robust AUC Maximization Framework with Simultaneous Outlier Detection and Feature Selection for Positive-Unlabeled Classification

The positive-unlabeled (PU) classification is a common scenario in real-...
research
02/14/2022

Asymptotically Unbiased Estimation for Delayed Feedback Modeling via Label Correction

Alleviating the delayed feedback problem is of crucial importance for th...
research
08/07/2018

Instance-Dependent PU Learning by Bayesian Optimal Relabeling

When learning from positive and unlabelled data, it is a strong assumpti...
research
01/29/2020

Binary Classification from Positive Data with Skewed Confidence

Positive-confidence (Pconf) classification [Ishida et al., 2018] is a pr...
research
03/08/2023

Automatic Debiased Learning from Positive, Unlabeled, and Exposure Data

We address the issue of binary classification from positive and unlabele...
research
01/15/2021

Ask Me or Tell Me? Enhancing the Effectiveness of Crowdsourced Design Feedback

Crowdsourced design feedback systems are emerging resources for getting ...

Please sign up or login with your details

Forgot password? Click here to reset