Learning Multiclass Classifier Under Noisy Bandit Feedback

06/05/2020
by   Mudit Agarwal, et al.
0

This paper addresses the problem of multiclass classification with corrupted or noisy bandit feedback. In this setting, the learner may not receive true feedback. Instead, it receives feedback that has been flipped with some non-zero probability. We propose a novel approach to deal with noisy bandit feedback, based on the unbiased estimator technique. We further propose an approach that can efficiently estimate the noise rates, and thus providing an end-to-end framework. The proposed algorithm enjoys mistake bound of the order of O(√(T)). We provide a theoretical mistake bound for our proposal. We also carry out extensive experiments on several benchmark datasets to demonstrate that our proposed approach successfully learns the underlying classifier even using noisy bandit feedbacks

READ FULL TEXT
research
05/17/2021

Multiclass Classification using dilute bandit feedback

This paper introduces a new online learning framework for multiclass cla...
research
10/11/2018

Online Multiclass Boosting with Bandit Feedback

We present online boosting algorithms for multiclass classification with...
research
08/08/2023

Multiclass Online Learnability under Bandit Feedback

We study online multiclass classification under bandit feedback. We exte...
research
01/12/2023

Thompson Sampling with Diffusion Generative Prior

In this work, we initiate the idea of using denoising diffusion models t...
research
07/05/2018

Contextual Bandits under Delayed Feedback

Delayed feedback is an ubiquitous problem in many industrial systems emp...
research
06/02/2016

Stochastic Structured Prediction under Bandit Feedback

Stochastic structured prediction under bandit feedback follows a learnin...
research
09/03/2022

Sharp bounds on the price of bandit feedback for several models of mistake-bounded online learning

We determine sharp bounds on the price of bandit feedback for several va...

Please sign up or login with your details

Forgot password? Click here to reset