Easy Learning from Label Proportions

We consider the problem of Learning from Label Proportions (LLP), a weakly supervised classification setup where instances are grouped into "bags", and only the frequency of class labels in each bag is available. The learner's objective, however, is to achieve low task loss at the individual instance level. Here we propose Easyllp: a flexible and simple-to-implement debiasing approach based on aggregate labels, which operates on arbitrary loss functions. Our technique allows us to accurately estimate the expected loss of an arbitrary model at the individual instance level. We showcase the flexibility of our approach by applying it to popular learning frameworks, such as Empirical Risk Minimization (ERM) and Stochastic Gradient Descent (SGD), with provable guarantees on instance-level performance. More concretely, we exhibit a variance reduction technique that makes the quality of LLP learning deteriorate only by a factor of k (k being the bag size) in both the ERM and SGD setups, as compared to full supervision. Finally, we validate our theoretical results on multiple datasets, demonstrating that our algorithm performs as well as or better than previous LLP approaches in spite of its simplicity.
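
To make the idea of debiasing with aggregate labels concrete, the following is a minimal sketch and not necessarily the exact estimator used in the paper. It assumes binary labels, bags of k instances drawn i.i.d., and a known class prior p = P(y = 1). Under those assumptions the surrogate label k(α - p) + p, where α is the bag's label proportion, has conditional expectation P(y = 1 | x) for every instance in the bag, so plugging it into any loss gives an unbiased estimate of the instance-level loss. The function name `debiased_bag_loss` and its parameters are hypothetical, introduced only for illustration.

```python
import numpy as np

def debiased_bag_loss(loss_if_pos, loss_if_neg, bag_proportion, prior):
    """Estimate the mean instance-level loss of one bag from its label proportion.

    Hypothetical helper (assumption, not the paper's code).
    loss_if_pos[i] is the loss of the model on instance i if its label were 1,
    loss_if_neg[i] is the loss if its label were 0.
    bag_proportion is the fraction of positive labels in the bag (alpha).
    prior is the assumed known marginal P(y = 1).
    """
    loss_if_pos = np.asarray(loss_if_pos, dtype=float)
    loss_if_neg = np.asarray(loss_if_neg, dtype=float)
    k = loss_if_pos.shape[0]  # bag size

    # Surrogate label shared by all instances in the bag.
    # It equals y_j + sum_{i != j} (y_i - prior), so the extra terms have
    # zero mean when instances are i.i.d. with P(y = 1) = prior.
    # Note that it can fall outside [0, 1].
    surrogate_label = k * (bag_proportion - prior) + prior

    # Any loss is linear in a {0, 1} label, so it extends to the surrogate.
    per_instance = (surrogate_label * loss_if_pos
                    + (1.0 - surrogate_label) * loss_if_neg)
    return per_instance.mean()
```

With bags of size 1 the surrogate label reduces to the true label, so the estimate coincides with the fully supervised loss. For larger bags the surrogate label's variance grows with k, which is in line with the abstract's remark that performance depends on the bag size.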


