From Weakly Supervised Learning to Active Learning

09/23/2022
by   Vivien Cabannes, et al.
0

Applied mathematics and machine computations have raised a lot of hope since the recent success of supervised learning. Many practitioners in industries have been trying to switch from their old paradigms to machine learning. Interestingly, those data scientists spend more time scrapping, annotating and cleaning data than fine-tuning models. This thesis is motivated by the following question: can we derive a more generic framework than the one of supervised learning in order to learn from clutter data? This question is approached through the lens of weakly supervised learning, assuming that the bottleneck of data collection lies in annotation. We model weak supervision as giving, rather than a unique target, a set of target candidates. We argue that one should look for an “optimistic” function that matches most of the observations. This allows us to derive a principle to disambiguate partial labels. We also discuss the advantage to incorporate unsupervised learning techniques into our framework, in particular manifold regularization approached through diffusion techniques, for which we derived a new algorithm that scales better with input dimension then the baseline method. Finally, we switch from passive to active weakly supervised learning, introducing the “active labeling” framework, in which a practitioner can query weak information about chosen data. Among others, we leverage the fact that one does not need full information to access stochastic gradients and perform stochastic gradient descent.

READ FULL TEXT
research
10/07/2022

Label Propagation with Weak Supervision

Semi-supervised learning and weakly supervised learning are important pa...
research
05/26/2022

Active Labeling: Streaming Stochastic Gradients

The workhorse of machine learning is stochastic gradient descent. To acc...
research
04/14/2022

ULF: Unsupervised Labeling Function Correction using Cross-Validation for Weak Supervision

A way to overcome expensive and time-consuming manual data labeling is w...
research
06/02/2021

Weakly Supervised Learning Creates a Fusion of Modeling Cultures

The past two decades have witnessed the great success of the algorithmic...
research
02/04/2021

Disambiguation of weak supervision with exponential convergence rates

Machine learning approached through supervised learning requires expensi...
research
03/19/2019

Cross-task weakly supervised learning from instructional videos

In this paper we investigate learning visual models for the steps of ord...
research
10/27/2019

An Active Approach for Model Interpretation

Model interpretation, or explanation of a machine learning classifier, a...

Please sign up or login with your details

Forgot password? Click here to reset