Label Refinery: Improving ImageNet Classification through Label Progression

05/07/2018
by Hessam Bagherinezhad, et al.

Among the three main components of any supervised learning system (data, labels, and models), data and models have been the main subjects of active research, while labels and their properties have received very little attention. Current labeling principles and paradigms pose several challenges for machine learning algorithms: labels are often incomplete, ambiguous, and redundant. In this paper we study the effects of various properties of labels and introduce the Label Refinery, an iterative procedure that updates the ground truth labels after examining the entire dataset. We show significant gains from refined labels across a wide range of models. Using a Label Refinery improves the state-of-the-art top-1 accuracy of (1) AlexNet from 59.3 to 67.2, (2) MobileNet from 70.6 to 73.39, (3) MobileNet-0.25 from 50.6 to 55.59, (4) VGG19 from 72.7 to 75.46, and (5) Darknet19 from 72.9 to 74.47.
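The abstract describes the Label Refinery only as an iterative procedure that updates labels after examining the whole dataset. As a rough illustration of that idea, the toy sketch below assumes each generation trains a classifier on the current labels and then replaces them with that classifier's softmax predictions. The linear model, the synthetic data, and the names `train_softmax_regression` and `refine_labels` are all hypothetical and are not taken from the paper.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # stabilise the exponentials
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def train_softmax_regression(x, targets, steps=200, lr=0.5):
    """Fit a linear softmax classifier to (possibly soft) target distributions.

    Assumption: a linear model stands in for the deep networks in the abstract.
    """
    n, d = x.shape
    k = targets.shape[1]
    w = np.zeros((d, k))
    b = np.zeros(k)
    for _ in range(steps):
        probs = softmax(x @ w + b)
        grad = (probs - targets) / n  # cross-entropy gradient w.r.t. the logits
        w -= lr * (x.T @ grad)
        b -= lr * grad.sum(axis=0)
    return w, b

def refine_labels(x, one_hot, generations=3):
    """Return one label set per generation, starting with the original labels.

    Assumption: refined labels are the previous model's predicted distributions.
    """
    labels = [one_hot]
    for _ in range(generations):
        w, b = train_softmax_regression(x, labels[-1])
        labels.append(softmax(x @ w + b))
    return labels

# Synthetic two-class data, purely for illustration.
rng = np.random.default_rng(0)
x = rng.normal(size=(300, 5))
y = (x[:, 0] + 0.5 * x[:, 1] > 0).astype(int)
one_hot = np.eye(2)[y]

history = refine_labels(x, one_hot, generations=3)
```

Here `history[0]` holds the original one-hot labels and `history[-1]` the labels after the final generation. Each refined label is a full probability distribution over classes, which is one way a label can express ambiguity that a single hard class cannot.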

research
06/20/2021

Improving Label Quality by Jointly Modeling Items and Annotators

We propose a fully Bayesian framework for learning ground truth labels f...
research
12/18/2022

Multi-Instance Partial-Label Learning: Towards Exploiting Dual Inexact Supervision

Weakly supervised machine learning algorithms are able to learn from amb...
research
06/26/2019

Near Optimal Stratified Sampling

The performance of a machine learning system is usually evaluated by usi...
research
02/05/2020

Exploratory Machine Learning with Unknown Unknowns

In conventional supervised learning, a training dataset is given with gr...
research
04/26/2018

Weak Labeling for Crowd Learning

Crowdsourcing has become very popular among the machine learning communi...
research
12/08/2020

Concept Drift and Covariate Shift Detection Ensemble with Lagged Labels

In model serving, having one fixed model during the entire often life-lo...
research
06/17/2019

Active Learning by Greedy Split and Label Exploration

Annotating large unlabeled datasets can be a major bottleneck for machin...
