Weakly Supervised Learning with Side Information for Noisy Labeled Images

08/25/2020
by   Lele Cheng, et al.

In many real-world datasets, such as WebVision, the performance of DNN-based classifiers is often limited by noisily labeled data. Image-related side information, such as captions and tags, often reveals underlying relationships across images and can help tackle this problem. In this paper, we present an efficient weakly supervised learning approach based on a Side Information Network (SINet), which aims to carry out large-scale classification effectively under severely noisy labels. The proposed SINet consists of a visual prototype module and a noise weighting module. The visual prototype module is designed to generate a compact representation for each category by introducing the side information. The noise weighting module aims to estimate the correctness of each noisy image and to produce a confidence score for image ranking during the training procedure. The proposed SINet can largely alleviate the negative impact of noisy image labels and is beneficial for training a high-performance CNN-based classifier. In addition, we release a fine-grained product dataset called AliProducts, which contains more than 2.5 million noisy web images crawled from the internet using queries generated from 50,000 fine-grained semantic classes. In extensive experiments on several popular benchmarks (i.e., WebVision, ImageNet and Clothing-1M) and on our proposed AliProducts, SINet achieves state-of-the-art performance. SINet won first place in the classification task of the WebVision Challenge 2019, outperforming other competitors by a large margin.
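To make the idea of a per-category prototype and a per-image confidence score concrete, here is a minimal sketch in plain Python. It is not the SINet implementation: the abstract does not specify how prototypes or scores are computed, so this sketch assumes a prototype is simply the mean of the L2-normalized image features in a category (no side information is used), and that confidence is the cosine similarity to that prototype rescaled to [0, 1]. All function names are hypothetical.

```python
import math
from collections import defaultdict


def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def class_prototypes(features, labels):
    """Assumed prototype: the normalized mean of normalized features per label."""
    sums = {}
    counts = defaultdict(int)
    for f, y in zip(features, labels):
        f = l2_normalize(f)
        if y not in sums:
            sums[y] = [0.0] * len(f)
        sums[y] = [s + x for s, x in zip(sums[y], f)]
        counts[y] += 1
    return {y: l2_normalize([s / counts[y] for s in sums[y]]) for y in sums}


def confidence_scores(features, labels, prototypes):
    """Assumed score: cosine similarity to the label's prototype, mapped to [0, 1]."""
    scores = []
    for f, y in zip(features, labels):
        f = l2_normalize(f)
        cos = sum(a * b for a, b in zip(f, prototypes[y]))
        scores.append(0.5 * (cos + 1.0))
    return scores


# Toy example: three images share one noisy label; the third is an outlier.
features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
labels = ["shoe", "shoe", "shoe"]
prototypes = class_prototypes(features, labels)
scores = confidence_scores(features, labels, prototypes)
# Rank images from most to least trusted under the assumed score.
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
```

In this toy setup the outlier image receives the lowest score, so a training loop could down-weight or drop it. How SINet itself builds prototypes from captions and tags is described in the full paper, not here.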

