ProtoNet: Learning from Web Data with Memory

06/28/2019
by   Yi Tu, et al.
6

Learning from web data has attracted lots of research interest in recent years. However, crawled web images usually have two types of noises, label noise and background noise, which induce extra difficulties in utilizing them effectively. Most existing methods either rely on human supervision or ignore the background noise. In this paper, we propose the novel ProtoNet, which is capable of handling these two types of noises together, without the supervision of clean images in the training stage. Particularly, we use a memory module to identify the representative and discriminative prototypes for each category. Then, we remove noisy images and noisy region proposals from the web dataset with the aid of the memory module. Our approach is efficient and can be easily integrated into arbitrary CNN model. Extensive experiments on four benchmark datasets demonstrate the effectiveness of our method.

READ FULL TEXT

page 1

page 3

page 8

research
03/10/2018

Webly Supervised Learning with Category-level Semantic Information

As tons of photos are being uploaded to public websites (e.g., Flickr, B...
research
06/21/2021

Open-set Label Noise Can Improve Robustness Against Inherent Label Noise

Learning with noisy labels is a practically challenging problem in weakl...
research
01/23/2021

Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Noisy Samples and Utilizing Hard Ones

Labeling objects at a subordinate level typically requires expert knowle...
research
10/26/2021

Addressing out-of-distribution label noise in webly-labelled data

A recurring focus of the deep learning community is towards reducing the...
research
01/08/2022

Counteracting Dark Web Text-Based CAPTCHA with Generative Adversarial Learning for Proactive Cyber Threat Intelligence

Automated monitoring of dark web (DW) platforms on a large scale is the ...
research
08/06/2019

Deep Self-Learning From Noisy Labels

ConvNets achieve good results when training from clean data, but learnin...
research
12/27/2022

Truncate-Split-Contrast: A Framework for Learning from Mislabeled Videos

Learning with noisy label (LNL) is a classic problem that has been exten...

Please sign up or login with your details

Forgot password? Click here to reset