The Devil of Face Recognition is in the Noise

07/31/2018
by   Fei Wang, et al.
2

The growing scale of face recognition datasets empowers us to train strong convolutional networks for face recognition. While a variety of architectures and loss functions have been devised, we still have a limited understanding of the source and consequence of label noise inherent in existing datasets. We make the following contributions: 1) We contribute cleaned subsets of popular face databases, i.e., MegaFace and MS-Celeb-1M datasets, and build a new large-scale noise-controlled IMDb-Face dataset. 2) With the original datasets and cleaned subsets, we profile and analyze label noise properties of MegaFace and MS-Celeb-1M. We show that a few orders more samples are needed to achieve the same accuracy yielded by a clean subset. 3) We study the association between different types of noise, i.e., label flips and outliers, with the accuracy of face recognition models. 4) We investigate ways to improve data cleanliness, including a comprehensive user study on the influence of data labeling strategies to annotation accuracy. The IMDb-Face dataset has been released on https://github.com/fwang91/IMDb-Face.

READ FULL TEXT

page 2

page 6

page 8

page 9

research
10/10/2022

BoundaryFace: A mining framework with noise label self-correction for Face Recognition

Face recognition has made tremendous progress in recent years due to the...
research
03/24/2020

Dataset Cleaning – A Cross Validation Methodology for Large Facial Datasets using Face Recognition

In recent years, large "in the wild" face datasets have been released in...
research
03/27/2021

Face Transformer for Recognition

Recently there has been great interests of Transformer not only in NLP b...
research
02/28/2019

MassFace: an efficient implementation using triplet loss for face recognition

In this paper we present an efficient implementation using triplet loss ...
research
07/18/2017

One-shot Face Recognition by Promoting Underrepresented Classes

We study in this paper the problem of one-shot face recognition, with th...
research
05/24/2022

OPOM: Customized Invisible Cloak towards Face Privacy Protection

While convenient in daily life, face recognition technologies also raise...
research
04/15/2022

Deep Unlearning via Randomized Conditionally Independent Hessians

Recent legislation has led to interest in machine unlearning, i.e., remo...

Please sign up or login with your details

Forgot password? Click here to reset