Improving Face Recognition by Clustering Unlabeled Faces in the Wild

07/14/2020
by   Aruni RoyChowdhury, et al.

While deep face recognition has benefited significantly from large-scale labeled data, current research is focused on leveraging unlabeled data to further boost performance, reducing the cost of human annotation. Prior work has mostly been in controlled settings, where the labeled and unlabeled data sets have no overlapping identities by construction. This is not realistic in large-scale face recognition, where one must contend with such overlaps, the frequency of which increases with the volume of data. Ignoring identity overlap leads to significant labeling noise, as data from the same identity is split into multiple clusters. To address this, we propose a novel identity separation method based on extreme value theory. It is formulated as an out-of-distribution detection algorithm, and greatly reduces the problems caused by overlapping-identity label noise. Considering cluster assignments as pseudo-labels, we must also overcome the labeling noise from clustering errors. We propose a modulation of the cosine loss, where the modulation weights correspond to an estimate of clustering uncertainty. Extensive experiments on both controlled and real settings demonstrate our method's consistent improvements over supervised baselines, e.g., 11.6 verification.
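To make the idea of an uncertainty-modulated cosine loss concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the abstract only says that the modulation weights correspond to an estimate of clustering uncertainty. The CosFace-style margin form, the `(1 - uncertainty) ** gamma` weighting, and all names and defaults (`weighted_cosine_loss`, `scale`, `margin`, `gamma`) are assumptions chosen for illustration.

```python
import numpy as np


def weighted_cosine_loss(features, class_weights, labels, uncertainty,
                         scale=30.0, margin=0.35, gamma=1.0):
    """Margin-based cosine softmax loss with per-sample down-weighting.

    Assumed setup (hypothetical, for illustration only):
      features      : (N, D) face embeddings
      class_weights : (C, D) one prototype vector per (pseudo-)identity
      labels        : (N,) integer class or cluster indices
      uncertainty   : (N,) values in [0, 1]; 1 means the pseudo-label
                      is considered completely unreliable
    Returns the modulated mean loss and the unmodulated per-sample losses.
    """
    # L2-normalize embeddings and prototypes so dot products are cosines.
    f = features / np.linalg.norm(features, axis=1, keepdims=True)
    w = class_weights / np.linalg.norm(class_weights, axis=1, keepdims=True)
    cos = f @ w.T  # shape (N, C)

    # Subtract the margin from the target-class cosine, then scale.
    idx = np.arange(len(labels))
    logits = scale * cos
    logits[idx, labels] = scale * (cos[idx, labels] - margin)

    # Numerically stable log-softmax over classes.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    per_sample = -log_prob[idx, labels]

    # Assumed modulation: samples with uncertain pseudo-labels count less.
    modulation = (1.0 - uncertainty) ** gamma
    return float((modulation * per_sample).mean()), per_sample
```

Under this sketch, labeled samples would be passed with an uncertainty of 0 so that they contribute at full weight, while pseudo-labeled samples from clustering would carry whatever uncertainty estimate the clustering stage provides.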


research
10/24/2019

Unknown Identity Rejection Loss: Utilizing Unlabeled Data for Face Recognition

Face recognition has advanced considerably with the availability of larg...
research
03/17/2020

Generalizing Face Representation with Unlabeled Data

In recent years, significant progress has been made in face recognition ...
research
02/09/2020

Asymmetric Rejection Loss for Fairer Face Recognition

Face recognition performance has seen a tremendous gain in recent years,...
research
09/05/2018

Consensus-Driven Propagation in Massive Unlabeled Data for Face Recognition

Face recognition has witnessed great progress in recent years, mainly at...
research
11/24/2016

Automatically Building Face Datasets of New Domains from Weakly Labeled Data with Pretrained Models

Training data are critical in face recognition systems. However, labelin...
research
04/04/2019

Learning to Cluster Faces on an Affinity Graph

Face recognition sees remarkable progress in recent years, and its perfo...
research
04/21/2023

Learn to Cluster Faces with Better Subgraphs

Face clustering can provide pseudo-labels to the massive unlabeled face ...
