MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

07/27/2016
by Yandong Guo, et al.

In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and linking them to the corresponding entity keys in a knowledge base. More specifically, we propose a benchmark task to recognize one million celebrities from their face images, using all the face images of each individual that can be collected from the web as training data. The rich information provided by the knowledge base helps with disambiguation, improves recognition accuracy, and contributes to various real-world applications, such as image captioning and news video analysis. Associated with this task, we design and provide a concrete measurement set, an evaluation protocol, and training data. We also present our experimental setup in detail and report promising baseline results. Our benchmark task could lead to one of the largest classification problems in computer vision. To the best of our knowledge, our training dataset, which contains 10M images in version 1, is the largest publicly available one in the world.


