Face Recognition in the age of CLIP Billion image datasets

01/18/2023
by   Aaditya Bhat, et al.
0

CLIP (Contrastive Language-Image Pre-training) models developed by OpenAI have achieved outstanding results on various image recognition and retrieval tasks, displaying strong zero-shot performance. This means that they are able to perform effectively on tasks for which they have not been explicitly trained. Inspired by the success of OpenAI CLIP, a new publicly available dataset called LAION-5B was collected which resulted in the development of open ViT-H/14, ViT-G/14 models that outperform the OpenAI L/14 model. The LAION-5B dataset also released an approximate nearest neighbor index, with a web interface for search subset creation. In this paper, we evaluate the performance of various CLIP models as zero-shot face recognizers. Our findings show that CLIP models perform well on face recognition tasks, but increasing the size of the CLIP model does not necessarily lead to improved accuracy. Additionally, we investigate the robustness of CLIP models against data poisoning attacks by testing their performance on poisoned data. Through this analysis, we aim to understand the potential consequences and misuse of search engines built using CLIP models, which could potentially function as unintentional face recognition engines.

READ FULL TEXT
research
10/28/2021

FocusFace: Multi-task Contrastive Learning for Masked Face Recognition

SARS-CoV-2 has presented direct and indirect challenges to the scientifi...
research
03/24/2020

Dataset Cleaning – A Cross Validation Methodology for Large Facial Datasets using Face Recognition

In recent years, large "in the wild" face datasets have been released in...
research
03/23/2022

On the (Limited) Generalization of MasterFace Attacks and Its Relation to the Capacity of Face Representations

A MasterFace is a face image that can successfully match against a large...
research
10/13/2017

Recent Advances in Zero-shot Recognition

With the recent renaissance of deep convolution neural networks, encoura...
research
05/12/2020

A Novel Distributed Approximate Nearest Neighbor Method for Real-time Face Recognition

Nowadays face recognition and more generally, image recognition have man...
research
06/02/2022

Prefix Conditioning Unifies Language and Label Supervision

Vision-language contrastive learning suggests a new learning paradigm by...
research
09/08/2022

FETA: Towards Specializing Foundation Models for Expert Task Applications

Foundation Models (FMs) have demonstrated unprecedented capabilities inc...

Please sign up or login with your details

Forgot password? Click here to reset