Exploring the Limits of Deep Image Clustering using Pretrained Models

03/31/2023
by   Nikolas Adaloglou, et al.
0

We present a general methodology that learns to classify images without labels by leveraging pretrained feature extractors. Our approach involves self-distillation training of clustering heads, based on the fact that nearest neighbors in the pretrained feature space are likely to share the same label. We propose a novel objective to learn associations between images by introducing a variant of pointwise mutual information together with instance weighting. We demonstrate that the proposed objective is able to attenuate the effect of false positive pairs while efficiently exploiting the structure in the pretrained feature space. As a result, we improve the clustering accuracy over k-means on 17 different pretrained models by 6.1% and 12.2% on ImageNet and CIFAR100, respectively. Finally, using self-supervised pretrained vision transformers we push the clustering accuracy on ImageNet to 61.6%. The code will be open-sourced.

READ FULL TEXT

page 7

page 13

page 16

page 17

research
02/24/2020

Deep Nearest Neighbor Anomaly Detection

Nearest neighbors is a successful and long-standing technique for anomal...
research
03/31/2023

LaCViT: A Label-aware Contrastive Training Framework for Vision Transformers

Vision Transformers have been incredibly effective when tackling compute...
research
03/01/2022

Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology

Tissue phenotyping is a fundamental task in learning objective character...
research
03/10/2023

Contrastive Language-Image Pretrained (CLIP) Models are Powerful Out-of-Distribution Detectors

We present a comprehensive experimental study on pretrained feature extr...
research
05/20/2019

Self-Supervised Similarity Learning for Digital Pathology

Using features extracted from networks pretrained on ImageNet is a commo...
research
03/28/2022

Core Risk Minimization using Salient ImageNet

Deep neural networks can be unreliable in the real world especially when...
research
07/06/2017

CNN features are also great at unsupervised classification

This paper aims at providing insight on the transferability of deep CNN ...

Please sign up or login with your details

Forgot password? Click here to reset