Visual Recognition with Deep Nearest Centroids

09/15/2022
by   Wenguan Wang, et al.
6

We devise deep nearest centroids (DNC), a conceptually elegant yet surprisingly effective network for large-scale visual recognition, by revisiting Nearest Centroids, one of the most classic and simple classifiers. Current deep models learn the classifier in a fully parametric manner, ignoring the latent data structure and lacking simplicity and explainability. DNC instead conducts nonparametric, case-based reasoning; it utilizes sub-centroids of training samples to describe class distributions and clearly explains the classification as the proximity of test data and the class sub-centroids in the feature space. Due to the distance-based nature, the network output dimensionality is flexible, and all the learnable parameters are only for data embedding. That means all the knowledge learnt for ImageNet classification can be completely transferred for pixel recognition learning, under the "pre-training and fine-tuning" paradigm. Apart from its nested simplicity and intuitive decision-making mechanism, DNC can even possess ad-hoc explainability when the sub-centroids are selected as actual training images that humans can view and inspect. Compared with parametric counterparts, DNC performs better on image classification (CIFAR-10, ImageNet) and greatly boots pixel recognition (ADE20K, Cityscapes), with improved transparency and fewer learnable parameters, using various network architectures (ResNet, Swin) and segmentation models (FCN, DeepLabV3, Swin). We feel this work brings fundamental insights into related fields.

READ FULL TEXT

page 2

page 4

page 8

page 9

page 20

page 21

page 23

research
03/28/2022

Rethinking Semantic Segmentation: A Prototype View

Prevalent semantic segmentation solutions, despite their different netwo...
research
02/07/2022

Corrupted Image Modeling for Self-Supervised Visual Pre-Training

We introduce Corrupted Image Modeling (CIM) for self-supervised visual p...
research
08/14/2018

Improving Generalization via Scalable Neighborhood Component Analysis

Current major approaches to visual recognition follow an end-to-end form...
research
12/18/2022

Bort: Towards Explainable Neural Networks with Bounded Orthogonal Constraint

Deep learning has revolutionized human society, yet the black-box nature...
research
10/17/2017

Learning to Learn Image Classifiers with Informative Visual Analogy

In recent years, we witnessed a huge success of Convolutional Neural Net...
research
07/29/2020

Generative Classifiers as a Basis for Trustworthy Computer Vision

With the maturing of deep learning systems, trustworthiness is becoming ...
research
11/23/2022

Self-Supervised Learning based on Heat Equation

This paper presents a new perspective of self-supervised learning based ...

Please sign up or login with your details

Forgot password? Click here to reset