Mugs: A Multi-Granular Self-Supervised Learning Framework

03/27/2022
by   Pan Zhou, et al.
1

In self-supervised learning, multi-granular features are heavily desired though rarely investigated, as different downstream tasks (e.g., general and fine-grained classification) often require different or multi-granular features, e.g. fine- or coarse-grained one or their mixture. In this work, for the first time, we propose an effective MUlti-Granular Self-supervised learning (Mugs) framework to explicitly learn multi-granular visual features. Mugs has three complementary granular supervisions: 1) an instance discrimination supervision (IDS), 2) a novel local-group discrimination supervision (LGDS), and 3) a group discrimination supervision (GDS). IDS distinguishes different instances to learn instance-level fine-grained features. LGDS aggregates features of an image and its neighbors into a local-group feature, and pulls local-group features from different crops of the same image together and push them away for others. It provides complementary instance supervision to IDS via an extra alignment on local neighbors, and scatters different local-groups separately to increase discriminability. Accordingly, it helps learn high-level fine-grained features at a local-group level. Finally, to prevent similar local-groups from being scattered randomly or far away, GDS brings similar samples close and thus pulls similar local-groups together, capturing coarse-grained features at a (semantic) group level. Consequently, Mugs can capture three granular features that often enjoy higher generality on diverse downstream tasks over single-granular features, e.g. instance-level fine-grained features in contrastive learning. By only pretraining on ImageNet-1K, Mugs sets new SoTA linear probing accuracy 82.1% on ImageNet-1K and improves previous SoTA by 1.1%. It also surpasses SoTAs on other tasks, e.g. transfer learning, detection and segmentation.

READ FULL TEXT

page 2

page 13

page 22

page 23

page 24

research
07/29/2021

Self-Supervised Learning for Fine-Grained Image Classification

Fine-grained image classification involves identifying different subcate...
research
11/12/2014

Deep Multi-Instance Transfer Learning

We present a new approach for transferring knowledge from groups to indi...
research
02/16/2022

Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision

Discriminative self-supervised learning allows training models on any ra...
research
07/27/2020

Few-shot Knowledge Transfer for Fine-grained Cartoon Face Generation

In this paper, we are interested in generating fine-grained cartoon face...
research
03/30/2021

Benchmarking Representation Learning for Natural World Image Collections

Recent progress in self-supervised learning has resulted in models that ...
research
08/09/2020

Unsupervised Feature Learning by Cross-Level Discrimination between Instances and Groups

Unsupervised feature learning has made great strides with invariant mapp...
research
11/02/2022

Beyond Instance Discrimination: Relation-aware Contrastive Self-supervised Learning

Contrastive self-supervised learning (CSL) based on instance discriminat...

Please sign up or login with your details

Forgot password? Click here to reset