Inter-Instance Similarity Modeling for Contrastive Learning

06/21/2023
by   Chengchao Shen, et al.
0

The existing contrastive learning methods widely adopt one-hot instance discrimination as pretext task for self-supervised learning, which inevitably neglects rich inter-instance similarities among natural images, then leading to potential representation degeneration. In this paper, we propose a novel image mix method, PatchMix, for contrastive learning in Vision Transformer (ViT), to model inter-instance similarities among images. Following the nature of ViT, we randomly mix multiple images from mini-batch in patch level to construct mixed image patch sequences for ViT. Compared to the existing sample mix methods, our PatchMix can flexibly and efficiently mix more than two images and simulate more complicated similarity relations among natural images. In this manner, our contrastive framework can significantly reduce the gap between contrastive objective and ground truth in reality. Experimental results demonstrate that our proposed method significantly outperforms the previous state-of-the-art on both ImageNet-1K and CIFAR datasets, e.g., 3.0 ImageNet-1K and 8.7 achieves the leading transfer performance on downstream tasks, object detection and instance segmentation on COCO dataset. The code is available at https://github.com/visresearch/patchmix

READ FULL TEXT

page 1

page 3

page 10

page 16

page 17

page 18

page 19

research
06/05/2023

Asymmetric Patch Sampling for Contrastive Learning

Asymmetric appearance between positive pair effectively reduces the risk...
research
07/22/2022

Adaptive Soft Contrastive Learning

Self-supervised learning has recently achieved great success in represen...
research
04/01/2021

Jigsaw Clustering for Unsupervised Visual Representation Learning

Unsupervised representation learning with contrastive learning achieved ...
research
03/17/2022

Modulated Contrast for Versatile Image Synthesis

Perceiving the similarity between images has been a long-standing and fu...
research
06/10/2021

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations

Contrastive self-supervised learning has outperformed supervised pretrai...
research
03/30/2023

Mixed Autoencoder for Self-supervised Visual Representation Learning

Masked Autoencoder (MAE) has demonstrated superior performance on variou...
research
08/18/2023

Rethinking Image Forgery Detection via Contrastive Learning and Unsupervised Clustering

Image forgery detection aims to detect and locate forged regions in an i...

Please sign up or login with your details

Forgot password? Click here to reset