Frozen pretrained models have become a viable alternative to the
pretrai...
An important goal of self-supervised learning is to enable model pre-tra...
Recently, zero-shot image classification by vision-language pre-training...
This paper presents SimMIM, a simple framework for masked image modeling...
We present techniques for scaling Swin Transformer up to 3 billion param...
We introduce MixTraining, a new training paradigm for object detection t...
We are witnessing a modeling shift from CNN to Transformers in computer
...
This paper presents a new vision Transformer, called Swin Transformer, t...
Contrastive learning methods for unsupervised visual representation lear...
This paper presents parametric instance classification (PIC) for unsuper...
This paper introduces a negative margin loss to metric learning based
fe...