Diverse Imagenet Models Transfer Better

04/19/2022
by Niv Nayman et al.

A commonly accepted hypothesis is that models with higher accuracy on ImageNet perform better on other downstream tasks, leading to much research dedicated to optimizing ImageNet accuracy. Recently, this hypothesis has been challenged by evidence showing that self-supervised models transfer better than their supervised counterparts, despite their inferior ImageNet accuracy. This calls for identifying the additional factors, beyond ImageNet accuracy, that make models transferable. In this work we show that high diversity of the features learnt by the model promotes transferability jointly with ImageNet accuracy. Encouraged by the recent transferability results of self-supervised models, we propose a method that combines self-supervised and supervised pretraining to generate models with both high diversity and high accuracy, and as a result high transferability. We demonstrate our results on several architectures and multiple downstream tasks, including both single-label and multi-label classification.
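The abstract does not spell out how the two objectives are combined. As an illustration only, below is a minimal PyTorch sketch of one plausible scheme: a weighted sum of a supervised cross-entropy term and a SimCLR-style NT-Xent contrastive term. The function name, the trade-off weight alpha, the temperature, and the choice of NT-Xent as the self-supervised loss are all assumptions for this sketch, not the paper's actual method.

```python
import torch
import torch.nn.functional as F

def combined_pretraining_loss(features, logits, labels,
                              temperature=0.5, alpha=0.5):
    """Hypothetical sketch: weighted sum of a supervised cross-entropy
    term and a SimCLR-style NT-Xent contrastive term.

    features: (2N, D) projections of two augmented views per image,
              ordered as [view1_0..view1_{N-1}, view2_0..view2_{N-1}]
    logits:   (2N, C) classifier-head outputs for the same batch
    labels:   (2N,)   ground-truth class labels
    alpha:    assumed trade-off weight between the two objectives
    """
    # Supervised term: standard cross-entropy on the labels.
    ce = F.cross_entropy(logits, labels)

    # Self-supervised term: NT-Xent over the two views.
    z = F.normalize(features, dim=1)
    sim = z @ z.t() / temperature  # (2N, 2N) scaled cosine similarities
    n = z.shape[0] // 2
    # Mask out self-similarity so each row's softmax ignores itself.
    mask = torch.eye(2 * n, dtype=torch.bool, device=z.device)
    sim.masked_fill_(mask, float('-inf'))
    # Positive pairs: view i matches view i+n, and vice versa.
    targets = torch.cat([torch.arange(n, 2 * n),
                         torch.arange(0, n)]).to(z.device)
    ntxent = F.cross_entropy(sim, targets)

    return alpha * ce + (1 - alpha) * ntxent

if __name__ == "__main__":
    # Toy usage with random tensors (N=8 images, two views each).
    n, d, c = 8, 128, 1000
    feats = torch.randn(2 * n, d)
    logits = torch.randn(2 * n, c)
    labels = torch.randint(0, c, (n,)).repeat(2)  # same label for both views
    print(combined_pretraining_loss(feats, logits, labels).item())
```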


Related research:

11/26/2020 · How Well Do Self-Supervised Models Transfer?
Self-supervised visual representation learning has seen huge progress in...

08/23/2021 · How Transferable Are Self-supervised Features in Medical Image Classification Tasks?
Transfer learning has become a standard practice to mitigate the lack of...

07/25/2022 · Dynamic Channel Selection in Self-Supervised Learning
Whilst computer vision models built using self-supervised approaches are...

09/02/2022 · Feature diversity in self-supervised learning
Many studies on scaling laws consider basic factors such as model size, ...

11/30/2021 · MC-SSL0.0: Towards Multi-Concept Self-Supervised Learning
Self-supervised pretraining is the method of choice for natural language...

04/11/2023 · A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation
Self-Supervised Learning (SSL) models rely on a pretext task to learn re...

09/11/2023 · Towards generalisable and calibrated synthetic speech detection with self-supervised representations
Generalisation – the ability of a model to perform well on unseen data –...
