Scaling and Benchmarking Self-Supervised Visual Representation Learning

05/03/2019
by Priya Goyal, et al.
Self-supervised learning aims to learn representations from the data itself without explicit manual supervision. Existing efforts ignore a crucial aspect of self-supervised learning: the ability to scale to large amounts of data, since self-supervision requires no manual labels. In this work, we revisit this principle and scale two popular self-supervised approaches to 100 million images. We show that by scaling on various axes (including data size and problem 'hardness'), one can largely match or even exceed the performance of supervised pre-training on a variety of tasks such as object detection, surface normal estimation (3D) and visual navigation using reinforcement learning. Scaling these methods also provides many interesting insights into the limitations of current self-supervised techniques and evaluations. We conclude that current self-supervised methods are not 'hard' enough to take full advantage of large-scale data and do not seem to learn effective high-level semantic representations. We also introduce an extensive benchmark across 9 different datasets and tasks. We believe that such a benchmark, along with comparable evaluation settings, is necessary to make meaningful progress.

research
12/04/2019

Self-Supervised Learning of Pretext-Invariant Representations

The goal of self-supervised learning from images is to construct image r...
research
08/04/2017

CASSL: Curriculum Accelerated Self-Supervised Learning

Recent self-supervised learning approaches focus on using a few thousand...
research
06/09/2021

Self-supervised Feature Enhancement: Applying Internal Pretext Task to Supervised Learning

Traditional self-supervised learning requires CNNs using external pretex...
research
07/31/2020

Self-supervised learning through the eyes of a child

Within months of birth, children have meaningful expectations about the ...
research
07/29/2022

RCA: Ride Comfort-Aware Visual Navigation via Self-Supervised Learning

Under shared autonomy, wheelchair users expect vehicles to provide safe ...
research
08/09/2023

A degree of image identification at sub-human scales could be possible with more advanced clusters

The purpose of the research is to determine if currently available self-...
research
11/26/2019

Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning

Visual aesthetic assessment has been an active research field for decade...
