On the robustness of self-supervised representations for multi-view object classification

07/27/2022
by   David Torpey, et al.
0

It is known that representations from self-supervised pre-training can perform on par, and often better, on various downstream tasks than representations from fully-supervised pre-training. This has been shown in a host of settings such as generic object classification and detection, semantic segmentation, and image retrieval. However, some issues have recently come to the fore that demonstrate some of the failure modes of self-supervised representations, such as performance on non-ImageNet-like data, or complex scenes. In this paper, we show that self-supervised representations based on the instance discrimination objective lead to better representations of objects that are more robust to changes in the viewpoint and perspective of the object. We perform experiments of modern self-supervised methods against multiple supervised baselines to demonstrate this, including approximating object viewpoint variation through homographies, and real-world tests based on several multi-view datasets. We find that self-supervised representations are more robust to object viewpoint and appear to encode more pertinent information about objects that facilitate the recognition of objects from novel views.

READ FULL TEXT

page 4

page 8

page 10

research
03/14/2022

UniVIP: A Unified Framework for Self-Supervised Visual Pre-training

Self-supervised learning (SSL) holds promise in leveraging large amounts...
research
09/19/2022

NeRF-SOS: Any-View Self-supervised Object Segmentation from Complex Real-World Scenes

Neural volumetric representations have shown the potential that Multi-la...
research
02/22/2023

Steerable Equivariant Representation Learning

Pre-trained deep image representations are useful for post-training task...
research
05/30/2023

A Computational Account Of Self-Supervised Visual Learning From Egocentric Object Play

Research in child development has shown that embodied experience handlin...
research
11/05/2022

Local Manifold Augmentation for Multiview Semantic Consistency

Multiview self-supervised representation learning roots in exploring sem...
research
09/29/2016

Multi-view Self-supervised Deep Learning for 6D Pose Estimation in the Amazon Picking Challenge

Robot warehouse automation has attracted significant interest in recent ...
research
06/24/2021

GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes

We present an algorithm that learns a coarse 3D representation of object...

Please sign up or login with your details

Forgot password? Click here to reset