Self-Supervised Representation Learning from Temporal Ordering of Automated Driving Sequences

02/17/2023
by Christopher Lang, et al.

Self-supervised feature learning enables perception systems to benefit from the vast amount of raw data recorded by vehicle fleets all over the world. However, the potential of such methods to learn dense representations from sequential data remains relatively unexplored. In this work, we propose TempO, a temporal ordering pretext task for pre-training region-level feature representations for perception tasks. We embed each frame as an unordered set of proposal feature vectors, a representation that is natural for instance-level perception architectures, and formulate sequential ordering prediction as a comparison of similarities between sets of feature vectors in a transformer-based multi-frame architecture. Extensive evaluation in automated driving domains on the BDD100K and MOT17 datasets shows that TempO outperforms existing self-supervised single-frame pre-training methods as well as supervised transfer learning initialization strategies on standard object detection and multi-object tracking benchmarks.
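To make the idea of ordering frames by comparing sets of feature vectors more concrete, here is a toy sketch. It is not the TempO architecture: the abstract describes a transformer-based multi-frame model with learned proposal features, whereas this example uses hand-made 2-D vectors, a simple mean-of-best-cosine-match set similarity, and a greedy ordering heuristic. All function names (`cosine`, `set_similarity`, `greedy_order`, `make_frame`) are hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def set_similarity(frame_a, frame_b):
    """Assumed set-to-set score: for each vector in frame_a, take its best
    cosine match in frame_b, then average. The result does not depend on
    the order of vectors within either frame."""
    if not frame_a or not frame_b:
        return 0.0
    best_matches = [max(cosine(u, v) for v in frame_b) for u in frame_a]
    return sum(best_matches) / len(best_matches)


def greedy_order(frames, first=0):
    """Greedy heuristic (an illustrative stand-in, not the paper's method):
    starting from a known first frame, repeatedly append the remaining
    frame that is most similar to the last frame placed."""
    order = [first]
    remaining = set(range(len(frames))) - {first}
    while remaining:
        current = frames[order[-1]]
        nxt = max(sorted(remaining), key=lambda j: set_similarity(current, frames[j]))
        order.append(nxt)
        remaining.remove(nxt)
    return order


def make_frame(angle_deg, reverse=False):
    """Hypothetical frame with two 'objects' whose features rotate slowly
    over time; `reverse` shuffles the order of vectors within the set."""
    a = math.radians(angle_deg)
    b = math.radians(angle_deg + 90)
    vectors = [(math.cos(a), math.sin(a)), (math.cos(b), math.sin(b))]
    return vectors[::-1] if reverse else vectors


# Frames captured at 0, 10, 20, 30 degrees, presented in shuffled order.
frames = [
    make_frame(0),
    make_frame(30, reverse=True),
    make_frame(10),
    make_frame(20, reverse=True),
]
predicted_order = greedy_order(frames, first=0)  # indices sorted by time
```

In this toy setup the features drift smoothly, so temporally adjacent frames have the most similar feature sets and the greedy pass recovers the original sequence. A learned model would have to produce features with this property itself, which is what makes ordering usable as a pretext task.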


