Ego-Vehicle Action Recognition based on Semi-Supervised Contrastive Learning

03/02/2023
by Chihiro Noguchi, et al.

In recent years, many automobiles have been equipped with cameras, which have accumulated an enormous amount of video footage of driving scenes. Autonomous driving demands the highest level of safety, so even extremely rare driving scenes must be collected as training data to improve recognition accuracy for those specific scenes. However, finding a handful of specific scenes in an enormous volume of video is prohibitively costly. In this article, we show that proper video-to-video distances can be defined by focusing on ego-vehicle actions. It is well known that existing methods based on supervised learning work well for defining video-to-video distances in the embedding space between labeled videos, but cannot handle videos that do not fall into the predefined classes. To tackle this problem, we propose a method based on semi-supervised contrastive learning. We consider two related but distinct contrastive learning approaches: standard graph contrastive learning and our proposed SOIA-based contrastive learning. We observe that the latter provides more sensible video-to-video distances between unlabeled videos. We then quantify the effectiveness of our method by evaluating classification performance on ego-vehicle action recognition using the HDD dataset. The results show that our method, which includes unlabeled data in training, significantly outperforms existing methods that use only labeled data.
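To make the idea of video-to-video distances in an embedding space more concrete, the following is a minimal sketch of a generic InfoNCE-style contrastive objective together with a cosine distance between embeddings. It is not the SOIA-based or graph contrastive method of the paper, whose details the abstract does not give. The function names, the `temperature` value, and the assumption that each video has already been encoded into a fixed-length vector are all illustrative.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (assumed embedding post-processing)."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in v]


def cosine_distance(a, b):
    """Distance between two video embeddings: 0 for identical directions,
    1 for orthogonal ones, 2 for opposite ones."""
    a, b = l2_normalize(a), l2_normalize(b)
    return 1.0 - sum(x * y for x, y in zip(a, b))


def info_nce_loss(anchor, positive, negatives, temperature=0.1):
    """Generic InfoNCE loss for one anchor embedding (illustrative only).

    The loss is small when the anchor is closer to its positive than to
    any negative, which pulls similar videos together in the embedding
    space and pushes dissimilar ones apart.
    """
    def scaled_similarity(u, v):
        return (1.0 - cosine_distance(u, v)) / temperature

    logits = [scaled_similarity(anchor, positive)]
    logits += [scaled_similarity(anchor, n) for n in negatives]

    # Numerically stable log-sum-exp over all candidates.
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(l - m) for l in logits))

    # Negative log-probability of picking the positive among all candidates.
    return log_sum_exp - logits[0]
```

In this sketch, a trained encoder would yield embeddings for which `cosine_distance` is small between videos showing similar ego-vehicle actions, including unlabeled ones. How positives and negatives are chosen is exactly where contrastive methods differ, and the sketch leaves that choice to the caller.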


research
02/04/2021

Semi-Supervised Action Recognition with Temporal Contrastive Learning

Learning to recognize actions from only a handful of labeled videos is a...
research
07/12/2022

Contrastive Learning for Online Semi-Supervised General Continual Learning

We study Online Continual Learning with missing labels and propose SemiC...
research
11/25/2021

Learning from Temporal Gradient for Semi-supervised Action Recognition

Semi-supervised video action recognition tends to enable deep neural net...
research
03/22/2016

Multi-velocity neural networks for gesture recognition in videos

We present a new action recognition deep neural network which adaptively...
research
11/30/2020

Annotation-Efficient Untrimmed Video Action Recognition

Deep learning has achieved great success in recognizing video actions, b...
research
05/07/2021

Video Class Agnostic Segmentation with Contrastive Learning for Autonomous Driving

Semantic segmentation in autonomous driving predominantly focuses on lea...
research
03/01/2021

Fool Me Once: Robust Selective Segmentation via Out-of-Distribution Detection with Contrastive Learning

In this work, we train a network to simultaneously perform segmentation ...
