Learning to track for spatio-temporal action localization

06/05/2015
by   Philippe Weinzaepfel, et al.
0

We propose an effective approach for spatio-temporal action localization in realistic videos. The approach first detects proposals at the frame-level and scores them with a combination of static and motion CNN features. It then tracks high-scoring proposals throughout the video using a tracking-by-detection approach. Our tracker relies simultaneously on instance-level and class-level detectors. The tracks are scored using a spatio-temporal motion histogram, a descriptor at the track level, in combination with the CNN features. Finally, we perform temporal localization of the action using a sliding-window approach at the track level. We present experimental results for spatio-temporal localization on the UCF-Sports, J-HMDB and UCF-101 action localization datasets, where our approach outperforms the state of the art with a margin of 15

READ FULL TEXT

page 3

page 7

page 8

research
06/28/2018

Modeling Spatio-Temporal Human Track Structure for Action Localization

This paper addresses spatio-temporal localization of human actions in vi...
research
05/04/2017

Action Tubelet Detector for Spatio-Temporal Action Localization

Current state-of-the-art approaches for spatio-temporal action localizat...
research
05/28/2019

Improving Action Localization by Progressive Cross-stream Cooperation

Spatio-temporal action localization consists of three levels of tasks: s...
research
07/19/2017

Detecting Parts for Action Localization

In this paper, we propose a new framework for action localization that t...
research
08/22/2022

Minkowski Tracker: A Sparse Spatio-Temporal R-CNN for Joint Object Detection and Tracking

Recent research in multi-task learning reveals the benefit of solving re...
research
05/04/2017

Am I Done? Predicting Action Progress in Videos

In this paper we introduce the problem of predicting action progress in ...
research
04/09/2022

E^2TAD: An Energy-Efficient Tracking-based Action Detector

Video action detection (spatio-temporal action localization) is usually ...

Please sign up or login with your details

Forgot password? Click here to reset