Am I Done? Predicting Action Progress in Videos

05/04/2017
by Federico Becattini, et al.

In this paper we introduce the problem of predicting action progress in videos. We argue that this task is important for two reasons: it is valuable for a wide range of applications, and it leads to better action detection results. To solve this problem we introduce a novel approach, named ProgressNet, capable of predicting when an action takes place in a video, where it is located within the frames, and how far it has progressed during its execution. Motivated by the recent success of combining Convolutional and Recurrent Neural Networks, our model pairs the Faster R-CNN framework, which makes framewise predictions, with LSTM networks, which estimate action progress through time. After introducing two evaluation protocols for the task, we demonstrate that our model effectively predicts action progress on the UCF-101 and J-HMDB datasets. Additionally, we show that exploiting action progress also makes it possible to improve spatio-temporal localization.
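The abstract does not spell out how "how far it has progressed" is quantified. As a minimal illustrative sketch, one could assume that progress grows linearly from 0 at the first frame of an action to 1 at its last frame. The linear assumption and the helper names `linear_progress` and `progress_labels` are hypothetical and are not taken from the abstract.

```python
from typing import List, Optional


def linear_progress(frame_idx: int, start: int, end: int) -> Optional[float]:
    """Fraction of the action completed at frame_idx.

    ASSUMPTION: progress is linear in time between the annotated
    start and end frames. Returns None outside the action.
    """
    if end <= start:
        raise ValueError("end must be greater than start")
    if frame_idx < start or frame_idx > end:
        return None  # the action is not taking place in this frame
    return (frame_idx - start) / (end - start)


def progress_labels(num_frames: int, start: int, end: int) -> List[Optional[float]]:
    """Per-frame progress targets for a clip of num_frames frames."""
    return [linear_progress(t, start, end) for t in range(num_frames)]
```

Under this assumption, a frame halfway through an annotated action gets a target of 0.5, and frames outside the action get no progress value at all.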


