Temporal Action Localization Using Gated Recurrent Units

Temporal Action Localization (TAL), which aims to predict the start time, end time, and class label of each action in a video, has many real-world applications. Because of its complexity, however, results on this task still lag behind those on action recognition. The difficulty lies in predicting precise start and end times for different actions in arbitrary videos. In this paper, we propose a new network based on the Gated Recurrent Unit (GRU) and two novel post-processing ideas for the TAL task. Specifically, we propose a new design for the output layer of the GRU, resulting in the so-called GRU-Splitted model. Moreover, linear interpolation is used to generate action proposals with precise start and end times. Finally, to rank the generated proposals appropriately, we use a Learn to Rank (LTR) approach. We evaluate the proposed method on the Thumos14 dataset. The results show that it outperforms state-of-the-art methods. In particular, for mean Average Precision (mAP) at an Intersection over Union (IoU) threshold of 0.7, we obtain 27.52, which exceeds that of state-of-the-art methods.
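To make the IoU criterion and the idea of interpolated boundaries concrete, the following is a minimal Python sketch. It is not the paper's implementation: the function names, the fixed score threshold of 0.5, and the rule of placing a boundary where a per-frame action score crosses that threshold are all assumptions made for illustration. Only the temporal IoU formula itself is standard.

```python
from typing import List, Tuple

Segment = Tuple[float, float]  # (start, end) in seconds


def temporal_iou(a: Segment, b: Segment) -> float:
    """Intersection over Union of two time segments."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def crossing_time(t0: float, s0: float, t1: float, s1: float, thr: float) -> float:
    """Time at which a score, linear between two samples, equals thr."""
    if s0 == s1:  # flat segment: no unique crossing, fall back to t0
        return t0
    return t0 + (thr - s0) * (t1 - t0) / (s1 - s0)


def proposals_from_scores(times: List[float], scores: List[float],
                          thr: float = 0.5) -> List[Segment]:
    """Hypothetical proposal generator (an assumption, not the paper's method).

    A proposal starts where the score rises through thr and ends where it
    falls back below thr; both boundaries are linearly interpolated
    between the two neighbouring frames.
    """
    proposals: List[Segment] = []
    start = None
    for i, score in enumerate(scores):
        above = score >= thr
        if above and start is None:
            # Rising edge: interpolate unless the video starts inside an action.
            start = times[0] if i == 0 else crossing_time(
                times[i - 1], scores[i - 1], times[i], score, thr)
        elif not above and start is not None:
            # Falling edge: interpolate the end boundary and close the proposal.
            end = crossing_time(times[i - 1], scores[i - 1], times[i], score, thr)
            proposals.append((start, end))
            start = None
    if start is not None:  # action still running at the last frame
        proposals.append((start, times[-1]))
    return proposals


if __name__ == "__main__":
    times = [0.0, 1.0, 2.0, 3.0, 4.0]
    scores = [0.0, 1.0, 1.0, 1.0, 0.0]
    predicted = proposals_from_scores(times, scores)
    ground_truth = (1.0, 4.0)
    print(predicted)
    print(temporal_iou(predicted[0], ground_truth) >= 0.7)
```

With frame-level boundaries alone, start and end times can only fall on sampled frames. Interpolating between frames lets a boundary land at a sub-frame time, which matters most at strict thresholds such as IoU 0.7, where a small boundary error can turn a match into a miss.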
