Hierarchical Graph-RNNs for Action Detection of Multiple Activities

01/21/2021
by   Sovan Biswas, et al.
0

In this paper, we propose an approach that spatially localizes the activities in a video frame where each person can perform multiple activities at the same time. Our approach takes the temporal scene context as well as the relations of the actions of detected persons into account. While the temporal context is modeled by a temporal recurrent neural network (RNN), the relations of the actions are modeled by a graph RNN. Both networks are trained together and the proposed approach achieves state of the art results on the AVA dataset.

READ FULL TEXT

page 1

page 2

research
03/31/2020

SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation

Temporal action segmentation is a topic of increasing interest, however,...
research
01/09/2019

Using stigmergy as a computational memory in the design of recurrent neural networks

In this paper, a novel architecture of Recurrent Neural Network (RNN) is...
research
06/13/2017

Action Search: Learning to Search for Human Activities in Untrimmed Videos

Traditional approaches for action detection use trimmed data to learn so...
research
06/01/2020

RNNs on Monitoring Physical Activity Energy Expenditure in Older People

Through the quantification of physical activity energy expenditure (PAEE...
research
09/21/2023

A Click Ahead: Real-Time Forecasting of Keyboard and Mouse Actions using RNNs and Computer Vision

Computer input is more complex than a sequence of single mouse clicks an...
research
11/28/2016

Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition

We present a unified framework for understanding human social behaviors ...
research
01/14/2020

EGO-TOPO: Environment Affordances from Egocentric Video

First-person video naturally brings the use of a physical environment to...

Please sign up or login with your details

Forgot password? Click here to reset