Relational Action Forecasting

04/08/2019
by   Chen Sun, et al.
16

This paper focuses on multi-person action forecasting in videos. More precisely, given a history of H previous frames, the goal is to detect actors and to predict their future actions for the next T frames. Our approach jointly models temporal and spatial interactions among different actors by constructing a recurrent graph, using actor proposals obtained with Faster R-CNN as nodes. Our method learns to select a subset of discriminative relations without requiring explicit supervision, thus enabling us to tackle challenging visual data. We refer to our model as Discriminative Relational Recurrent Network (DRRN). Evaluation of action prediction on AVA demonstrates the effectiveness of our proposed method compared to simpler baselines. Furthermore, we significantly improve performance on the task of early action classification on J-HMDB, from the previous SOTA of 48

READ FULL TEXT

page 1

page 2

page 7

page 8

research
12/14/2020

Temporal Relational Modeling with Self-Supervision for Action Segmentation

Temporal relational modeling in video is essential for human action unde...
research
01/11/2019

Anticipation and next action forecasting in video: an end-to-end model with memory

Action anticipation and forecasting in videos do not require a hat-trick...
research
12/16/2019

Predicting the Future: A Jointly Learnt Model for Action Anticipation

Inspired by human neurological structures for action anticipation, we pr...
research
10/26/2021

CTRN: Class-Temporal Relational Network for Action Detection

Action detection is an essential and challenging task, especially for de...
research
11/25/2022

Forecasting Actions and Characteristic 3D Poses

We propose to model longer-term future human behavior by jointly predict...
research
07/31/2023

AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?

Can we better anticipate an actor's future actions (e.g. mix eggs) by kn...
research
11/20/2020

Action Duration Prediction for Segment-Level Alignment of Weakly-Labeled Videos

This paper focuses on weakly-supervised action alignment, where only the...

Please sign up or login with your details

Forgot password? Click here to reset