Relation Modeling in Spatio-Temporal Action Localization

06/15/2021
by   Yutong Feng, et al.
0

This paper presents our solution to the AVA-Kinetics Crossover Challenge of ActivityNet workshop at CVPR 2021. Our solution utilizes multiple types of relation modeling methods for spatio-temporal action detection and adopts a training strategy to integrate multiple relation modeling in end-to-end training over the two large-scale video datasets. Learning with memory bank and finetuning for long-tailed distribution are also investigated to further improve the performance. In this paper, we detail the implementations of our solution and provide experiments results and corresponding discussions. We finally achieve 40.67 mAP on the test set of AVA-Kinetics.

READ FULL TEXT
research
06/29/2018

YH Technologies at ActivityNet Challenge 2018

This notebook paper presents an overview and comparative analysis of our...
research
04/01/2020

Spatio-Temporal Action Detection with Multi-Object Interaction

Spatio-temporal action detection in videos requires localizing the actio...
research
07/25/2019

Submission to ActivityNet Challenge 2019: Task B Spatio-temporal Action Localization

This technical report present an overview of our system proposed for the...
research
04/24/2023

End-to-End Spatio-Temporal Action Localisation with Video Transformers

The most performant spatio-temporal action localisation models use exter...
research
06/14/2020

Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization

Localizing persons and recognizing their actions from videos is a challe...
research
03/16/2020

A Generative Learning Approach for Spatio-temporal Modeling in Connected Vehicular Network

Spatio-temporal modeling of wireless access latency is of great importan...

Please sign up or login with your details

Forgot password? Click here to reset