Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 2020

07/20/2020
by Haisheng Su, et al.

This technical report presents an overview of our solution submitted to ActivityNet Challenge 2020 Task 1 (temporal action localization/detection). Temporal action localization requires not only precisely locating the temporal boundaries of action instances but also accurately classifying the untrimmed videos into specific categories. In this report, we decouple the temporal action localization task into two stages (i.e., proposal generation and classification) and enrich proposal diversity by exhaustively exploring the influence of multiple components from different but complementary perspectives. Specifically, to generate high-quality proposals, we consider several factors, including the video feature encoder, the proposal generator, the proposal-proposal relations, the scale imbalance, and the ensemble strategy. Finally, to obtain accurate detections, we further train a video classifier to recognize the generated proposals. Our proposed scheme achieves state-of-the-art performance on the temporal action localization task, with 42.26 average mAP on the challenge testing set.
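The abstract gives no implementation details, so the following is only a minimal sketch of the two-stage decoupling it describes, under stated assumptions: class-agnostic proposals are labeled with video-level classifier scores, and each detection score is the product of proposal confidence and class probability. The function names, the `top_k` parameter, and the score-fusion rule are hypothetical illustrations, not the authors' method. The `temporal_iou` helper shows the overlap measure on which the average mAP metric is based.

```python
from typing import Dict, List, Tuple

Proposal = Tuple[float, float, float]        # (start_sec, end_sec, confidence)
Detection = Tuple[float, float, str, float]  # (start_sec, end_sec, label, score)


def temporal_iou(a: Tuple[float, float], b: Tuple[float, float]) -> float:
    """Intersection over union of two temporal segments (start, end)."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


def label_proposals(
    proposals: List[Proposal],
    class_scores: Dict[str, float],
    top_k: int = 2,
) -> List[Detection]:
    """Stage 2 (assumed fusion rule): attach the top-k video-level classes
    to every stage-1 proposal and score each pair as confidence * class prob."""
    top = sorted(class_scores.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
    detections = [
        (start, end, label, conf * prob)
        for (start, end, conf) in proposals
        for (label, prob) in top
    ]
    # Highest-scoring detections first, as a ranked list for mAP evaluation.
    return sorted(detections, key=lambda d: d[3], reverse=True)


if __name__ == "__main__":
    # Hypothetical stage-1 output and video-level classifier scores.
    proposals = [(0.0, 10.0, 0.9), (20.0, 30.0, 0.5)]
    class_scores = {"long_jump": 0.8, "high_jump": 0.1, "javelin": 0.05}
    for det in label_proposals(proposals, class_scores, top_k=1):
        print(det)
    print(temporal_iou((0.0, 10.0), (5.0, 15.0)))
```

In this sketch the proposal generator and the classifier stay independent, so either stage can be swapped or ensembled without retraining the other.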


