Contextual Multi-Scale Region Convolutional 3D Network for Activity Detection

01/28/2018
by   Yancheng Bai, et al.
0

Activity detection is a fundamental problem in computer vision. Detecting activities of different temporal scales is particularly challenging. In this paper, we propose the contextual multi-scale region convolutional 3D network (CMS-RC3D) for activity detection. To deal with the inherent temporal scale variability of activity instances, the temporal feature pyramid is used to represent activities of different temporal scales. On each level of the temporal feature pyramid, an activity proposal detector and an activity classifier are learned to detect activities of specific temporal scales. Temporal contextual information is fused into activity classifiers for better recognition. More importantly, the entire model at all levels can be trained end-to-end. Our CMS-RC3D detector can deal with activities at all temporal scale ranges with only a single pass through the backbone network. We test our detector on two public activity detection benchmarks, THUMOS14 and ActivityNet. Extensive experiments show that the proposed CMS-RC3D detector outperforms state-of-the-art methods on THUMOS14 by a substantial margin and achieves comparable results on ActivityNet despite using a shallow feature extractor.

READ FULL TEXT

page 1

page 3

page 8

research
08/07/2018

Dynamic Temporal Pyramid Network: A Closer Look at Multi-Scale Modeling for Activity Detection

Recognizing instances at different scales simultaneously is a fundamenta...
research
03/22/2017

R-C3D: Region Convolutional 3D Network for Temporal Activity Detection

We address the problem of activity detection in continuous, untrimmed vi...
research
07/20/2017

Multi-Branch Fully Convolutional Network for Face Detection

Face detection is a fundamental problem in computer vision. It is still ...
research
07/21/2018

S3D: Single Shot multi-Span Detector via Fully 3D Convolutional Networks

In this paper, we present a novel Single Shot multi-Span Detector for te...
research
06/05/2019

Two-Stream Region Convolutional 3D Network for Temporal Activity Detection

We address the problem of temporal activity detection in continuous, unt...
research
07/04/2021

SSPNet: Scale Selection Pyramid Network for Tiny Person Detection from UAV Images

With the increasing demand for search and rescue, it is highly demanded ...
research
03/08/2017

A Pursuit of Temporal Accuracy in General Activity Detection

Detecting activities in untrimmed videos is an important but challenging...

Please sign up or login with your details

Forgot password? Click here to reset