Deep Concept-wise Temporal Convolutional Networks for Action Localization

08/26/2019
by Xin Li, et al.

Existing action localization approaches adopt shallow temporal convolutional networks (TCNs) on 1D feature maps extracted from video frames. In this paper, we empirically find that stacking more conventional temporal convolution layers actually degrades action classification performance, possibly because all channels of the 1D feature map, which are generally highly abstract and can be regarded as latent concepts, are excessively recombined in temporal convolution. To address this issue, we introduce a novel concept-wise temporal convolution (CTC) layer as an alternative to the conventional temporal convolution layer for training deeper action localization networks. Instead of recombining latent concepts, the CTC layer applies a number of temporal filters to each concept separately, with filter parameters shared across concepts. It can thus capture temporal patterns common to different concepts and significantly enrich representation ability. By stacking CTC layers, we propose a deep concept-wise temporal convolutional network (C-TCN), which boosts the state-of-the-art action localization performance on THUMOS'14 from 42.8 to 52.1 in terms of mAP (%), a relative improvement of 21.7%. Favorable results are also obtained on ActivityNet.
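
The concept-wise idea described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the function name `ctc_layer`, the `C x T` input layout, the zero padding, and the toy filters are all assumptions made for illustration, and the abstract does not say how the per-concept responses are normalized or merged in the real C-TCN. The sketch only shows the one property the abstract states, namely that each concept (channel) is filtered on its own by the same small bank of temporal filters, rather than being mixed with the other channels.

```python
def ctc_layer(features, filters):
    """Hypothetical concept-wise temporal convolution (illustrative only).

    features: list of C concept rows, each a list of T numbers.
    filters:  list of K temporal filters of odd length, shared by
              every concept (assumption: zero padding keeps length T).
    Returns a nested list of shape C x K x T.
    """
    outputs = []
    for concept in features:              # each concept is handled alone
        t_len = len(concept)
        per_concept = []
        for kernel in filters:            # the same K filters for all concepts
            half = len(kernel) // 2
            response = []
            for t in range(t_len):
                acc = 0.0
                for j, weight in enumerate(kernel):
                    src = t + j - half    # sliding window centred on t
                    if 0 <= src < t_len:  # positions outside count as zero
                        acc += weight * concept[src]
                response.append(acc)
            per_concept.append(response)
        outputs.append(per_concept)
    return outputs


# Toy input: C = 2 concepts observed over T = 4 time steps.
features = [[1, 2, 3, 4],
            [0, 1, 0, 1]]
# K = 2 shared filters: an identity filter and a temporal difference filter.
filters = [[0, 1, 0],
           [-1, 0, 1]]

out = ctc_layer(features, filters)
```

Here `out[c][k]` is the response of filter `k` on concept `c` alone, so no term ever sums across concepts. A conventional temporal convolution would instead sum over all `C` input channels for every output channel, which is the recombination the abstract argues against when layers are stacked.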


research
06/24/2021

Exploring Stronger Feature for Temporal Action Localization

Temporal action localization aims to localize starting and ending time w...
research
05/25/2019

Exploring Feature Representation and Training strategies in Temporal Action Localization

Temporal action localization has recently attracted significant interest...
research
11/22/2017

Temporal 3D ConvNets: New Architecture and Transfer Learning for Video Classification

The work in this paper is driven by the question how to exploit the temp...
research
10/15/2020

Why Layer-Wise Learning is Hard to Scale-up and a Possible Solution via Accelerated Downsampling

Layer-wise learning, as an alternative to global back-propagation, is ea...
research
11/19/2019

Cross-Class Relevance Learning for Temporal Concept Localization

We present a novel Cross-Class Relevance Learning approach for the task ...
research
05/21/2020

Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism

Formant tracking is one of the most fundamental problems in speech proce...
research
06/29/2020

Explainable 3D Convolutional Neural Networks by Learning Temporal Transformations

In this paper we introduce the temporally factorized 3D convolution (3TC...
