Beyond Gaussian Pyramid: Multi-skip Feature Stacking for Action Recognition

11/24/2014
by   Zhenzhong Lan, et al.
0

Most state-of-the-art action feature extractors involve differential operators, which act as highpass filters and tend to attenuate low frequency action information. This attenuation introduces bias to the resulting features and generates ill-conditioned feature matrices. The Gaussian Pyramid has been used as a feature enhancing technique that encodes scale-invariant characteristics into the feature space in an attempt to deal with this attenuation. However, at the core of the Gaussian Pyramid is a convolutional smoothing operation, which makes it incapable of generating new features at coarse scales. In order to address this problem, we propose a novel feature enhancing technique called Multi-skIp Feature Stacking (MIFS), which stacks features extracted using a family of differential filters parameterized with multiple time skips and encodes shift-invariance into the frequency space. MIFS compensates for information lost from using differential operators by recapturing information at coarse scales. This recaptured information allows us to match actions at different speeds and ranges of motion. We prove that MIFS enhances the learnability of differential-based features exponentially. The resulting feature matrices from MIFS have much smaller conditional numbers and variances than those from conventional methods. Experimental results show significantly improved performance on challenging action recognition and event detection tasks. Specifically, our method exceeds the state-of-the-arts on Hollywood2, UCF101 and UCF50 datasets and is comparable to state-of-the-arts on HMDB51 and Olympics Sports datasets. MIFS can also be used as a speedup strategy for feature extraction with minimal or no accuracy cost.

READ FULL TEXT
research
11/28/2017

Revisiting hand-crafted feature for action recognition: a set of improved dense trajectories

We propose a feature for action recognition called Trajectory-Set (TS), ...
research
08/29/2014

Temporal Extension of Scale Pyramid and Spatial Pyramid Matching for Action Recognition

Historically, researchers in the field have spent a great deal of effort...
research
11/20/2017

Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion

Action recognition is an important yet challenging task in computer visi...
research
10/15/2015

Beyond Spatial Pyramid Matching: Space-time Extended Descriptor for Action Recognition

We address the problem of generating video features for action recogniti...
research
02/24/2023

Frequency and Scale Perspectives of Feature Extraction

Convolutional neural networks (CNNs) have achieved superior performance ...
research
06/29/2023

Residual Feature Pyramid Network for Enhancement of Vascular Patterns

The accuracy of finger vein recognition systems gets degraded due to low...
research
08/26/2020

Effective Action Recognition with Embedded Key Point Shifts

Temporal feature extraction is an essential technique in video-based act...

Please sign up or login with your details

Forgot password? Click here to reset