Conquering the CNN Over-Parameterization Dilemma: A Volterra Filtering Approach for Action Recognition

10/21/2019
by   Siddharth Roheda, et al.
0

The importance of inference in Machine Learning (ML) has led to an explosive number of different proposals in ML, and particularly in Deep Learning. In an attempt to reduce the complexity of Convolutional Neural Networks, we propose a Volterra filter-inspired Network architecture. This architecture introduces controlled non-linearities in the form of interactions between the delayed input samples of data. We propose a cascaded implementation of Volterra Filter so as to significantly reduce the number of parameters required to carry out the same classification task as that of a conventional Neural Network. We demonstrate an efficient parallel implementation of this new Volterra network, along with its remarkable performance while retaining a relatively simpler and potentially more tractable structure. Furthermore, we show a rather sophisticated adaptation of this network to nonlinearly fuse the RGB (spatial) information and the Optical Flow (temporal) information of a video sequence for action recognition. The proposed approach is evaluated on UCF-101 and HMDB-51 datasets for action recognition, and is shown to outperform state of the art when trained on the datasets from scratch (i.e. without pre-training on a larger dataset).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2018

Motion Feature Network: Fixed Motion Filter for Action Recognition

Spatio-temporal representations in frame sequences play an important rol...
research
11/24/2018

RGB-D Based Action Recognition with Light-weight 3D Convolutional Networks

Different from RGB videos, depth data in RGB-D videos provide key comple...
research
09/18/2019

Global Temporal Representation based CNNs for Infrared Action Recognition

Infrared human action recognition has many advantages, i.e., it is insen...
research
07/13/2022

Is Appearance Free Action Recognition Possible?

Intuition might suggest that motion and dynamic information are key to v...
research
08/10/2020

2nd Place Scheme on Action Recognition Track of ECCV 2020 VIPriors Challenges: An Efficient Optical Flow Stream Guided Framework

To address the problem of training on small datasets for action recognit...
research
06/15/2019

A high performance computing method for accelerating temporal action proposal generation

Temporal action proposal generation, coming from temporal action recogni...
research
11/29/2018

Iterative Projection and Matching: Finding Structure-preserving Representatives and Its Application to Computer Vision

The goal of data selection is to capture the most structural information...

Please sign up or login with your details

Forgot password? Click here to reset