Exploring Temporal Differences in 3D Convolutional Neural Networks

09/07/2019
by   Gagan Kanojia, et al.

Traditional 3D convolutions are computationally expensive and memory intensive, and because of their large number of parameters they tend to overfit. 2D CNNs, on the other hand, are less computationally expensive and less memory intensive than 3D CNNs, and have shown remarkable results in applications such as image classification and object recognition. However, previous works have observed that they are inferior to 3D CNNs when applied to spatio-temporal input. In this work, we propose a convolutional block that extracts spatial information by performing a 2D convolution and extracts temporal information by exploiting temporal differences, i.e., the change in the spatial information at different time instances, using the simple operations of shift, subtract and add, without any additional trainable parameters. The proposed convolutional block has the same number of parameters as a 2D convolution kernel of size n×n, i.e., n^2, which is n times fewer than an n×n×n 3D convolution kernel. We show that 3D CNNs perform better when their 3D convolution kernels are replaced by the proposed convolutional blocks. We evaluate the proposed convolutional block on the UCF101 and ModelNet datasets.
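
The shift, subtract and add idea and the parameter comparison can be illustrated with a small pure-Python sketch. The abstract does not give the exact formulation, so everything here is an assumption made for illustration: the function names (`param_counts`, `shift`, `temporal_difference_mix`), the choice of the two immediate neighbours, the way the differences are summed, and the replicate padding at the sequence boundaries. The input is assumed to be the per-frame output of a 2D convolution, which is not implemented here.

```python
def param_counts(n):
    """Weights in one kernel (single input/output channel, no bias)."""
    return {"2d": n * n, "3d": n * n * n}


def shift(frames, offset):
    """Shift a list of frames along time so that out[t] = frames[t - offset].

    Out-of-range indices are clamped to the first/last frame (replicate
    padding). This boundary handling is an assumption, not from the paper.
    """
    last = len(frames) - 1
    return [frames[min(max(t - offset, 0), last)] for t in range(len(frames))]


def temporal_difference_mix(features):
    """Add temporal differences to per-frame features, with no trainable weights.

    `features` is a list of T frames, each a list of rows of numbers, assumed
    to come from a 2D convolution applied to every frame independently.

    Assumed combination (illustrative only):
        out[t] = f[t] + (f[t] - f[t-1]) + (f[t] - f[t+1])
    """
    prev_frames = shift(features, 1)    # prev_frames[t] = f[t-1]
    next_frames = shift(features, -1)   # next_frames[t] = f[t+1]
    out = []
    for cur, prv, nxt in zip(features, prev_frames, next_frames):
        frame = []
        for row_c, row_p, row_n in zip(cur, prv, nxt):
            frame.append([c + (c - p) + (c - n)
                          for c, p, n in zip(row_c, row_p, row_n)])
        out.append(frame)
    return out


if __name__ == "__main__":
    counts = param_counts(3)
    print(counts["2d"], counts["3d"], counts["3d"] // counts["2d"])

    # Three 1x2 "feature maps" that change over time.
    feats = [[[1, 2]], [[3, 5]], [[6, 9]]]
    print(temporal_difference_mix(feats))
```

In this sketch a static sequence passes through unchanged, because every temporal difference is zero; only frames whose spatial features change over time are modified. The only weights in the whole block would be those of the 2D convolution, which is where the n^2 versus n^3 comparison comes from.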

research
07/22/2020

Depthwise Spatio-Temporal STFT Convolutional Neural Networks for Human Action Recognition

Conventional 3D convolutional neural networks (CNNs) are computationally...
research
12/09/2019

Temporal Factorization of 3D Convolutional Kernels

3D convolutional neural networks are difficult to train because they are...
research
11/17/2020

Multigrid-in-Channels Neural Network Architectures

We present a multigrid-in-channels (MGIC) approach that tackles the quad...
research
09/30/2019

Spatio-Temporal FAST 3D Convolutions for Human Action Recognition

Effective processing of video input is essential for the recognition of ...
research
01/04/2021

DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions

As the key advancement of the convolutional neural networks (CNNs), dept...
research
09/30/2018

Benchmarks of ResNet Architecture for Atrial Fibrillation Classification

In this work we apply variations of ResNet architecture to the task of a...
research
08/12/2019

Enhanced 3D convolutional networks for crowd counting

Recently, convolutional neural networks (CNNs) are the leading defacto m...
