High Order Neural Networks for Video Classification

11/19/2018
by   Jie Shao, et al.
0

Capturing spatiotemporal correlations is an essential topic in video classification. In this paper, we present high order operations as a generic family of building blocks for capturing high order correlations from high dimensional input video space. We prove that several successful architectures for visual classification tasks are in the family of high order neural networks, theoretical and experimental analysis demonstrates their underlying mechanism is high order. We also proposal a new LEarnable hiGh Order (LEGO) block, whose goal is to capture spatiotemporal correlation in a feedforward manner. Specifically, LEGO blocks implicitly learn the relation expressions for spatiotemporal features and use the learned relations to weight input features. This building block can be plugged into many neural network architectures, achieving evident improvement without introducing much overhead. On the task of video classification, even using RGB only without fine-tuning with other video datasets, our high order models can achieve results on par with or better than the existing state-of-the-art methods on both Something-Something (V1 and V2) and Charades datasets.

READ FULL TEXT
research
11/24/2017

Appearance-and-Relation Networks for Video Classification

Spatiotemporal feature learning in videos is a fundamental and difficult...
research
11/12/2017

High-Order Attention Models for Visual Question Answering

The quest for algorithms that enable cognitive abilities is an important...
research
06/19/2018

Spatio-Temporal Channel Correlation Networks for Action Classification

The work in this paper is driven by the question if spatio-temporal corr...
research
11/04/2019

A Spectral Nonlocal Block for Neural Networks

The nonlocal network is designed for capturing long-range spatial-tempor...
research
08/16/2016

A Shallow High-Order Parametric Approach to Data Visualization and Compression

Explicit high-order feature interactions efficiently capture essential s...
research
10/09/2020

High-order Semantic Role Labeling

Semantic role labeling is primarily used to identify predicates, argumen...
research
04/29/2015

A Deep Learning Model for Structured Outputs with High-order Interaction

Many real-world applications are associated with structured data, where ...

Please sign up or login with your details

Forgot password? Click here to reset