Non-local NetVLAD Encoding for Video Classification

09/29/2018
by Yongyi Tang, et al.

This paper describes our solution for the 2nd YouTube-8M video understanding challenge organized by Google AI. Unlike video recognition benchmarks such as Kinetics and Moments, the YouTube-8M challenge provides pre-extracted visual and audio features instead of raw videos. In this challenge, the submitted model is restricted to 1GB, which encourages participants to focus on constructing one powerful single model rather than combining the results of many models. Our system fuses six different sub-models, categorized into three families, into one single computational graph. The most effective family is the model with non-local operations following the NetVLAD encoding. The other two families are Soft-BoF and GRU models. To further boost single-model performance, the model parameters of different checkpoints are averaged. Experimental results demonstrate that our proposed system can effectively perform the video classification task, achieving a GAP@20 of 0.88763 on the public test set and 0.88704 on the private test set. We ranked fourth in the YouTube-8M video understanding challenge.
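The checkpoint-averaging step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how many checkpoints are averaged or whether they are weighted, so this example assumes a uniform element-wise mean. The `Checkpoint` format (parameter name mapped to a flattened list of values), the function name `average_checkpoints`, and the parameter names `fc/weights` and `fc/bias` are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Sequence

# Hypothetical checkpoint format (an assumption for illustration):
# parameter name -> flattened list of parameter values.
Checkpoint = Dict[str, List[float]]


def average_checkpoints(checkpoints: Sequence[Checkpoint]) -> Checkpoint:
    """Return the uniform element-wise mean of several checkpoints.

    Assumes every checkpoint holds the same parameter names with the
    same sizes; uniform weighting is an assumption, not the paper's
    confirmed scheme.
    """
    if not checkpoints:
        raise ValueError("need at least one checkpoint")

    names = set(checkpoints[0])
    for ckpt in checkpoints[1:]:
        if set(ckpt) != names:
            raise ValueError("checkpoints must contain the same parameter names")

    averaged: Checkpoint = {}
    for name in checkpoints[0]:
        # One list of values per checkpoint for this parameter.
        per_checkpoint = [ckpt[name] for ckpt in checkpoints]
        if len({len(values) for values in per_checkpoint}) != 1:
            raise ValueError(f"parameter {name!r} has inconsistent sizes")
        # zip(*...) groups the i-th value of every checkpoint together.
        averaged[name] = [sum(vals) / len(vals) for vals in zip(*per_checkpoint)]
    return averaged


if __name__ == "__main__":
    ckpts = [
        {"fc/weights": [1.0, 2.0], "fc/bias": [0.0]},
        {"fc/weights": [3.0, 4.0], "fc/bias": [1.0]},
    ]
    print(average_checkpoints(ckpts))
```

Averaging happens in parameter space, so the result is still a single model of the original size. This fits the 1GB limit in a way that an ensemble of separate checkpoints would not.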


