An Effective Way to Improve YouTube-8M Classification Accuracy in Google Cloud Platform

06/26/2017
by Zhenzhen Zhong, et al.

Large-scale datasets have played a significant role in the progress of neural networks and deep learning. YouTube-8M is one such benchmark dataset for general multi-label video classification. It was created from over 7 million YouTube videos (450,000 hours of video) and includes video labels from a vocabulary of 4716 classes (3.4 labels per video on average). It also comes with pre-extracted audio and visual features for every second of video (3.2 billion feature vectors in total). Google Cloud recently released the dataset and organized the 'Google Cloud & YouTube-8M Video Understanding Challenge' on Kaggle. Competitors are challenged to develop classification algorithms that assign video-level labels using the new and improved YouTube-8M V2 dataset. Inspired by the competition, we began exploring audio understanding and classification using deep learning algorithms and ensemble methods. We built several baseline predictions following the benchmark paper and the public TensorFlow code on GitHub. Furthermore, we improved Global Average Precision (GAP) from base level 77
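The GAP metric named in the abstract can be illustrated with a minimal sketch. It assumes the definition commonly used for the Kaggle challenge: keep the top 20 predictions per video, pool them across all videos, sort by confidence, and sum precision at each correct prediction divided by the total number of ground-truth labels. The function name, argument layout, and toy data are hypothetical and are not taken from the paper or from the official starter code.

```python
def global_average_precision(scores, labels, top_k=20):
    """Sketch of GAP under the assumptions stated above.

    scores: list of per-video lists of class confidences (floats).
    labels: list of per-video lists of 0/1 ground-truth flags,
            aligned with `scores` class by class.
    """
    pooled = []          # (confidence, is_positive) pairs from all videos
    total_positives = 0  # number of ground-truth labels in the whole set

    for video_scores, video_labels in zip(scores, labels):
        total_positives += sum(video_labels)
        # Keep only the top_k most confident classes for this video.
        ranked = sorted(
            range(len(video_scores)),
            key=lambda c: video_scores[c],
            reverse=True,
        )[:top_k]
        pooled.extend((video_scores[c], video_labels[c]) for c in ranked)

    if total_positives == 0:
        return 0.0

    # Treat every kept prediction as one entry in a single global ranking.
    pooled.sort(key=lambda pair: pair[0], reverse=True)

    hits = 0
    gap = 0.0
    for rank, (_, is_positive) in enumerate(pooled, start=1):
        if is_positive:
            hits += 1
            # precision at this rank times the recall step 1/total_positives
            gap += (hits / rank) / total_positives
    return gap


# Toy usage: two videos, two classes each (made-up numbers).
toy_scores = [[0.9, 0.1], [0.2, 0.8]]
toy_labels = [[1, 0], [0, 1]]
toy_gap = global_average_precision(toy_scores, toy_labels)
```

Because all videos share one ranking, a confident wrong prediction on any video lowers the precision credited to every correct prediction ranked below it, which is why well-calibrated confidences matter for this metric.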


research
07/14/2017

Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding

This paper describes our solution for the video recognition task of the ...
research
09/27/2016

YouTube-8M: A Large-Scale Video Classification Benchmark

Many recent advancements in Computer Vision are attributed to large data...
research
06/14/2017

Deep Learning Methods for Efficient Large Scale Video Labeling

We present a solution to "Google Cloud and YouTube-8M Video Understandin...
research
09/29/2018

Non-local NetVLAD Encoding for Video Classification

This paper describes our solution for the 2^nd YouTube-8M video understa...
research
06/15/2017

Hierarchical Label Inference for Video Classification

Videos are a rich source of high-dimensional structured data, with a wid...
research
07/13/2017

Cultivating DNN Diversity for Large Scale Video Labelling

We investigate factors controlling DNN diversity in the context of the G...
research
07/13/2017

Large-scale Video Classification guided by Batch Normalized LSTM Translator

Youtube-8M dataset enhances the development of large-scale video recogni...
