Deep Learning Methods for Efficient Large Scale Video Labeling

06/14/2017
by   Miha Skalic, et al.
0

We present a solution to "Google Cloud and YouTube-8M Video Understanding Challenge" that ranked 5th place. The proposed model is an ensemble of three model families, two frame level and one video level. The training was performed on augmented dataset, with cross validation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/14/2017

Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding

This paper describes our solution for the video recognition task of the ...
research
10/25/2019

Learning to Localize Temporal Events in Large-scale Video Data

We address temporal localization of events in large-scale video data, in...
research
07/13/2017

Cultivating DNN Diversity for Large Scale Video Labelling

We investigate factors controlling DNN diversity in the context of the G...
research
09/12/2018

Label Denoising with Large Ensembles of Heterogeneous Neural Networks

Despite recent advances in computer vision based on various convolutiona...
research
09/29/2018

Non-local NetVLAD Encoding for Video Classification

This paper describes our solution for the 2^nd YouTube-8M video understa...
research
06/26/2017

An Effective Way to Improve YouTube-8M Classification Accuracy in Google Cloud Platform

Large-scale datasets have played a significant role in progress of neura...
research
12/02/2019

BERT for Large-scale Video Segment Classification with Test-time Augmentation

This paper presents our approach to the third YouTube-8M video understan...

Please sign up or login with your details

Forgot password? Click here to reset