Hierarchical Label Inference for Video Classification

06/15/2017
by   Nelson Nauata, et al.
0

Videos are a rich source of high-dimensional structured data, with a wide range of interacting components at varying levels of granularity. In order to improve understanding of unconstrained internet videos, it is important to consider the role of labels at separate levels of abstraction. In this paper, we consider the use of the Bidirectional Inference Neural Network (BINN) for performing graph-based inference in label space for the task of video classification. We take advantage of the inherent hierarchy between labels at increasing granularity. The BINN is evaluated on the first and second release of the YouTube-8M large scale multilabel video dataset. Our results demonstrate the effectiveness of BINN, achieving significant improvements against baseline models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/18/2018

Structured Label Inference for Visual Understanding

Visual data such as images and videos contain a rich source of structure...
research
06/24/2017

Encoding Video and Label Priors for Multi-label Video Classification on YouTube-8M dataset

YouTube-8M is the largest video dataset for multi-label video classifica...
research
06/26/2017

An Effective Way to Improve YouTube-8M Classification Accuracy in Google Cloud Platform

Large-scale datasets have played a significant role in progress of neura...
research
09/27/2016

YouTube-8M: A Large-Scale Video Classification Benchmark

Many recent advancements in Computer Vision are attributed to large data...
research
07/20/2022

On Label Granularity and Object Localization

Weakly supervised object localization (WSOL) aims to learn representatio...
research
07/13/2017

Large-scale Video Classification guided by Batch Normalized LSTM Translator

Youtube-8M dataset enhances the development of large-scale video recogni...
research
09/21/2018

Large-Scale Video Classification with Feature Space Augmentation coupled with Learned Label Relations and Ensembling

This paper presents the Axon AI's solution to the 2nd YouTube-8M Video U...

Please sign up or login with your details

Forgot password? Click here to reset