VideoGraph: Recognizing Minutes-Long Human Activities in Videos

05/13/2019
by   Noureldien Hussein, et al.
16

Many human activities take minutes to unfold. To represent them, related works opt for statistical pooling, which neglects the temporal structure. Others opt for convolutional methods, as CNN and Non-Local. While successful in learning temporal concepts, they are short of modeling minutes-long temporal dependencies. We propose VideoGraph, a method to achieve the best of two worlds: represent minutes-long human activities and learn their underlying temporal structure. VideoGraph learns a graph-based representation for human activities. The graph, its nodes and edges are learned entirely from video datasets, making VideoGraph applicable to problems without node-level annotation. The result is improvements over related works on benchmarks: Epic-Kitchen and Breakfast. Besides, we demonstrate that VideoGraph is able to learn the temporal structure of human activities in minutes-long videos.

READ FULL TEXT
research
12/04/2018

Timeception for Complex Action Recognition

This paper focuses on the temporal aspect for recognizing human activiti...
research
03/03/2021

Learning Asynchronous and Sparse Human-Object Interaction in Videos

Human activities can be learned from video. With effective modeling it i...
research
01/26/2022

Learning To Recognize Procedural Activities with Distant Supervision

In this paper we consider the problem of classifying fine-grained, multi...
research
07/25/2016

Much Ado About Time: Exhaustive Annotation of Temporal Data

Large-scale annotated datasets allow AI systems to learn from and build ...
research
05/05/2021

An Exploratory Study of Debugging Episodes

Many studies have long investigated how developers debug, shaping our un...
research
05/04/2023

Notes on Refactoring Exponential Macros in Common Lisp

I recently consulted for a very big Common Lisp project having more than...
research
09/02/2020

Long-Term Anticipation of Activities with Cycle Consistency

With the success of deep learning methods in analyzing activities in vid...

Please sign up or login with your details

Forgot password? Click here to reset