Detecting the Starting Frame of Actions in Video

06/07/2019
by   Iljung S. Kwak, et al.
9

To understand causal relationships between events in the world, it is useful to pinpoint when actions occur in videos and to examine the state of the world at and around that time point. For example, one must accurately detect the start of an audience response -- laughter in a movie, cheering at a sporting event -- to understand the cause of the reaction. In this work, we focus on the problem of accurately detecting action starts rather than isolated events or action ends. We introduce a novel structured loss function based on matching predictions to true action starts that is tailored to this problem; it more heavily penalizes extra and missed action start detections over small misalignments. Recurrent neural networks are used to minimize a differentiable approximation of this loss. To evaluate these methods, we introduce the Mouse Reach Dataset, a large, annotated video dataset of mice performing a sequence of actions. The dataset was labeled by experts for the purpose of neuroscience research on causally relating neural activity to behavior. On this dataset, we demonstrate that the structured loss leads to significantly higher accuracy than a baseline of mean-squared error loss.

READ FULL TEXT

page 2

page 4

page 15

page 16

page 20

page 21

page 22

research
12/03/2019

A Context-Aware Loss Function for Action Spotting in Soccer Videos

Action spotting is an important element of general activity understandin...
research
06/12/2020

ESAD: Endoscopic Surgeon Action Detection Dataset

In this work, we take aim towards increasing the effectiveness of surgic...
research
11/17/2015

Deep multi-scale video prediction beyond mean square error

Learning to predict future images from a video sequence involves the con...
research
06/13/2017

Action Search: Learning to Search for Human Activities in Untrimmed Videos

Traditional approaches for action detection use trimmed data to learn so...
research
08/18/2023

Progression-Guided Temporal Action Detection in Videos

We present a novel framework, Action Progression Network (APN), for temp...
research
03/23/2019

StartNet: Online Detection of Action Start in Untrimmed Videos

We propose StartNet to address Online Detection of Action Start (ODAS) w...
research
04/21/2016

Online Action Detection

In online action detection, the goal is to detect the start of an action...

Please sign up or login with your details

Forgot password? Click here to reset