Event detection in coarsely annotated sports videos via parallel multi receptive field 1D convolutions

04/13/2020
by Kanav Vats, et al.

In problems such as sports video analytics, it is difficult to obtain accurate frame-level annotations and exact event durations because the videos are lengthy and the volume of video data is large. This issue is even more pronounced in fast-paced sports such as ice hockey. Obtaining annotations on a coarse scale can be much more practical and time-efficient. We propose the task of event detection in coarsely annotated videos and introduce a multi-tower temporal convolutional network architecture for it. With the help of multiple receptive fields, the network processes information at various temporal scales to account for uncertainty about the exact event location and duration. We demonstrate the effectiveness of the multi-receptive-field architecture through ablation studies. The method is evaluated on two tasks: event detection in coarsely annotated hockey videos in the NHL dataset and event spotting in soccer on the SoccerNet dataset. The two datasets lack frame-level annotations and have very distinct event frequencies. Experimental results demonstrate the effectiveness of the network by obtaining a 55% average F1 score on the NHL dataset and by achieving competitive performance compared to the state of the art on the SoccerNet dataset. We believe our approach will help develop more practical pipelines for event detection in sports video.
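
To make the multi-receptive-field idea concrete, the following is a minimal sketch in plain Python, not the authors' network. It assumes a single scalar feature per frame, and fixed averaging kernels stand in for learned convolution weights. The kernel sizes (3, 9, 15) and the function names `conv1d_same` and `multi_tower` are hypothetical, chosen only for illustration. Each "tower" filters the same sequence with a different kernel width, and the outputs are stacked per time step so that a later classifier could see short and long temporal context side by side.

```python
from typing import List, Sequence


def conv1d_same(signal: Sequence[float], kernel: Sequence[float]) -> List[float]:
    """1D cross-correlation with zero padding; output length equals input length."""
    k = len(kernel)
    if k % 2 == 0:
        raise ValueError("kernel length must be odd for symmetric padding")
    pad = k // 2
    padded = [0.0] * pad + list(signal) + [0.0] * pad
    return [
        sum(padded[t + i] * kernel[i] for i in range(k))
        for t in range(len(signal))
    ]


def multi_tower(
    signal: Sequence[float],
    kernel_sizes: Sequence[int] = (3, 9, 15),  # hypothetical receptive fields
) -> List[List[float]]:
    """Run parallel towers over the same input and stack outputs per time step.

    Each tower uses a fixed averaging kernel as a placeholder for learned weights.
    """
    towers = [conv1d_same(signal, [1.0 / k] * k) for k in kernel_sizes]
    # Transpose so that each time step gets one value per tower.
    return [list(step) for step in zip(*towers)]


# A single "event" at frame 10 of a 21-frame sequence.
sequence = [0.0] * 10 + [1.0] + [0.0] * 10
features = multi_tower(sequence)
for t in (2, 3, 8, 10):
    print(t, [round(v, 3) for v in features[t]])
```

In this toy input, the narrow tower responds only within one frame of the event, while the wider towers respond at frames further away. This shows how parallel receptive fields can tolerate uncertainty about where an event lies inside a coarsely annotated window.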


