FootAndBall: Integrated player and ball detector

12/10/2019
by Jacek Komorowski, et al.

The paper describes a deep neural network-based detector dedicated to ball and player detection in high-resolution, long-shot video recordings of soccer matches. The detector, dubbed FootAndBall, has an efficient fully convolutional architecture and can operate on an input video stream of arbitrary resolution. It produces a ball confidence map encoding the position of the detected ball, a player confidence map, and a player bounding-box tensor encoding the players' positions and bounding boxes. The network uses the Feature Pyramid Network design pattern, in which lower-level features with higher spatial resolution are combined with higher-level features with a larger receptive field. This improves the discriminability of small objects (the ball), because a larger visual context around the object of interest is taken into account during classification. Due to its specialized design, the network has two orders of magnitude fewer parameters than a generic deep neural network-based object detector, such as SSD or YOLO. This allows real-time processing of a high-resolution input video stream.
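
The two ideas in the abstract, merging a coarse high-level feature map into a finer low-level one and reading a ball position out of a confidence map, can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the single-channel grids, the nearest-neighbour upsampling, the plain element-wise addition (no learned convolutions), and the names `upsample_nearest`, `fuse_top_down`, `decode_ball`, `stride`, and `threshold` are all assumptions made for illustration.

```python
from typing import List, Optional, Tuple

Grid = List[List[float]]  # hypothetical single-channel feature/confidence map


def upsample_nearest(coarse: Grid, factor: int) -> Grid:
    """Repeat every cell `factor` times along both axes (nearest-neighbour upsampling)."""
    out: Grid = []
    for row in coarse:
        wide = [v for v in row for _ in range(factor)]
        out.extend([list(wide) for _ in range(factor)])
    return out


def fuse_top_down(fine: Grid, coarse: Grid, factor: int = 2) -> Grid:
    """FPN-style merge (simplified): upsample the coarse map and add it to the fine map."""
    up = upsample_nearest(coarse, factor)
    assert len(up) == len(fine) and len(up[0]) == len(fine[0]), "shape mismatch"
    return [[f + u for f, u in zip(fr, ur)] for fr, ur in zip(fine, up)]


def decode_ball(conf: Grid, stride: int, threshold: float = 0.5) -> Optional[Tuple[float, float]]:
    """Return the (x, y) pixel centre of the most confident cell, or None if below threshold.

    `stride` is the assumed downscaling factor between the input image and the map.
    """
    best, (r, c) = max((v, (r, c)) for r, row in enumerate(conf) for c, v in enumerate(row))
    if best < threshold:
        return None
    return ((c + 0.5) * stride, (r + 0.5) * stride)


# Toy example: a 4x4 fine map and a 2x2 coarse map that agree on the lower-right region.
fine = [[0.1, 0.0, 0.0, 0.0],
        [0.0, 0.0, 0.0, 0.0],
        [0.0, 0.0, 0.2, 0.0],
        [0.0, 0.0, 0.0, 0.0]]
coarse = [[0.0, 0.0],
          [0.0, 0.6]]
fused = fuse_top_down(fine, coarse)
ball_xy = decode_ball(fused, stride=4)
```

In the toy example the fine map alone gives only a weak response at the ball cell; adding the upsampled coarse map, which summarizes a wider context, lifts that cell above the threshold. This mirrors, in a very reduced form, the argument the abstract makes for combining feature levels.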

research
02/19/2019

DeepBall: Deep Neural-Network Ball Detector

The paper describes a deep network based object detector specialized for...
research
12/14/2020

FasteNet: A Fast Railway Fastener Detector

In this work, a novel high-speed railway fastener detector is introduced...
research
05/22/2018

OmniDetector: With Neural Networks to Bounding Boxes

We propose a person detector on omnidirectional images, an accurate meth...
research
08/27/2021

FOVEA: Foveated Image Magnification for Autonomous Navigation

Efficient processing of high-resolution video streams is safety-critical...
research
12/10/2018

EDF: Ensemble, Distill, and Fuse for Easy Video Labeling

We present a way to rapidly bootstrap object detection on unseen videos ...
research
05/29/2020

Fixed-size Objects Encoding for Visual Relationship Detection

In this paper, we propose a fixed-size object encoding method (FOE-VRD) ...
research
04/21/2020

TTNet: Real-time temporal and spatial video analysis of table tennis

We present a neural network TTNet aimed at real-time processing of high-...