The Caltech Fish Counting Dataset: A Benchmark for Multiple-Object Tracking and Counting

07/19/2022
by   Justin Kay, et al.
34

We present the Caltech Fish Counting Dataset (CFC), a large-scale dataset for detecting, tracking, and counting fish in sonar videos. We identify sonar videos as a rich source of data for advancing low signal-to-noise computer vision applications and tackling domain generalization in multiple-object tracking (MOT) and counting. In comparison to existing MOT and counting datasets, which are largely restricted to videos of people and vehicles in cities, CFC is sourced from a natural-world domain where targets are not easily resolvable and appearance features cannot be easily leveraged for target re-identification. With over half a million annotations in over 1,500 videos sourced from seven different sonar cameras, CFC allows researchers to train MOT and counting algorithms and evaluate generalization performance at unseen test locations. We perform extensive baseline experiments and identify key challenges and opportunities for advancing the state of the art in generalization in MOT and counting.

READ FULL TEXT

page 6

page 7

page 11

page 26

page 28

page 32

research
04/18/2013

Object Tracking in Videos: Approaches and Issues

Mobile object tracking has an important role in the computer vision appl...
research
12/12/2022

CountingMOT: Joint Counting, Detection and Re-Identification for Multiple Object Tracking

The recent trend in multiple object tracking (MOT) is jointly solving de...
research
02/17/2022

Domain Randomization for Object Counting

Recently, the use of synthetic datasets based on game engines has been s...
research
06/27/2020

Counting Out Time: Class Agnostic Video Repetition Counting in the Wild

We present an approach for estimating the period with which an action is...
research
11/15/2017

People, Penguins and Petri Dishes: Adapting Object Counting Models To New Visual Domains And Object Types Without Forgetting

In this paper we propose a technique to adapt a convolutional neural net...
research
12/17/2020

Clique: Spatiotemporal Object Re-identification at the City Scale

Object re-identification (ReID) is a key application of city-scale camer...
research
05/08/2020

Introduction of a new Dataset and Method for Detecting and Counting the Pistachios based on Deep Learning

Pistachio is a nutritious nut that has many uses in the food industry. I...

Please sign up or login with your details

Forgot password? Click here to reset