DISCount: Counting in Large Image Collections with Detector-Based Importance Sampling

06/05/2023
by   Gustavo Perez, et al.
0

Many modern applications use computer vision to detect and count objects in massive image collections. However, when the detection task is very difficult or in the presence of domain shifts, the counts may be inaccurate even with significant investments in training data and model development. We propose DISCount – a detector-based importance sampling framework for counting in large image collections that integrates an imperfect detector with human-in-the-loop screening to produce unbiased estimates of counts. We propose techniques for solving counting problems over multiple spatial or temporal regions using a small number of screened samples and estimate confidence intervals. This enables end-users to stop screening when estimates are sufficiently accurate, which is often the goal in a scientific study. On the technical side we develop variance reduction techniques based on control variates and prove the (conditional) unbiasedness of the estimators. DISCount leads to a 9-12x reduction in the labeling costs over naive screening for tasks we consider, such as counting birds in radar imagery or estimating damaged buildings in satellite imagery, and also surpasses alternative covariate-based screening approaches in efficiency.

READ FULL TEXT

page 2

page 18

page 19

research
01/10/2019

The square root rule for adaptive importance sampling

In adaptive importance sampling, and other contexts, we have unbiased an...
research
06/29/2021

Detecting Cattle and Elk in the Wild from Space

Localizing and counting large ungulates – hoofed mammals like cows and e...
research
10/26/2021

Cross-Region Building Counting in Satellite Imagery using Counting Consistency

Estimating the number of buildings in any geographical region is a vital...
research
09/24/2021

Sample Efficient Model Evaluation

Labelling data is a major practical bottleneck in training and testing c...
research
09/13/2021

Low-Shot Validation: Active Importance Sampling for Estimating Classifier Performance on Rare Categories

For machine learning models trained with limited labeled training data, ...
research
02/02/2019

Stochastic Enumeration with Importance Sampling

Many hard problems in the computational sciences are equivalent to count...

Please sign up or login with your details

Forgot password? Click here to reset