Capturing spatial interdependence in image features: the counting grid, an epitomic representation for bags of features

10/23/2014
by   Alessandro Perina, et al.
0

In recent scene recognition research images or large image regions are often represented as disorganized "bags" of features which can then be analyzed using models originally developed to capture co-variation of word counts in text. However, image feature counts are likely to be constrained in different ways than word counts in text. For example, as a camera pans upwards from a building entrance over its first few floors and then further up into the sky Fig. 1, some feature counts in the image drop while others rise -- only to drop again giving way to features found more often at higher elevations. The space of all possible feature count combinations is constrained both by the properties of the larger scene and the size and the location of the window into it. To capture such variation, in this paper we propose the use of the counting grid model. This generative model is based on a grid of feature counts, considerably larger than any of the modeled images, and considerably smaller than the real estate needed to tile the images next to each other tightly. Each modeled image is assumed to have a representative window in the grid in which the feature counts mimic the feature distribution in the image. We provide a learning procedure that jointly maps all images in the training set to the counting grid and estimates the appropriate local counts in it. Experimentally, we demonstrate that the resulting representation captures the space of feature count combinations more accurately than the traditional models, not only when the input images come from a panning camera, but even when modeling images of different scenes from the same category.

READ FULL TEXT

page 2

page 3

page 8

page 9

page 10

page 12

research
04/06/2019

Towards Locally Consistent Object Counting with Constrained Multi-stage Convolutional Neural Networks

High-density object counting in surveillance scenes is challenging mainl...
research
05/05/2022

BlobGAN: Spatially Disentangled Scene Representations

We propose an unsupervised, mid-level representation for a generative mo...
research
10/30/2020

Patterns Count-Based Labels for Datasets

Counts of attribute-value combinations are central to the profiling of a...
research
03/12/2015

Hierarchical learning of grids of microtopics

The counting grid is a grid of microtopics, sparse word/feature distribu...
research
07/30/2015

People Counting in High Density Crowds from Still Images

We present a method of estimating the number of people in high density c...
research
01/19/2020

The Power of Pivoting for Exact Clique Counting

Clique counting is a fundamental task in network analysis, and even the ...
research
09/13/2022

Computer vision system to count crustacean larvae

Fish products account for about 16 percent of the human diet worldwide, ...

Please sign up or login with your details

Forgot password? Click here to reset