Generalized Max Pooling

06/02/2014
by Naila Murray, et al.

State-of-the-art patch-based image representations involve a pooling operation that aggregates statistics computed from local descriptors. Standard pooling operations include sum- and max-pooling. Sum-pooling lacks discriminability because the resulting representation is strongly influenced by frequent yet often uninformative descriptors, but only weakly influenced by rare yet potentially highly informative ones. Max-pooling equalizes the influence of frequent and rare descriptors but is only applicable to representations that rely on count statistics, such as the bag-of-visual-words (BOV) and its soft- and sparse-coding extensions. We propose a novel pooling mechanism that achieves the same effect as max-pooling but is applicable beyond the BOV, and especially to the state-of-the-art Fisher Vector, hence the name Generalized Max Pooling (GMP). It involves equalizing the similarity between each patch and the pooled representation, which is shown to be equivalent to re-weighting the per-patch statistics. We show on five public image classification benchmarks that the proposed GMP can lead to significant performance gains with respect to heuristic alternatives.
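
To make the equalization idea concrete, the following is a minimal sketch, not the authors' implementation. It assumes that similarity is a plain dot product, that the target similarity is the constant 1, and that the resulting linear system is solved in ridge-regularized (dual) form. The function names `sum_pool` and `gmp_pool`, the regularizer `lam`, and the toy descriptors are all hypothetical and do not come from the abstract.

```python
import numpy as np


def sum_pool(phi):
    """Sum-pooling: add up the per-patch embeddings (rows of phi)."""
    return phi.sum(axis=0)


def gmp_pool(phi, lam=1e-3):
    """Sketch of similarity-equalizing pooling (assumed ridge formulation).

    Looks for a pooled vector xi such that phi @ xi is close to 1 for
    every patch, by solving (K + lam * I) alpha = 1 with K = phi @ phi.T
    and setting xi = phi.T @ alpha. The vector alpha holds the per-patch
    weights, so xi is a re-weighted sum of the patch embeddings.
    """
    n = phi.shape[0]
    kernel = phi @ phi.T                      # patch-to-patch similarities
    alpha = np.linalg.solve(kernel + lam * np.eye(n), np.ones(n))
    return phi.T @ alpha, alpha


# Toy data (assumption): nine copies of a frequent descriptor, one rare one.
frequent = np.array([1.0, 0.0])
rare = np.array([0.0, 1.0])
phi = np.vstack([np.tile(frequent, (9, 1)), rare])

xi_sum = sum_pool(phi)
xi_gmp, weights = gmp_pool(phi)

print("sum-pool similarities:", phi @ xi_sum)   # dominated by the frequent patch
print("GMP similarities:     ", phi @ xi_gmp)   # roughly equal for all patches
print("GMP per-patch weights:", weights)        # rare patch gets a larger weight
```

In this toy setting the sum-pooled vector is far more similar to the frequent descriptor than to the rare one, while the re-weighted vector is about equally similar to both, which is the behaviour the abstract attributes to GMP. The choice of `lam` trades exact equalization against numerical stability when many patches are near-duplicates.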

research
08/14/2019

Deep Generalized Max Pooling

Global pooling layers are an essential part of Convolutional Neural Netw...
research
10/15/2014

Efficient Image Categorization with Sparse Fisher Vector

In object recognition, Fisher vector (FV) representation is one of the s...
research
03/03/2020

multi-patch aggregation models for resampling detection

Images captured nowadays are of varying dimensions with smartphones and ...
research
05/31/2021

Analysis of convolutional neural network image classifiers in a hierarchical max-pooling model with additional local pooling

Image classification is considered, and a hierarchical max-pooling model...
research
11/29/2017

Colour Constancy: Biologically-inspired Contrast Variant Pooling Mechanism

Pooling is a ubiquitous operation in image processing algorithms that al...
research
12/30/2014

Domain-Size Pooling in Local Descriptors: DSP-SIFT

We introduce a simple modification of local image descriptors, such as S...
research
03/02/2022

The Theoretical Expressiveness of Maxpooling

Over the decade since deep neural networks became state of the art image...
