Learning ordered pooling weights in image classification

07/02/2020
by   J. I. Forcen, et al.
0

Spatial pooling is an important step in computer vision systems like Convolutional Neural Networks or the Bag-of-Words method. The spatial pooling purpose is to combine neighbouring descriptors to obtain a single descriptor for a given region (local or global). The resultant combined vector must be as discriminant as possible, in other words, must contain relevant information, while removing irrelevant and confusing details. Maximum and average are the most common aggregation functions used in the pooling step. To improve the aggregation of relevant information without degrading their discriminative power for image classification, we introduce a simple but effective scheme based on Ordered Weighted Average (OWA) aggregation operators. We present a method to learn the weights of the OWA aggregation operator in a Bag-of-Words framework and in Convolutional Neural Networks, and provide an extensive evaluation showing that OWA based pooling outperforms classical aggregation operators.

READ FULL TEXT

page 10

page 11

research
12/30/2014

Domain-Size Pooling in Local Descriptors: DSP-SIFT

We introduce a simple modification of local image descriptors, such as S...
research
07/23/2014

Visual Word Selection without Re-Coding and Re-Pooling

The Bag-of-Words (BoW) representation is widely used in computer vision....
research
01/16/2013

Stochastic Pooling for Regularization of Deep Convolutional Neural Networks

We introduce a simple and effective method for regularizing large convol...
research
06/10/2019

A Closed-Form Learned Pooling for Deep Classification Networks

In modern computer vision tasks, convolutional neural networks (CNNs) ar...
research
04/08/2018

Ordinal Pooling Networks: For Preserving Information over Shrinking Feature Maps

In the framework of convolutional neural networks that lie at the heart ...
research
01/15/2013

Auto-pooling: Learning to Improve Invariance of Image Features from Image Sequences

Learning invariant representations from images is one of the hardest cha...
research
02/27/2020

RNNPool: Efficient Non-linear Pooling for RAM Constrained Inference

Pooling operators are key components in most Convolutional Neural Networ...

Please sign up or login with your details

Forgot password? Click here to reset