Approximate Fisher Kernels of non-iid Image Models for Image Categorization

10/03/2015
by   Ramazan Gokberk Cinbis, et al.
0

The bag-of-words (BoW) model treats images as sets of local descriptors and represents them by visual word histograms. The Fisher vector (FV) representation extends BoW, by considering the first and second order statistics of local descriptors. In both representations local descriptors are assumed to be identically and independently distributed (iid), which is a poor assumption from a modeling perspective. It has been experimentally observed that the performance of BoW and FV representations can be improved by employing discounting transformations such as power normalization. In this paper, we introduce non-iid models by treating the model parameters as latent variables which are integrated out, rendering all local regions dependent. Using the Fisher kernel principle we encode an image by the gradient of the data log-likelihood w.r.t. the model hyper-parameters. Our models naturally generate discounting effects in the representations; suggesting that such transformations have proven successful because they closely correspond to the representations obtained for non-iid models. To enable tractable computation, we rely on variational free-energy bounds to learn the hyper-parameters and to compute approximate Fisher kernels. Our experimental evaluation results validate that our models lead to performance improvements comparable to using power normalization, as employed in state-of-the-art feature aggregation methods.

READ FULL TEXT

page 1

page 2

page 3

page 14

research
11/24/2016

Interferences in match kernels

We consider the design of an image representation that embeds and aggreg...
research
06/26/2017

Learning Local Feature Aggregation Functions with Backpropagation

This paper introduces a family of local feature aggregation functions an...
research
12/21/2020

Nonstationarity Analysis of Materials Microstructures via Fisher Score Vectors

Microstructures are critical to the physical properties of materials. St...
research
10/15/2014

Efficient Image Categorization with Sparse Fisher Vector

In object recognition, Fisher vector (FV) representation is one of the s...
research
04/15/2016

Probing the Intra-Component Correlations within Fisher Vector for Material Classification

Fisher vector (FV) has become a popular image representation. One notabl...
research
07/08/2014

Orientation covariant aggregation of local descriptors with embeddings

Image search systems based on local descriptors typically achieve orient...
research
03/13/2015

Hybrid multi-layer Deep CNN/Aggregator feature for image classification

Deep Convolutional Neural Networks (DCNN) have established a remarkable ...

Please sign up or login with your details

Forgot password? Click here to reset