Orientation covariant aggregation of local descriptors with embeddings

07/08/2014
by   Giorgos Tolias, et al.
0

Image search systems based on local descriptors typically achieve orientation invariance by aligning the patches on their dominant orientations. Albeit successful, this choice introduces too much invariance because it does not guarantee that the patches are rotated consistently. This paper introduces an aggregation strategy of local descriptors that achieves this covariance property by jointly encoding the angle in the aggregation stage in a continuous manner. It is combined with an efficient monomial embedding to provide a codebook-free method to aggregate local descriptors into a single vector representation. Our strategy is also compatible and employed with several popular encoding methods, in particular bag-of-words, VLAD and the Fisher vector. Our geometric-aware aggregation strategy is effective for image search, as shown by experiments performed on standard benchmarks for image and particular object retrieval, namely Holidays and Oxford buildings.

READ FULL TEXT
research
11/24/2016

Interferences in match kernels

We consider the design of an image representation that embeds and aggreg...
research
06/26/2017

Learning Local Feature Aggregation Functions with Backpropagation

This paper introduces a family of local feature aggregation functions an...
research
10/24/2019

ProLFA: Representative Prototype Selection for Local Feature Aggregation

Given a set of hand-crafted local features, acquiring a global represent...
research
07/17/2020

Online Invariance Selection for Local Feature Descriptors

To be invariant, or not to be invariant: that is the question formulated...
research
09/01/2016

Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition

Traditional feature encoding scheme (e.g., Fisher vector) with local des...
research
01/19/2021

Hyperdimensional computing as a framework for systematic aggregation of image descriptors

Image and video descriptors are an omnipresent tool in computer vision a...
research
10/03/2015

Approximate Fisher Kernels of non-iid Image Models for Image Categorization

The bag-of-words (BoW) model treats images as sets of local descriptors ...

Please sign up or login with your details

Forgot password? Click here to reset