Large-Scale Image Retrieval with Attentive Deep Local Features

12/19/2016
by Hyeonwoo Noh, et al.

We propose an attentive local feature descriptor suitable for large-scale image retrieval, referred to as DELF (DEep Local Feature). The new feature is based on convolutional neural networks, which are trained only with image-level annotations on a landmark image dataset. To identify semantically useful local features for image retrieval, we also propose an attention mechanism for keypoint selection, which shares most network layers with the descriptor. This framework can be used for image retrieval as a drop-in replacement for other keypoint detectors and descriptors, enabling more accurate feature matching and geometric verification. Our system produces reliable confidence scores to reject false positives; in particular, it is robust against queries that have no correct match in the database. To evaluate the proposed descriptor, we introduce a new large-scale dataset, referred to as the Google-Landmarks dataset, which involves challenges in both database and query images such as background clutter, partial occlusion, multiple landmarks, and objects at variable scales. We show that DELF outperforms state-of-the-art global and local descriptors in the large-scale setting by significant margins.
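The keypoint-selection idea described above can be illustrated with a minimal sketch: each dense local feature is given a relevance score, and only the highest-scoring features are kept as keypoints. Everything in this sketch is an assumption made for illustration and is not taken from the paper: the function name `select_keypoints`, the linear scoring weights `attention_weights`, the softplus used to keep scores non-negative, and the toy feature values. In the actual system the scores would come from a trained network that shares layers with the descriptor.

```python
import math


def softplus(x):
    # Numerically stable softplus: log(1 + exp(x)); always non-negative.
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def select_keypoints(features, attention_weights, top_k):
    """Score dense local features and keep the top_k highest-scoring ones.

    features: list of (location, descriptor) pairs, where descriptor is a
        list of floats (hypothetical stand-in for a CNN feature map cell).
    attention_weights: hypothetical linear scoring weights, one per
        descriptor dimension (stand-in for a learned attention module).
    Returns a list of (score, location, descriptor), best score first.
    """
    scored = []
    for location, descriptor in features:
        logit = sum(w * x for w, x in zip(attention_weights, descriptor))
        scored.append((softplus(logit), location, descriptor))
    # Sort by score only, highest first, then truncate to top_k.
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:top_k]


# Toy example: three feature-map locations with 2-D descriptors.
toy_features = [
    ((0, 0), [0.1, 0.2]),
    ((0, 1), [2.0, 1.0]),
    ((1, 0), [-1.0, 0.5]),
]
toy_weights = [1.0, -0.5]
kept = select_keypoints(toy_features, toy_weights, top_k=2)
kept_locations = [location for _, location, _ in kept]
```

The descriptors at the kept locations would then be passed on to feature matching and geometric verification, while low-scoring locations such as background clutter are discarded.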


