Efficient On-the-fly Category Retrieval using ConvNets and GPUs

07/17/2014
by   Ken Chatfield, et al.
0

We investigate the gains in precision and speed, that can be obtained by using Convolutional Networks (ConvNets) for on-the-fly retrieval - where classifiers are learnt at run time for a textual query from downloaded images, and used to rank large image or video datasets. We make three contributions: (i) we present an evaluation of state-of-the-art image representations for object category retrieval over standard benchmark datasets containing 1M+ images; (ii) we show that ConvNets can be used to obtain features which are incredibly performant, and yet much lower dimensional than previous state-of-the-art image representations, and that their dimensionality can be reduced further without loss in performance by compression using product quantization or binarization. Consequently, features with the state-of-the-art performance on large-scale datasets of millions of images can fit in the memory of even a commodity GPU card; (iii) we show that an SVM classifier can be learnt within a ConvNet framework on a GPU in parallel with downloading the new training images, allowing for a continuous refinement of the model as more images become available, and simultaneous training and ranking. The outcome is an on-the-fly system that significantly outperforms its predecessors in terms of: precision of retrieval, memory requirements, and speed, facilitating accurate on-the-fly learning and ranking in under a second on a single GPU.

READ FULL TEXT

page 2

page 4

page 7

page 9

page 10

page 12

page 13

page 14

research
12/10/2015

VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products

VRFP is a real-time video retrieval framework based on short text input ...
research
02/06/2023

LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval

Image-text retrieval (ITR) is a task to retrieve the relevant images/tex...
research
04/19/2020

Fast GPU 3D Diffeomorphic Image Registration

3D image registration is one of the most fundamental and computationally...
research
08/26/2023

Learning Efficient Representations for Image-Based Patent Retrieval

Patent retrieval has been attracting tremendous interest from researcher...
research
08/08/2020

GPU-Accelerated Primal Learning for Extremely Fast Large-Scale Classification

One of the most efficient methods to solve L2-regularized primal problem...
research
06/01/2018

Scaling Neural Machine Translation

Sequence to sequence learning models still require several days to reach...
research
08/27/2020

Learning Condition Invariant Features for Retrieval-Based Localization from 1M Images

Image features for retrieval-based localization must be invariant to dyn...

Please sign up or login with your details

Forgot password? Click here to reset