DenserNet: Weakly Supervised Visual Localization Using Multi-scale Feature Aggregation

12/04/2020
by   Dongfang Liu, et al.
1

In this work, we introduce a Denser Feature Network (DenserNet) for visual localization. Our work provides three principal contributions. First, we develop a convolutional neural network (CNN) architecture which aggregates feature maps at different semantic levels for image representations. Using denser feature maps, our method can produce more keypoint features and increase image retrieval accuracy. Second, our model is trained end-to-end without pixel-level annotation other than positive and negative GPS-tagged image pairs. We use a weakly supervised triplet ranking loss to learn discriminative features and encourage keypoint feature repeatability for image representation. Finally, our method is computationally efficient as our architecture has shared features and parameters during computation. Our method can perform accurate large-scale localization under challenging conditions while remaining the computational constraint. Extensive experiment results indicate that our method sets a new state-of-the-art on four challenging large-scale localization benchmarks and three image retrieval benchmarks.

READ FULL TEXT

page 1

page 2

page 6

page 7

page 10

page 11

research
02/18/2021

Hierarchical Attention Fusion for Geo-Localization

Geo-localization is a critical task in computer vision. In this work, we...
research
04/21/2020

Image Retrieval using Multi-scale CNN Features Pooling

In this paper, we address the problem of image retrieval by learning ima...
research
11/19/2018

Weakly Supervised Soft-detection-based Aggregation Method for Image Retrieval

In recent year, the compact representations based on activations of Conv...
research
08/27/2020

Learning Condition Invariant Features for Retrieval-Based Localization from 1M Images

Image features for retrieval-based localization must be invariant to dyn...
research
07/12/2019

ACTNET: end-to-end learning of feature activations and multi-stream aggregation for effective instance image retrieval

We propose a novel CNN architecture called ACTNET for robust instance im...
research
08/09/2016

OnionNet: Sharing Features in Cascaded Deep Classifiers

The focus of our work is speeding up evaluation of deep neural networks ...
research
07/12/2019

ACTNET: end-to-end learning of feature activations and aggregation for effective instance image retrieval

We propose a novel CNN architecture called ACTNET for robust instance im...

Please sign up or login with your details

Forgot password? Click here to reset