AutoScaler: Scale-Attention Networks for Visual Correspondence

11/17/2016
by Shenlong Wang, et al.

Finding visual correspondence between local features is key to many computer vision problems. While defining features with larger contextual scales usually makes them more discriminative, it can also reduce their spatial accuracy. We propose AutoScaler, a scale-attention network that explicitly optimizes this trade-off in visual correspondence tasks. Our network consists of a weight-sharing feature network that computes multi-scale feature maps and an attention network that combines them optimally in scale space. This allows our network to have adaptive receptive field sizes over different scales of the input. The entire network is trained end-to-end in a Siamese framework for visual correspondence tasks. Our method achieves favorable results compared to state-of-the-art methods on challenging optical flow and semantic matching benchmarks, including Sintel, KITTI and CUB-2011. We also show that our method can generalize to improve hand-crafted descriptors (e.g., DAISY) on general visual correspondence tasks. Finally, our attention network can generate visually interpretable scale attention maps.
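The idea of combining multi-scale feature maps with per-pixel attention can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how the attention weights are normalized, so the softmax over scales used here is an assumption, and the names `softmax`, `fuse_scales`, `features` and `logits` are hypothetical. The feature vectors and attention scores are toy values standing in for the outputs of the feature network and the attention network.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_scales(features, logits):
    """Combine per-scale descriptors with per-pixel attention weights.

    features[s][p] is the descriptor (list of floats) at scale s, pixel p.
    logits[s][p] is the unnormalized attention score at scale s, pixel p.
    Returns the fused descriptors and the attention weights, per pixel.
    Assumption: weights are a softmax over scales (not stated in the abstract).
    """
    num_scales = len(features)
    num_pixels = len(features[0])
    fused, weights = [], []
    for p in range(num_pixels):
        # Normalize the scores of this pixel across all scales.
        w = softmax([logits[s][p] for s in range(num_scales)])
        dim = len(features[0][p])
        # Weighted sum of the descriptors from every scale.
        vec = [
            sum(w[s] * features[s][p][d] for s in range(num_scales))
            for d in range(dim)
        ]
        fused.append(vec)
        weights.append(w)
    return fused, weights


# Toy input: 2 scales, 2 pixels, 2-dimensional descriptors.
features = [
    [[1.0, 0.0], [1.0, 0.0]],  # fine scale
    [[0.0, 1.0], [0.0, 1.0]],  # coarse scale
]
logits = [
    [0.0, 0.0],   # fine-scale scores for pixel 0 and pixel 1
    [0.0, 20.0],  # coarse-scale scores for pixel 0 and pixel 1
]
fused, weights = fuse_scales(features, logits)
```

In this toy input, pixel 0 has equal scores at both scales and so receives an even blend of the fine and coarse descriptors, while pixel 1 is dominated by the coarse scale. The per-pixel `weights` play the role of the scale attention maps mentioned in the abstract.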


research
07/31/2021

Multi-scale Matching Networks for Semantic Correspondence

Deep features have been proven powerful in building accurate dense seman...
research
07/23/2017

Deep Optical Flow Estimation Via Multi-Scale Correspondence Structure Learning

As an important and challenging problem in computer vision, learning bas...
research
08/03/2018

Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification

Local features at neighboring spatial positions in feature maps have hig...
research
01/20/2020

BARNet: Bilinear Attention Network with Adaptive Receptive Field for Surgical Instrument Segmentation

Surgical instrument segmentation is extremely important for computer-ass...
research
05/23/2023

Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence

Diffusion models have been shown to be capable of generating high-qualit...
research
01/17/2019

SAFE: Scale Aware Feature Encoder for Scene Text Recognition

In this paper, we address the problem of having characters with differen...
research
12/20/2021

Contrastive Attention Network with Dense Field Estimation for Face Completion

Most modern face completion approaches adopt an autoencoder or its varia...
