TopicFM+: Boosting Accuracy and Efficiency of Topic-Assisted Feature Matching

07/02/2023
by   Khang Truong Giang, et al.
0

This study tackles the challenge of image matching in difficult scenarios, such as scenes with significant variations or limited texture, with a strong emphasis on computational efficiency. Previous studies have attempted to address this challenge by encoding global scene contexts using Transformers. However, these approaches suffer from high computational costs and may not capture sufficient high-level contextual information, such as structural shapes or semantic instances. Consequently, the encoded features may lack discriminative power in challenging scenes. To overcome these limitations, we propose a novel image-matching method that leverages a topic-modeling strategy to capture high-level contexts in images. Our method represents each image as a multinomial distribution over topics, where each topic represents a latent semantic instance. By incorporating these topics, we can effectively capture comprehensive context information and obtain discriminative and high-quality features. Additionally, our method effectively matches features within corresponding semantic regions by estimating the covisible topics. To enhance the efficiency of feature matching, we have designed a network with a pooling-and-merging attention module. This module reduces computation by employing attention only on fixed-sized topics and small-sized features. Through extensive experiments, we have demonstrated the superiority of our method in challenging scenarios. Specifically, our method significantly reduces computational costs while maintaining higher image-matching accuracy compared to state-of-the-art methods. The code will be updated soon at https://github.com/TruongKhang/TopicFM

READ FULL TEXT

page 1

page 4

page 7

page 9

page 10

page 12

research
07/01/2022

TopicFM: Robust and Interpretable Feature Matching with Topic-assisted

Finding correspondences across images is an important task in many visua...
research
03/06/2021

Learning Statistical Texture for Semantic Segmentation

Existing semantic segmentation works mainly focus on learning the contex...
research
08/27/2018

simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions

The encode-decoder framework has shown recent success in image captionin...
research
03/10/2021

AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing

Two factors have proven to be very important to the performance of seman...
research
05/23/2019

Multi-level Texture Encoding and Representation (MuLTER) based on Deep Neural Networks

In this paper, we propose a multi-level texture encoding and representat...
research
06/22/2021

P2T: Pyramid Pooling Transformer for Scene Understanding

This paper jointly resolves two problems in vision transformer: i) the c...
research
11/08/2022

Submission-Aware Reviewer Profiling for Reviewer Recommender System

Assigning qualified, unbiased and interested reviewers to paper submissi...

Please sign up or login with your details

Forgot password? Click here to reset