Progressive Scale-aware Network for Remote sensing Image Change Captioning

03/01/2023
by   Chenyang Liu, et al.
0

Remote sensing (RS) images contain numerous objects of different scales, which poses significant challenges for the RS image change captioning (RSICC) task to identify visual changes of interest in complex scenes and describe them via language. However, current methods still have some weaknesses in sufficiently extracting and utilizing multi-scale information. In this paper, we propose a progressive scale-aware network (PSNet) to address the problem. PSNet is a pure Transformer-based model. To sufficiently extract multi-scale visual features, multiple progressive difference perception (PDP) layers are stacked to progressively exploit the differencing features of bitemporal features. To sufficiently utilize the extracted multi-scale features for captioning, we propose a scale-aware reinforcement (SR) module and combine it with the Transformer decoding layer to progressively utilize the features from different PDP layers. Experiments show that the PDP layer and SR module are effective and our PSNet outperforms previous methods.

READ FULL TEXT
research
10/14/2022

MCTNet: A Multi-Scale CNN-Transformer Network for Change Detection in Optical Remote Sensing Images

For the task of change detection (CD) in remote sensing images, deep con...
research
09/04/2023

Adapting Segment Anything Model for Change Detection in HR Remote Sensing Images

Vision Foundation Models (VFMs) such as the Segment Anything Model (SAM)...
research
04/21/2022

Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval

Remote sensing (RS) cross-modal text-image retrieval has attracted exten...
research
02/21/2023

HCGMNET: A Hierarchical Change Guiding Map Network For Change Detection

Very-high-resolution (VHR) remote sensing (RS) image change detection (C...
research
06/03/2023

Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection

Popular Transformer networks have been successfully applied to remote se...
research
08/18/2021

WRICNet:A Weighted Rich-scale Inception Coder Network for Multi-Resolution Remote Sensing Image Change Detection

Majority models of remote sensing image changing detection can only get ...
research
04/12/2023

APPLeNet: Visual Attention Parameterized Prompt Learning for Few-Shot Remote Sensing Image Generalization using CLIP

In recent years, the success of large-scale vision-language models (VLMs...

Please sign up or login with your details

Forgot password? Click here to reset