Multilingual Augmentation for Robust Visual Question Answering in Remote Sensing Images

04/07/2023
by   Zhenghang Yuan, et al.
0

Aiming at answering questions based on the content of remotely sensed images, visual question answering for remote sensing data (RSVQA) has attracted much attention nowadays. However, previous works in RSVQA have focused little on the robustness of RSVQA. As we aim to enhance the reliability of RSVQA models, how to learn robust representations against new words and different question templates with the same meaning is the key challenge. With the proposed augmented dataset, we are able to obtain more questions in addition to the original ones with the same meaning. To make better use of this information, in this study, we propose a contrastive learning strategy for training robust RSVQA models against diverse question templates and words. Experimental results demonstrate that the proposed augmented dataset is effective in improving the robustness of the RSVQA model. In addition, the contrastive learning strategy performs well on the low resolution (LR) dataset.

READ FULL TEXT
research
03/16/2020

RSVQA: Visual Question Answering for Remote Sensing Data

This paper introduces the task of visual question answering for remote s...
research
06/25/2023

Visual Question Answering in Remote Sensing with Cross-Attention and Multimodal Information Bottleneck

In this research, we deal with the problem of visual question answering ...
research
05/06/2022

From Easy to Hard: Learning Language-guided Curriculum for Visual Question Answering on Remote Sensing Data

Visual question answering (VQA) for remote sensing scene has great poten...
research
07/02/2020

IIE-NLP-NUT at SemEval-2020 Task 4: Guiding PLM with Prompt Template Reconstruction Strategy for ComVE

This paper introduces our systems for the first two subtasks of SemEval ...
research
12/31/2018

The meaning of "most" for visual question answering models

The correct interpretation of quantifier statements in the context of a ...
research
12/05/2018

Are you tough enough? Framework for Robustness Validation of Machine Comprehension Systems

Deep Learning NLP domain lacks procedures for the analysis of model robu...
research
03/26/2022

EYNet: Extended YOLO for Airport Detection in Remote Sensing Images

Nowadays, airport detection in remote sensing images has attracted consi...

Please sign up or login with your details

Forgot password? Click here to reset