Language bias in Visual Question Answering: A Survey and Taxonomy

11/16/2021
by   Desen Yuan, et al.
0

Visual question answering (VQA) is a challenging task, which has attracted more and more attention in the field of computer vision and natural language processing. However, the current visual question answering has the problem of language bias, which reduces the robustness of the model and has an adverse impact on the practical application of visual question answering. In this paper, we conduct a comprehensive review and analysis of this field for the first time, and classify the existing methods according to three categories, including enhancing visual information, weakening language priors, data enhancement and training strategies. At the same time, the relevant representative methods are introduced, summarized and analyzed in turn. The causes of language bias are revealed and classified. Secondly, this paper introduces the datasets mainly used for testing, and reports the experimental results of various existing methods. Finally, we discuss the possible future research directions in this field.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2016

Visual Question Answering: Datasets, Algorithms, and Future Challenges

Visual Question Answering (VQA) is a recent problem in computer vision a...
research
05/10/2017

Survey of Visual Question Answering: Datasets and Techniques

Visual question answering (or VQA) is a new and exciting problem that co...
research
07/21/2023

Robust Visual Question Answering: Datasets, Methods, and Future Challenges

Visual question answering requires a system to provide an accurate natur...
research
05/18/2023

Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature

Visual Question Answering (VQA) is an emerging area of interest for rese...
research
01/15/2021

Recent Advances in Video Question Answering: A Review of Datasets and Methods

Video Question Answering (VQA) is a recent emerging challenging task in ...
research
03/22/2020

Visual Question Answering for Cultural Heritage

Technology and the fruition of cultural heritage are becoming increasing...
research
04/03/2022

Adjusting for Bias with Procedural Data

3D softwares are now capable of producing highly realistic images that l...

Please sign up or login with your details

Forgot password? Click here to reset