VQA and Visual Reasoning: An Overview of Recent Datasets, Methods and Challenges

12/26/2022
by   Rufai Yusuf Zakari, et al.
0

Artificial Intelligence (AI) and its applications have sparked extraordinary interest in recent years. This achievement can be ascribed in part to advances in AI subfields including Machine Learning (ML), Computer Vision (CV), and Natural Language Processing (NLP). Deep learning, a sub-field of machine learning that employs artificial neural network concepts, has enabled the most rapid growth in these domains. The integration of vision and language has sparked a lot of attention as a result of this. The tasks have been created in such a way that they properly exemplify the concepts of deep learning. In this review paper, we provide a thorough and an extensive review of the state of the arts approaches, key models design principles and discuss existing datasets, methods, their problem formulation and evaluation measures for VQA and Visual reasoning tasks to understand vision and language representation learning. We also present some potential future paths in this field of research, with the hope that our study may generate new ideas and novel approaches to handle existing difficulties and develop new applications.

READ FULL TEXT

page 7

page 8

page 25

page 28

research
12/20/2022

A Survey of Deep Learning for Mathematical Reasoning

Mathematical reasoning is a fundamental aspect of human intelligence and...
research
11/29/2021

Collective Intelligence for Deep Learning: A Survey of Recent Developments

In the past decade, we have witnessed the rise of deep learning to domin...
research
07/22/2019

Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods

Integration of vision and language tasks has seen a significant growth i...
research
01/13/2022

Fantastic Data and How to Query Them

It is commonly acknowledged that the availability of the huge amount of ...
research
10/29/2021

Systematic Review for AI-based Language Learning Tools

The Second Language Acquisition field has been significantly impacted by...
research
06/23/2015

A Survey of Current Datasets for Vision and Language Research

Integrating vision and language has long been a dream in work on artific...
research
11/16/2020

Deep Learning – A first Meta-Survey of selected Reviews across Scientific Disciplines and their Research Impact

Deep learning belongs to the field of artificial intelligence, where mac...

Please sign up or login with your details

Forgot password? Click here to reset