InfographicVQA

04/26/2021
by   Minesh Mathew, et al.
12

Infographics are documents designed to effectively communicate information using a combination of textual, graphical and visual elements. In this work, we explore the automatic understanding of infographic images by using Visual Question Answering technique.To this end, we present InfographicVQA, a new dataset that comprises a diverse collection of infographics along with natural language questions and answers annotations. The collected questions require methods to jointly reason over the document layout, textual content, graphical elements, and data visualizations. We curate the dataset with emphasis on questions that require elementary reasoning and basic arithmetic skills. Finally, we evaluate two strong baselines based on state of the art multi-modal VQA models, and establish baseline performance for the new task. The dataset, code and leaderboard will be made available at http://docvqa.org

READ FULL TEXT

page 1

page 4

page 15

page 17

page 18

page 19

page 21

page 22

research
04/23/2020

Visual Question Answering Using Semantic Information from Image Descriptions

Visual question answering (VQA) is a task that requires AI systems to di...
research
04/13/2023

PDFVQA: A New Dataset for Real-World VQA on PDF Documents

Document-based Visual Question Answering examines the document understan...
research
04/27/2021

Document Collection Visual Question Answering

Current tasks and methods in Document Understanding aims to process docu...
research
07/25/2022

Towards Complex Document Understanding By Discrete Reasoning

Document Visual Question Answering (VQA) aims to understand visually-ric...
research
08/03/2021

CanvasVAE: Learning to Generate Vector Graphic Documents

Vector graphic documents present visual elements in a resolution free, c...
research
10/19/2017

FigureQA: An Annotated Figure Dataset for Visual Reasoning

We introduce FigureQA, a visual reasoning corpus of over one million que...
research
07/05/2023

Line Graphics Digitization: A Step Towards Full Automation

The digitization of documents allows for wider accessibility and reprodu...

Please sign up or login with your details

Forgot password? Click here to reset