Document Visual Question Answering Challenge 2020

08/20/2020
by Minesh Mathew, et al.

This paper presents the results of the Document Visual Question Answering Challenge, organized as part of the "Text and Documents in the Deep Learning Era" workshop at CVPR 2020. The challenge introduces a new problem: Visual Question Answering on document images. It comprised two tasks. The first task concerns asking questions on a single document image. The second task is set up as a retrieval task, in which the question is posed over a collection of images. For Task 1, a new dataset is introduced comprising 50,000 question-answer(s) pairs defined over 12,767 document images. For Task 2, another dataset was created comprising 20 questions over 14,362 document images that share the same document template.


research
04/27/2021

Document Collection Visual Question Answering

Current tasks and methods in Document Understanding aims to process docu...
research
11/10/2021

ICDAR 2021 Competition on Document VisualQuestion Answering

In this report we present results of the ICDAR 2021 edition of the Docum...
research
03/27/2023

TabIQA: Table Questions Answering on Business Document Images

Table answering questions from business documents has many challenges th...
research
05/31/2015

Visual Madlibs: Fill in the blank Image Generation and Question Answering

In this paper, we introduce a new dataset consisting of 360,001 focused ...
research
05/23/2023

DUBLIN – Document Understanding By Language-Image Network

Visual document understanding is a complex task that involves analyzing ...
research
12/05/2022

QBERT: Generalist Model for Processing Questions

Using a single model across various tasks is beneficial for training and...
