Document Understanding Dataset and Evaluation (DUDE)

05/15/2023
by   Jordy Landeghem, et al.
0

We call on the Document AI (DocAI) community to reevaluate current methodologies and embrace the challenge of creating more practically-oriented benchmarks. Document Understanding Dataset and Evaluation (DUDE) seeks to remediate the halted research progress in understanding visually-rich documents (VRDs). We present a new dataset with novelties related to types of questions, answers, and document layouts based on multi-industry, multi-domain, and multi-page VRDs of various origins, and dates. Moreover, we are pushing the boundaries of current methods by creating multi-task and multi-domain evaluation setups that more accurately simulate real-world situations where powerful generalization and adaptation under low-resource settings are desired. DUDE aims to set a new standard as a more practical, long-standing benchmark for the community, and we hope that it will lead to future extensions and contributions that address real-world challenges. Finally, our work illustrates the importance of finding more efficient ways to model language, images, and layout in DocAI.

READ FULL TEXT

page 16

page 22

research
04/13/2023

PDFVQA: A New Dataset for Real-World VQA on PDF Documents

Document-based Visual Question Answering examines the document understan...
research
07/25/2022

Towards Complex Document Understanding By Discrete Reasoning

Document Visual Question Answering (VQA) aims to understand visually-ric...
research
08/24/2023

Beyond Document Page Classification: Design, Datasets, and Challenges

This paper highlights the need to bring document classification benchmar...
research
07/31/2023

Workshop on Document Intelligence Understanding

Document understanding and information extraction include different task...
research
11/20/2019

Table-Of-Contents generation on contemporary documents

The generation of precise and detailed Table-Of-Contents (TOC) from a do...
research
08/21/2023

bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali Documents

Despite the existence of numerous Optical Character Recognition (OCR) to...
research
10/09/2017

Grand Challenges of Traceability: The Next Ten Years

In 2007, the software and systems traceability community met at the firs...

Please sign up or login with your details

Forgot password? Click here to reset