Information Extraction from Documents: Question Answering vs Token Classification in real-world setups

04/21/2023
by   Laurent Lam, et al.
0

Research in Document Intelligence and especially in Document Key Information Extraction (DocKIE) has been mainly solved as Token Classification problem. Recent breakthroughs in both natural language processing (NLP) and computer vision helped building document-focused pre-training methods, leveraging a multimodal understanding of the document text, layout and image modalities. However, these breakthroughs also led to the emergence of a new DocKIE subtask of extractive document Question Answering (DocQA), as part of the Machine Reading Comprehension (MRC) research field. In this work, we compare the Question Answering approach with the classical token classification approach for document key information extraction. We designed experiments to benchmark five different experimental setups : raw performances, robustness to noisy environment, capacity to extract long entities, fine-tuning speed on Few-Shot Learning and finally Zero-Shot Learning. Our research showed that when dealing with clean and relatively short entities, it is still best to use token classification-based approach, while the QA approach could be a good alternative for noisy environment or long entities use-cases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2023

Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering

The pre-training-fine-tuning paradigm based on layout-aware multimodal p...
research
11/16/2021

Document AI: Benchmarks, Models and Applications

Document AI, or Document Intelligence, is a relatively new research topi...
research
04/05/2020

Natural language processing for word sense disambiguation and information extraction

This research work deals with Natural Language Processing (NLP) and extr...
research
08/21/2023

DocPrompt: Large-scale continue pretrain for zero-shot and few-shot document question answering

In this paper, we propose Docprompt for document question answering task...
research
08/13/2021

Zero-shot Task Transfer for Invoice Extraction via Class-aware QA Ensemble

We present VESPA, an intentionally simple yet novel zero-shot system for...
research
05/23/2023

DUBLIN – Document Understanding By Language-Image Network

Visual document understanding is a complex task that involves analyzing ...
research
11/07/2021

Information Extraction from Visually Rich Documents with Font Style Embeddings

Information extraction (IE) from documents is an intensive area of resea...

Please sign up or login with your details

Forgot password? Click here to reset