Document AI: Benchmarks, Models and Applications

11/16/2021
by Lei Cui, et al.

Document AI, or Document Intelligence, is a relatively new research topic that refers to techniques for automatically reading, understanding, and analyzing business documents. It is an important research direction for natural language processing and computer vision. In recent years, the popularity of deep learning technology has greatly advanced Document AI tasks such as document layout analysis, visual information extraction, document visual question answering, and document image classification. This paper briefly reviews some of the representative models, tasks, and benchmark datasets. We also introduce early-stage heuristic rule-based document analysis, statistical machine learning algorithms, and deep learning approaches, especially pre-training methods. Finally, we look into future directions for Document AI research.

research
04/21/2023

Information Extraction from Documents: Question Answering vs Token Classification in real-world setups

Research in Document Intelligence and especially in Document Key Informa...
research
11/27/2020

A Survey of Deep Learning Approaches for OCR and Document Understanding

Documents are a core part of many businesses in many fields such as law,...
research
02/15/2020

Historical Document Processing: A Survey of Techniques, Tools, and Trends

Historical Document Processing is the process of digitizing written mate...
research
03/16/2022

A Survey of Historical Document Image Datasets

This paper presents a systematic literature review of image datasets for...
research
05/24/2023

Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models

Advances in Large Language Models (LLMs) have inspired a surge of resear...
research
11/30/2021

Donut: Document Understanding Transformer without OCR

Understanding document images (e.g., invoices) has been an important res...
