Modular Multimodal Architecture for Document Classification

12/09/2019
by   Tyler Dauphinee, et al.
0

Page classification is a crucial component to any document analysis system, allowing for complex branching control flows for different components of a given document. Utilizing both the visual and textual content of a page, the proposed method exceeds the current state-of-the-art performance on the RVL-CDIP benchmark at 93.03

READ FULL TEXT

page 3

page 4

research
08/24/2023

Beyond Document Page Classification: Design, Datasets, and Challenges

This paper highlights the need to bring document classification benchmar...
research
01/24/2022

Importance of Textlines in Historical Document Classification

This paper describes a system prepared at Brno University of Technology ...
research
09/03/2021

Navigating the Mise-en-Page: Interpretive Machine Learning Approaches to the Visual Layouts of Multi-Ethnic Periodicals

This paper presents a computational method of analysis that draws from m...
research
06/24/2015

Unshredding of Shredded Documents: Computational Framework and Implementation

A shredded document D is a document whose pages have been cut into strip...
research
07/02/2022

Sequence-aware multimodal page classification of Brazilian legal documents

The Brazilian Supreme Court receives tens of thousands of cases each sem...
research
06/11/2021

Generalized Moving Peaks Benchmark

This document describes the Generalized Moving Peaks Benchmark (GMPB) th...
research
10/09/2017

Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features

For digitization of paper files via OCR, preservation of document contex...

Please sign up or login with your details

Forgot password? Click here to reset