VisualWordGrid: Information Extraction From Scanned Documents Using A Multimodal Approach

10/07/2020
by   aymen shabou, et al.
0

We introduce a novel approach for scanned document representation to perform fields extraction task. It allows the simultaneous encoding of the textual, visual and layout information in a 3D matrix used as an input to a segmentation model. We improve the recent Chargrid and Wordgrid models in several directions, first by taking into account the visual modality, then by boosting its robustness toward small datasets, while keeping the inference time low. Our approach is tested on public and private document image datasets, showing higher performances compared to the recent state-of-the-art methods.

READ FULL TEXT

page 3

page 5

research
05/25/2021

ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents

Recent grid-based document representations like BERTgrid allow the simul...
research
08/23/2021

Using Neighborhood Context to Improve Information Extraction from Visual Documents Captured on Mobile Phones

Information Extraction from visual documents enables convenient and inte...
research
07/18/2023

Multimodal Machine Learning for Extraction of Theorems and Proofs in the Scientific Literature

Scholarly articles in mathematical fields feature mathematical statement...
research
05/18/2020

Single-sample writers – "Document Filter" and their impacts on writer identification

The writing can be used as an important biometric modality which allows ...
research
05/17/2022

MATrIX – Modality-Aware Transformer for Information eXtraction

We present MATrIX - a Modality-Aware Transformer for Information eXtract...
research
01/06/2021

On-Device Document Classification using multimodal features

From small screenshots to large videos, documents take up a bulk of spac...
research
08/28/2023

Ensemble of Anchor-Free Models for Robust Bangla Document Layout Segmentation

In this research paper, we introduce a novel approach designed for the p...

Please sign up or login with your details

Forgot password? Click here to reset