BERTgrid: Contextualized Embedding for 2D Document Representation and Understanding

09/11/2019
by   Timo I. Denk, et al.
0

For understanding generic documents, information like font sizes, column layout, and generally the positioning of words may carry semantic information that is crucial for solving a downstream document intelligence task. Our novel BERTgrid, which is based on Chargrid by Katti et al. (2018), represents a document as a grid of contextualized word piece embedding vectors, thereby making its spatial structure and semantics accessible to the processing neural network. The contextualized embedding vectors are retrieved from a BERT language model. We use BERTgrid in combination with a fully convolutional network on a semantic instance segmentation task for extracting fields from invoices. We demonstrate its performance on tabulated line item and document header field extraction.

READ FULL TEXT
research
09/24/2018

Chargrid: Towards Understanding 2D Documents

We introduce a novel type of text representation that preserves the 2D l...
research
06/07/2017

Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Network

We present an end-to-end, multimodal, fully convolutional network for ex...
research
05/13/2021

VSR: A Unified Framework for Document Layout Analysis combining Vision, Semantics and Relations

Document layout analysis is crucial for understanding document structure...
research
07/16/2021

The Law of Large Documents: Understanding the Structure of Legal Contracts Using Visual Cues

Large, pre-trained transformer models like BERT have achieved state-of-t...
research
05/25/2021

ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents

Recent grid-based document representations like BERTgrid allow the simul...
research
11/11/2017

Learning Document Embeddings With CNNs

We propose a new model for unsupervised document embedding. Existing app...
research
11/28/2017

Semantic Technology-Assisted Review (STAR) Document analysis and monitoring using random vectors

The review and analysis of large collections of documents and the period...

Please sign up or login with your details

Forgot password? Click here to reset