Image Transformer has recently achieved significant progress for natural...
Document AI, or Document Intelligence, is a relatively new research topi...
Multimodal pre-training with text, layout, and image has made significan...
Reading order detection is the cornerstone to understanding visually-ric...
Multimodal pre-training with text, layout, and image has achieved SOTA
p...
We propose a new framework for computing the embeddings of large-scale g...
Pre-training of text and layout has proved effective in a variety of
vis...
Document layout analysis usually relies on computer vision models to
und...
Pre-training techniques have been verified successfully in a variety of ...