Yiheng Xu

research

∙ 03/04/2022

DiT: Self-supervised Pre-training for Document Image Transformer

Image Transformer has recently achieved significant progress for natural...

Junlong Li, et al.

research

∙ 11/16/2021

Document AI: Benchmarks, Models and Applications

Document AI, or Document Intelligence, is a relatively new research topi...

Lei Cui, et al.

research

∙ 10/16/2021

MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding

Multimodal pre-training with text, layout, and image has made significan...

Junlong Li, et al.

research

∙ 08/26/2021

LayoutReader: Pre-training of Text and Layout for Reading Order Detection

Reading order detection is the cornerstone to understanding visually-ric...

Zilong Wang, et al.

research

∙ 04/18/2021

LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding

Multimodal pre-training with text, layout, and image has achieved SOTA p...

Yiheng Xu, et al.

research

∙ 01/20/2021

Marius: Learning Massive Graph Embeddings on a Single Machine

We propose a new framework for computing the embeddings of large-scale g...

Jason Mohoney, et al.

research

∙ 12/29/2020

LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding

Pre-training of text and layout has proved effective in a variety of vis...

Yang Xu, et al.

research

∙ 06/01/2020

DocBank: A Benchmark Dataset for Document Layout Analysis

Document layout analysis usually relies on computer vision models to und...

Minghao Li, et al.

research

∙ 12/31/2019

LayoutLM: Pre-training of Text and Layout for Document Image Understanding

Pre-training techniques have been verified successfully in a variety of ...

Yiheng Xu, et al.
