Chat Image Generator Video Music Voice Chat Photo Editor

Modular Multimodal Architecture for Document Classification

12/09/2019

∙

Page classification is a crucial component to any document analysis system, allowing for complex branching control flows for different components of a given document. Utilizing both the visual and textual content of a page, the proposed method exceeds the current state-of-the-art performance on the RVL-CDIP benchmark at 93.03

READ FULL TEXT

Modular Multimodal Architecture for Document Classification

Sign in with Google

Consider DeepAI Pro