Improving accuracy and speeding up Document Image Classification through parallel systems

06/16/2020
by   Javier Ferrando, et al.
5

This paper presents a study showing the benefits of the EfficientNet models compared with heavier Convolutional Neural Networks (CNNs) in the Document Classification task, essential problem in the digitalization process of institutions. We show in the RVL-CDIP dataset that we can improve previous results with a much lighter model and present its transfer learning capabilities on a smaller in-domain dataset such as Tobacco3482. Moreover, we present an ensemble pipeline which is able to boost solely image input by combining image model predictions with the ones generated by BERT model on extracted text by OCR. We also show that the batch size can be effectively increased without hindering its accuracy so that the training process can be sped up by parallelizing throughout multiple GPUs, decreasing the computational time needed. Lastly, we expose the training performance differences between PyTorch and Tensorflow Deep Learning frameworks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/29/2018

Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

In this work, a region-based Deep Convolutional Neural Network framework...
research
02/25/2015

Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

This paper presents a new state-of-the-art for document image classifica...
research
08/31/2018

Seeing Colors: Learning Semantic Text Encoding for Classification

The question we answer with this work is: can we convert a text document...
research
05/20/2022

Deep transfer learning for image classification: a survey

Deep neural networks such as convolutional neural networks (CNNs) and tr...
research
12/04/2018

Bag of Tricks for Image Classification with Convolutional Neural Networks

Much of the recent progress made in image classification research can be...
research
12/07/2017

Distributed learning of CNNs on heterogeneous CPU/GPU architectures

Convolutional Neural Networks (CNNs) have shown to be powerful classific...
research
03/03/2016

What is the right way to represent document images?

In this article we study the problem of document image representation ba...

Please sign up or login with your details

Forgot password? Click here to reset