Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval

02/25/2015
by Adam W. Harley, et al.
This paper presents a new state-of-the-art for document image classification and retrieval, using features learned by deep convolutional neural networks (CNNs). In object and scene analysis, deep neural nets are capable of learning a hierarchical chain of abstraction from pixel inputs to concise and descriptive representations. The current work explores this capacity in the realm of document analysis, and confirms that this representation strategy is superior to a variety of popular hand-crafted alternatives. Experiments also show that (i) features extracted from CNNs are robust to compression, (ii) CNNs trained on non-document images transfer well to document analysis tasks, and (iii) enforcing region-specific feature-learning is unnecessary given sufficient training data. This work also makes available a new labelled subset of the IIT-CDIP collection, containing 400,000 document images across 16 categories, useful for training new CNNs for document analysis.
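The retrieval task mentioned above can be illustrated with a minimal sketch: once each document image has a feature vector, retrieval reduces to ranking a gallery by similarity to a query. The abstract does not state which similarity measure is used, so cosine similarity is an assumption here, and the function names, document IDs, and three-dimensional vectors are hypothetical stand-ins for real CNN features.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def retrieve(query, gallery, top_k=3):
    """Return the top_k (doc_id, score) pairs, most similar first."""
    scored = [
        (doc_id, cosine_similarity(query, feats))
        for doc_id, feats in gallery.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy, hand-made vectors standing in for CNN features (hypothetical data).
gallery = {
    "letter_01": [0.9, 0.1, 0.0],
    "memo_01": [0.1, 0.9, 0.1],
    "letter_02": [0.8, 0.2, 0.1],
}
query = [1.0, 0.0, 0.0]
ranking = retrieve(query, gallery, top_k=2)
print(ranking)
```

In practice the vectors would come from a trained network and be far higher-dimensional, but the ranking step would keep the same shape.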
