Analysis of Convolutional Neural Networks for Document Image Classification

08/10/2017
by   Chris Tensmeyer, et al.

Convolutional Neural Networks (CNNs) are state-of-the-art models for document image classification tasks. However, many of these approaches rely on parameters and architectures designed for classifying natural images, which differ from document images. We question whether this is appropriate and conduct a large empirical study to find what aspects of CNNs most affect performance on document images. Among other results, we exceed the state-of-the-art on the RVL-CDIP dataset by using shear transform data augmentation and an architecture designed for a larger input image. Additionally, we analyze the learned features and find evidence that CNNs trained on RVL-CDIP learn region-specific layout features.
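
The abstract names shear transform data augmentation but does not give its parameters here. As a rough illustration only, the sketch below shows one way a horizontal shear could be applied to a grayscale document image using NumPy. The function names `shear_image` and `random_shear`, the `max_shear` range, the nearest-neighbour sampling, and the white fill value are all assumptions made for this example, not details taken from the paper.

```python
import numpy as np


def shear_image(img, shear, fill=255):
    """Horizontally shear a 2-D grayscale image about its vertical centre.

    Hypothetical helper: uses nearest-neighbour sampling and pads exposed
    pixels with `fill` (white, a plausible choice for document pages).
    """
    h, w = img.shape
    rows, cols = np.indices((h, w))
    # Inverse mapping: for every output pixel, find the source column.
    src_cols = np.rint(cols - shear * (rows - (h - 1) / 2.0)).astype(int)
    valid = (src_cols >= 0) & (src_cols < w)
    out = np.full_like(img, fill)
    out[valid] = img[rows[valid], src_cols[valid]]
    return out


def random_shear(img, max_shear=0.5, rng=None):
    """Apply a shear drawn uniformly from [-max_shear, max_shear].

    The default range is an assumed value chosen for illustration.
    """
    rng = np.random.default_rng() if rng is None else rng
    return shear_image(img, rng.uniform(-max_shear, max_shear))
```

In a training loop, something like `random_shear` would typically be applied to each image before resizing it to the network's input size, so that every epoch sees slightly different page geometry.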


