DWT-CompCNN: Deep Image Classification Network for High Throughput JPEG 2000 Compressed Documents

06/02/2023
by   Tejasvee Bisen, et al.
0

For any digital application with document images such as retrieval, the classification of document images becomes an essential stage. Conventionally for the purpose, the full versions of the documents, that is the uncompressed document images make the input dataset, which poses a threat due to the big volume required to accommodate the full versions of the documents. Therefore, it would be novel, if the same classification task could be accomplished directly (with some partial decompression) with the compressed representation of documents in order to make the whole process computationally more efficient. In this research work, a novel deep learning model, DWT CompCNN is proposed for classification of documents that are compressed using High Throughput JPEG 2000 (HTJ2K) algorithm. The proposed DWT-CompCNN comprises of five convolutional layers with filter sizes of 16, 32, 64, 128, and 256 consecutively for each increasing layer to improve learning from the wavelet coefficients extracted from the compressed images. Experiments are performed on two benchmark datasets- Tobacco-3482 and RVL-CDIP, which demonstrate that the proposed model is time and space efficient, and also achieves a better classification accuracy in compressed domain.

READ FULL TEXT

page 1

page 4

page 5

page 6

page 13

page 14

research
09/13/2022

Document Image Binarization in JPEG Compressed Domain using Dual Discriminator Generative Adversarial Networks

Image binarization techniques are being popularly used in enhancement of...
research
10/11/2014

Direct Processing of Document Images in Compressed Domain

With the rapid increase in the volume of Big data of this digital era, f...
research
07/26/2019

DCT-CompCNN: A Novel Image Classification Network Using JPEG Compressed DCT Coefficients

The popularity of Convolutional Neural Network (CNN) in the field of Ima...
research
06/20/2020

Remote Sensing Image Scene Classification with Deep Neural Networks in JPEG 2000 Compressed Domain

To reduce the storage requirements, remote sensing (RS) images are usual...
research
01/25/2021

Spanner Evaluation over SLP-Compressed Documents

We consider the problem of evaluating regular spanners over compressed d...
research
06/20/2017

Passive Classification of Source Printer using Text-line-level Geometric Distortion Signatures from Scanned Images of Printed Documents

In this digital era, one thing that still holds the convention is a prin...

Please sign up or login with your details

Forgot password? Click here to reset