Exploring Multi-Tasking Learning in Document Attribute Classification

08/30/2021
by   Tanmoy Mondal, et al.
0

In this work, we adhere to explore a Multi-Tasking learning (MTL) based network to perform document attribute classification such as the font type, font size, font emphasis and scanning resolution classification of a document image. To accomplish these tasks, we operate on either segmented word level or on uniformed size patches randomly cropped out of the document. Furthermore, a hybrid convolution neural network (CNN) architecture "MTL+MI", which is based on the combination of MTL and Multi-Instance (MI) of patch and word is used to accomplish joint learning for the classification of the same document attributes. The contribution of this paper are three fold: firstly, based on segmented word images and patches, we present a MTL based network for the classification of a full document image. Secondly, we propose a MTL and MI (using segmented words and patches) based combined CNN architecture ("MTL+MI") for the classification of same document attributes. Thirdly, based on the multi-tasking classifications of the words and/or patches, we propose an intelligent voting system which is based on the posterior probabilities of each words and/or patches to perform the classification of document's attributes of complete document image.

READ FULL TEXT

page 6

page 11

research
09/20/2019

Document Rectification and Illumination Correction using a Patch-based CNN

We propose a novel learning method to rectify document images with vario...
research
02/01/2016

Efficient Character-level Document Classification by Combining Convolution and Recurrent Layers

Document classification tasks were primarily tackled at word level. Rece...
research
09/17/2020

Word Segmentation from Unconstrained Handwritten Bangla Document Images using Distance Transform

Segmentation of handwritten document images into text lines and words is...
research
07/06/2020

Reflection-based Word Attribute Transfer

Word embeddings, which often represent such analogic relations as king -...
research
06/05/2021

An End-to-End Breast Tumour Classification Model Using Context-Based Patch Modelling- A BiLSTM Approach for Image Classification

Researchers working on computational analysis of Whole Slide Images (WSI...
research
12/15/2014

Highly Efficient Forward and Backward Propagation of Convolutional Neural Networks for Pixelwise Classification

We present highly efficient algorithms for performing forward and backwa...
research
10/04/2015

A Novel Approach to Document Classification using WordNet

Content based Document Classification is one of the biggest challenges i...

Please sign up or login with your details

Forgot password? Click here to reset