Source Printer Classification using Printer Specific Local Texture Descriptor

06/18/2018
by   Sharad Joshi, et al.
0

The knowledge of source printer can help in printed text document authentication, copyright ownership, and provide important clues about the author of a fraudulent document along with his/her potential means and motives. Development of automated systems for classifying printed documents based on their source printer, using image processing techniques, is gaining a lot of attention in multimedia forensics. Currently, state-of-the-art systems require that the font of letters present in test documents of unknown origin must be available in those used for training the classifier. In this work, we attempt to take the first step towards overcoming this limitation. Specifically, we introduce a novel printer specific local texture descriptor. The highlight of our technique is the use of encoding and regrouping strategy based on small linear-shaped structures composed of pixels having similar intensity and gradient. The results of experiments performed on two separate datasets show that: 1) on a publicly available dataset, the proposed method outperforms state-of-the-art algorithms for characters printed in the same font, and 2) on another dataset[Code and dataset will be made publicly available with published version of this paper.] having documents printed in four different fonts, the proposed method correctly classifies all test samples when sufficient training data is available in same font setup. In addition, it outperforms state-of-the-art methods for cross font experiments. Moreover, it reduces the confusion between the printers of same brand and model.

READ FULL TEXT
research
06/22/2017

Single Classifier-based Passive System for Source Printer Classification using Local Texture Features

An important aspect of examining printed documents for potential forgeri...
research
01/31/2019

Funnelling: A New Ensemble Method for Heterogeneous Transfer Learning and its Application to Polylingual Text Classification

Polylingual Text Classification (PLC) consists of automatically classify...
research
06/20/2017

Passive Classification of Source Printer using Text-line-level Geometric Distortion Signatures from Scanned Images of Printed Documents

In this digital era, one thing that still holds the convention is a prin...
research
09/09/2019

HoughNet: neural network architecture for vanishing points detection

In this paper we introduce a novel neural network architecture based on ...
research
03/27/2020

Source Printer Identification from Document Images Acquired using Smartphone

Vast volumes of printed documents continue to be used for various import...
research
08/17/2018

First Steps Toward CNN based Source Classification of Document Images Shared Over Messaging App

Knowledge of source smartphone corresponding to a document image can be ...
research
06/29/2023

Classifying Crime Types using Judgment Documents from Social Media

The task of determining crime types based on criminal behavior facts has...

Please sign up or login with your details

Forgot password? Click here to reset