First Steps Toward CNN based Source Classification of Document Images Shared Over Messaging App

08/17/2018
by   Sharad Joshi, et al.
0

Knowledge of source smartphone corresponding to a document image can be helpful in a variety of applications including copyright infringement, ownership attribution, leak identification and usage restriction. In this letter, we investigate a convolutional neural network-based approach to solve source smartphone identification problem for printed text documents which have been captured by smartphone cameras and shared over messaging platform. In absence of any publicly available dataset addressing this problem, we introduce a new image dataset consisting of 315 images of documents printed in three different fonts, captured using 21 smartphones and shared over WhatsApp. Experiments conducted on this dataset demonstrate that, in all scenarios, the proposed system performs as well as or better than the state-of-the-art system based on handcrafted features and classification of letters extracted from document images. The new dataset and code of the proposed system will be made publicly available along with this letter's publication, presently they are submitted for review.

READ FULL TEXT
research
03/27/2020

Source Printer Identification from Document Images Acquired using Smartphone

Vast volumes of printed documents continue to be used for various import...
research
06/22/2017

Single Classifier-based Passive System for Source Printer Classification using Local Texture Features

An important aspect of examining printed documents for potential forgeri...
research
06/16/2023

Acoustic Identification of Ae. aegypti Mosquitoes using Smartphone Apps and Residual Convolutional Neural Networks

In this paper, we advocate in favor of smartphone apps as low-cost, easy...
research
08/06/2023

Unmasking the Invisible: Finding Location-Specific Aggregated Air Quality Index with Smartphone-Captured Images

The prevalence and mobility of smartphones make these a widely used tool...
research
06/18/2018

Source Printer Classification using Printer Specific Local Texture Descriptor

The knowledge of source printer can help in printed text document authen...
research
05/19/2021

Light-weight Document Image Cleanup using Perceptual Loss

Smartphones have enabled effortless capturing and sharing of documents i...
research
10/12/2022

Lbl2Vec: An Embedding-Based Approach for Unsupervised Document Retrieval on Predefined Topics

In this paper, we consider the task of retrieving documents with predefi...

Please sign up or login with your details

Forgot password? Click here to reset