Exploiting Multi-Scale Fusion, Spatial Attention and Patch Interaction Techniques for Text-Independent Writer Identification

11/20/2021
by   Abhishek Srivastava, et al.
5

Text independent writer identification is a challenging problem that differentiates between different handwriting styles to decide the author of the handwritten text. Earlier writer identification relied on handcrafted features to reveal pieces of differences between writers. Recent work with the advent of convolutional neural network, deep learning-based methods have evolved. In this paper, three different deep learning techniques - spatial attention mechanism, multi-scale feature fusion and patch-based CNN were proposed to effectively capture the difference between each writer's handwriting. Our methods are based on the hypothesis that handwritten text images have specific spatial regions which are more unique to a writer's style, multi-scale features propagate characteristic features with respect to individual writers and patch-based features give more general and robust representations that helps to discriminate handwriting from different writers. The proposed methods outperforms various state-of-the-art methodologies on word-level and page-level writer identification methods on three publicly available datasets - CVL, Firemaker, CERUG-EN datasets and give comparable performance on the IAM dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/10/2020

Text-independent writer identification using convolutional neural network

The text-independent approach to writer identification does not require ...
research
10/01/2019

A Computationally Efficient Pipeline Approach to Full Page Offline Handwritten Text Recognition

Offline handwriting recognition with deep neural networks is usually lim...
research
06/21/2016

DeepWriter: A Multi-Stream Deep CNN for Text-independent Writer Identification

Text-independent writer identification is challenging due to the huge va...
research
08/15/2022

An Efficient Multi-Scale Fusion Network for 3D Organ at Risk (OAR) Segmentation

Accurate segmentation of organs-at-risks (OARs) is a precursor for optim...
research
01/01/2018

Script Identification in Natural Scene Image and Video Frame using Attention based Convolutional-LSTM Network

Script identification facilitates many important applications in documen...
research
08/09/2023

Learning multi-domain feature relation for visible and Long-wave Infrared image patch matching

Recently, learning-based algorithms have achieved promising performance ...
research
05/17/2023

Two-Stream Regression Network for Dental Implant Position Prediction

In implant prosthesis treatment, the design of surgical guide requires l...

Please sign up or login with your details

Forgot password? Click here to reset