TS-Net: OCR Trained to Switch Between Text Transcription Styles

03/09/2021
by   Jan Kohút, et al.
0

Users of OCR systems, from different institutions and scientific disciplines, prefer and produce different transcription styles. This presents a problem for training of consistent text recognition neural networks on real-world data. We propose to extend existing text recognition networks with a Transcription Style Block (TSB) which can learn from data to switch between multiple transcription styles without any explicit knowledge of transcription rules. TSB is an adaptive instance normalization conditioned by identifiers representing consistently transcribed documents (e.g. single document, documents by a single transcriber, or an institution). We show that TSB is able to learn completely different transcription styles in controlled experiments on artificial data, it improves text recognition accuracy on large-scale real-world data, and it learns semantically meaningful transcription style embedding. We also show how TSB can efficiently adapt to transcription styles of new documents from transcriptions of only a few text lines.

READ FULL TEXT
research
05/21/2018

Batch-Instance Normalization for Adaptively Style-Invariant Neural Networks

Real-world image recognition is often challenged by the variability of v...
research
08/26/2020

Generating Handwriting via Decouple Style Descriptors

Representing a space of handwriting stroke styles includes the challenge...
research
06/21/2021

Photozilla: A Large-Scale Photography Dataset and Visual Embedding for 20 Photography Styles

The advent of social media platforms has been a catalyst for the develop...
research
04/05/2021

MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition

Handwritten Text Recognition (HTR) remains a challenging problem to date...
research
02/13/2023

Towards Writing Style Adaptation in Handwriting Recognition

One of the challenges of handwriting recognition is to transcribe a larg...
research
10/10/2016

Highly Robust Clustering of GPS Driver Data for Energy Efficient Driving Style Modelling

This paper presents a novel approach to distinguish driving styles with ...
research
05/30/2023

AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation

This paper presents a method that can quickly adapt dynamic 3D avatars t...

Please sign up or login with your details

Forgot password? Click here to reset