Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition

06/10/2021
by Ryota Yoshihashi, et al.

When deploying scene-text spotting systems on mobile platforms, lightweight models with low computational cost are preferable. In principle, end-to-end (E2E) text spotting suits this purpose because it performs text detection and recognition in a single model. However, current state-of-the-art E2E methods rely on heavy feature extractors, recurrent sequence modeling, and complex shape aligners to pursue accuracy, so their computation remains heavy. We explore the opposite direction: how far can we go without bells and whistles in E2E text spotting? To this end, we propose Context-Free TextSpotter, a text-spotting method that consists of simple convolutions and a few post-processing steps. Experiments on standard benchmarks show that Context-Free TextSpotter achieves real-time text spotting on a GPU with only three million parameters, making it the smallest and fastest among existing deep text spotters, with an acceptable degradation in transcription quality compared to heavier ones. Further, we demonstrate that our text spotter can run on a smartphone with affordable latency, which is valuable for building stand-alone OCR applications.


