PP-OCRv3: More Attempts for the Improvement of Ultra Lightweight OCR System

06/07/2022
by   Chenxia Li, et al.
0

Optical character recognition (OCR) technology has been widely used in various scenes, as shown in Figure 1. Designing a practical OCR system is still a meaningful but challenging task. In previous work, considering the efficiency and accuracy, we proposed a practical ultra lightweight OCR system (PP-OCR), and an optimized version PP-OCRv2. In order to further improve the performance of PP-OCRv2, a more robust OCR system PP-OCRv3 is proposed in this paper. PP-OCRv3 upgrades the text detection model and text recognition model in 9 aspects based on PP-OCRv2. For text detector, we introduce a PAN module with large receptive field named LK-PAN, a FPN module with residual attention mechanism named RSE-FPN, and DML distillation strategy. For text recognizer, the base model is replaced from CRNN to SVTR, and we introduce lightweight text recognition network SVTR LCNet, guided training of CTC by attention, data augmentation strategy TextConAug, better pre-trained model by self-supervised TextRotNet, UDML, and UIM to accelerate the model and improve the effect. Experiments on real data show that the hmean of PP-OCRv3 is 5 PP-OCRv2 under comparable inference speed. All the above mentioned models are open-sourced and the code is available in the GitHub repository PaddleOCR which is powered by PaddlePaddle.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 8

research
09/07/2021

PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System

Optical Character Recognition (OCR) systems have been widely used in var...
research
09/21/2020

PP-OCR: A Practical Ultra Lightweight OCR System

The Optical Character Recognition (OCR) systems have been widely used in...
research
10/11/2022

PP-StructureV2: A Stronger Document Analysis System

A large amount of document data exists in unstructured form such as raw ...
research
11/01/2021

PP-ShiTu: A Practical Lightweight Image Recognition System

In recent years, image recognition applications have developed rapidly. ...
research
11/01/2022

Self-supervised Character-to-Character Distillation

Handling complicated text images (e.g., irregular structures, low resolu...
research
05/17/2021

STRIDE : Scene Text Recognition In-Device

Optical Character Recognition (OCR) systems have been widely used in var...
research
08/12/2019

Self-supervised Data Bootstrapping for Deep Optical Character Recognition of Identity Documents

The essential task of verifying person identities at airports and nation...

Please sign up or login with your details

Forgot password? Click here to reset