Text detection and recognition based on a lensless imaging system

10/09/2022
by   Yinger Zhang, et al.

Lensless cameras offer several advantages over conventional cameras, including miniaturization, ease of manufacture, and low cost. However, they have not been widely adopted because of their poor image clarity and low resolution, especially for tasks that place high demands on image quality and detail, such as text detection and text recognition. To address this problem, a deep-learning-based pipeline was built to recognize text in three steps from the raw data captured by lensless cameras. The pipeline consists of the lensless imaging model U-Net, the text detection model connectionist text proposal network (CTPN), and the text recognition model convolutional recurrent neural network (CRNN). Compared with a method that focuses only on image reconstruction, the U-Net in the pipeline supplements imaging details by enhancing factors related to character categories during reconstruction, so the reconstructed lensless images have fewer artifacts and higher clarity, and CTPN and CRNN can detect and recognize the textual information more effectively. Experiments on datasets of different complexities verified the applicability of the pipeline to text detection and recognition with lensless cameras. This study demonstrates text detection and recognition in a lensless camera system and provides a basic method for novel applications.
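The three-step structure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `LenslessTextPipeline`, the box convention, and the toy stand-in functions are all assumptions made for illustration, and the real stages would be trained U-Net, CTPN, and CRNN models.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Image = List[List[float]]        # grayscale image as rows of pixel values
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), end-exclusive (assumed convention)


@dataclass
class LenslessTextPipeline:
    """Hypothetical wrapper that chains the three stages named in the abstract."""
    reconstruct: Callable[[Image], Image]     # stands in for the U-Net imaging model
    detect: Callable[[Image], Sequence[Box]]  # stands in for the CTPN detector
    recognize: Callable[[Image], str]         # stands in for the CRNN recognizer

    def __call__(self, raw: Image) -> List[Tuple[Box, str]]:
        # Step 1: turn the raw lensless measurement into a viewable image.
        image = self.reconstruct(raw)
        results = []
        # Step 2: find text regions in the reconstructed image.
        for box in self.detect(image):
            top, left, bottom, right = box
            # Step 3: crop each region and read the text inside it.
            crop = [row[left:right] for row in image[top:bottom]]
            results.append((box, self.recognize(crop)))
        return results


# Toy stand-ins so the sketch runs without any trained model.
def identity_reconstruct(raw: Image) -> Image:
    return [list(row) for row in raw]


def whole_image_detect(image: Image) -> Sequence[Box]:
    return [(0, 0, len(image), len(image[0]))]


def size_recognize(crop: Image) -> str:
    return f"{len(crop)}x{len(crop[0])}"


pipeline = LenslessTextPipeline(identity_reconstruct, whole_image_detect, size_recognize)
result = pipeline([[0.0, 0.1, 0.2], [0.3, 0.4, 0.5]])
```

Keeping each stage behind a plain callable means a reconstruction-only baseline and a text-aware reconstruction model could be swapped in without touching the detection or recognition steps.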


research
08/14/2020

Performance characterization of a novel deep learning-based MR image reconstruction pipeline

A novel deep learning-based magnetic resonance imaging reconstruction pi...
research
04/01/2022

Extremely Low-light Image Enhancement with Scene Text Restoration

Deep learning-based methods have made impressive progress in enhancing e...
research
07/30/2020

Quantitative Distortion Analysis of Flattening Applied to the Scroll from En-Gedi

Non-invasive volumetric imaging can now capture the internal structure a...
research
08/02/2019

Y-Net: A Hybrid Deep Learning Reconstruction Framework for Photoacoustic Imaging in vivo

Photoacoustic imaging (PAI) is an emerging non-invasive imaging modality...
research
01/22/2021

AS-Net: Fast Photoacoustic Reconstruction with Multi-feature Fusion from Sparse Data

Photoacoustic (PA) imaging is a biomedical imaging modality capable of a...
research
03/08/2022

Unrolled Primal-Dual Networks for Lensless Cameras

Conventional image reconstruction models for lensless cameras often assu...
research
11/27/2018

A Compositional Textual Model for Recognition of Imperfect Word Images

Printed text recognition is an important problem for industrial OCR syst...
