End-to-End Optical Character Recognition for Bengali Handwritten Words

05/09/2021
by   Farisa Benta Safir, et al.
0

Optical character recognition (OCR) is a process of converting analogue documents into digital using document images. Currently, many commercial and non-commercial OCR systems exist for both handwritten and printed copies for different languages. Despite this, very few works are available in case of recognising Bengali words. Among them, most of the works focused on OCR of printed Bengali characters. This paper introduces an end-to-end OCR system for Bengali language. The proposed architecture implements an end to end strategy that recognises handwritten Bengali words from handwritten word images. We experiment with popular convolutional neural network (CNN) architectures, including DenseNet, Xception, NASNet, and MobileNet to build the OCR architecture. Further, we experiment with two different recurrent neural networks (RNN) methods, LSTM and GRU. We evaluate the proposed architecture using BanglaWritting dataset, which is a peer-reviewed Bengali handwritten image dataset. The proposed method achieves 0.091 character error rate and 0.273 word error rate performed using DenseNet121 model with GRU recurrent layer.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/24/2020

Persian Handwritten Digit, Character, and Words Recognition by Using Deep Learning Methods

Digit, character, and word recognition of a particular script play a key...
research
02/25/2022

Improving Amharic Handwritten Word Recognition Using Auxiliary Task

Amharic is one of the official languages of the Federal Democratic Repub...
research
03/13/2023

Handwritten Word Recognition using Deep Learning Approach: A Novel Way of Generating Handwritten Words

A handwritten word recognition system comes with issues such as lack of ...
research
10/22/2013

Word Spotting in Cursive Handwritten Documents using Modified Character Shape Codes

There is a large collection of Handwritten English paper documents of Hi...
research
08/18/2023

A tailored Handwritten-Text-Recognition System for Medieval Latin

The Bavarian Academy of Sciences and Humanities aims to digitize its Med...
research
10/11/2007

Comparison and Combination of State-of-the-art Techniques for Handwritten Character Recognition: Topping the MNIST Benchmark

Although the recognition of isolated handwritten digits has been a resea...
research
07/30/2020

The Making of 5G: Building an End-to-End 5G-Enabled System

This article documents one of the world's first standards-compliant pre-...

Please sign up or login with your details

Forgot password? Click here to reset