Word Recognition with Deep Conditional Random Fields

12/04/2016
by   Gang Chen, et al.
0

Recognition of handwritten words continues to be an important problem in document analysis and recognition. Existing approaches extract hand-engineered features from word images--which can perform poorly with new data sets. Recently, deep learning has attracted great attention because of the ability to learn features from raw data. Moreover they have yielded state-of-the-art results in classification tasks including character recognition and scene recognition. On the other hand, word recognition is a sequential problem where we need to model the correlation between characters. In this paper, we propose using deep Conditional Random Fields (deep CRFs) for word recognition. Basically, we combine CRFs with deep learning, in which deep features are learned and sequences are labeled in a unified framework. We pre-train the deep structure with stacked restricted Boltzmann machines (RBMs) for feature learning and optimize the entire network with an online learning algorithm. The proposed model was evaluated on two datasets, and seen to perform significantly better than competitive baseline models. The source code is available at https://github.com/ganggit/deepCRFs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/24/2020

Persian Handwritten Digit, Character, and Words Recognition by Using Deep Learning Methods

Digit, character, and word recognition of a particular script play a key...
research
02/25/2019

Bengali Handwritten Character Classification using Transfer Learning on Deep Convolutional Neural Network

In this paper, we propose a solution which uses state-of-the-art techniq...
research
10/17/2019

Convolutional Character Networks

Recent progress has been made on developing a unified framework for join...
research
04/05/2019

NELEC at SemEval-2019 Task 3: Think Twice Before Going Deep

Existing Machine Learning techniques yield close to human performance on...
research
07/12/2019

Boosting Scene Character Recognition by Learning Canonical Forms of Glyphs

As one of the fundamental problems in document analysis, scene character...
research
12/04/2016

Joint Visual Denoising and Classification using Deep Learning

Visual restoration and recognition are traditionally addressed in pipeli...
research
07/23/2017

Language modeling with Neural trans-dimensional random fields

Trans-dimensional random field language models (TRF LMs) have recently b...

Please sign up or login with your details

Forgot password? Click here to reset