Categorizing ancient documents

08/28/2013
by   Nizar Zaghden, et al.
0

The analysis of historical documents is still a topical issue given the importance of information that can be extracted and also the importance given by the institutions to preserve their heritage. The main idea in order to characterize the content of the images of ancient documents after attempting to clean the image is segmented blocks texts from the same image and tries to find similar blocks in either the same image or the entire image database. Most approaches of offline handwriting recognition proceed by segmenting words into smaller pieces (usually characters) which are recognized separately. Recognition of a word then requires the recognition of all characters (OCR) that compose it. Our work focuses mainly on the characterization of classes in images of old documents. We use Som toolbox for finding classes in documents. We applied also fractal dimensions and points of interest to categorize and match ancient documents.

READ FULL TEXT

page 3

page 4

page 5

research
01/27/2019

Degraded Historical Documents Images Binarization Using a Combination of Enhanced Techniques

Document image binarization is the initial step and a crucial in many do...
research
08/28/2013

Text recognition in both ancient and cartographic documents

This paper deals with the recognition and matching of text in both carto...
research
09/06/2011

Devnagari document segmentation using histogram approach

Document segmentation is one of the critical phases in machine recogniti...
research
07/02/2021

Optical Braille Recognition using Circular Hough Transform

Braille has empowered visually challenged community to read and write. B...
research
06/12/2021

Predicting the Ordering of Characters in Japanese Historical Documents

Japan is a unique country with a distinct cultural heritage, which is re...
research
02/13/2022

Omnifont Persian OCR System Using Primitives

In this paper, we introduce a model-based omnifont Persian OCR system. T...
research
12/15/2014

CITlab ARGUS for historical data tables

We describe CITlab's recognition system for the ANWRESH-2014 competition...

Please sign up or login with your details

Forgot password? Click here to reset