Joint Energy-based Detection and Classification of Multilingual Text Lines

07/23/2014
by   Igor Milevskiy, et al.

This paper proposes a new hierarchical MDL-based model for a joint detection and classification of multilingual text lines in images taken by hand-held cameras. The majority of related text detection methods assume alphabet-based writing in a single language, e.g. in Latin. They use simple clustering heuristics specific to such texts: proximity between letters within one line, larger distance between separate lines, etc. We are interested in a significantly more ambiguous problem where images combine alphabet and logographic characters from multiple languages and typographic rules vary a lot (e.g. English, Korean, and Chinese). Complexity of detecting and classifying text lines in multiple languages calls for a more principled approach based on information-theoretic principles. Our new MDL model includes data costs combining geometric errors with classification likelihoods and a hierarchical sparsity term based on label costs. This energy model can be efficiently minimized by fusion moves. We demonstrate robustness of the proposed algorithm on a large new database of multilingual text images collected in the public transit system of Seoul.
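
The energy described in the abstract can be illustrated with a minimal sketch. This is not the authors' formulation: the function name `labeling_energy`, the way the geometric error and the negative log-likelihood are summed, the flat per-line cost, and all numbers below are assumptions chosen only to show how a data term and a two-level label-cost term combine into one score. Minimization by fusion moves is not shown.

```python
import math


def labeling_energy(assignments, geometric_error, class_likelihood,
                    line_language, line_cost, language_cost):
    """Score one candidate labeling (hypothetical, illustrative energy).

    assignments: character id -> text line id
    geometric_error: character id -> {line id: fitting error}
    class_likelihood: character id -> {language: likelihood in (0, 1]}
    line_language: line id -> language assigned to that line
    """
    # Data term: geometric error plus negative log classification likelihood.
    data = 0.0
    for char_id, line_id in assignments.items():
        language = line_language[line_id]
        data += geometric_error[char_id][line_id]
        data -= math.log(class_likelihood[char_id][language])

    # Hierarchical sparsity term: pay once per text line in use,
    # and once per language that any used line belongs to.
    used_lines = set(assignments.values())
    used_languages = {line_language[line_id] for line_id in used_lines}
    sparsity = line_cost * len(used_lines)
    sparsity += sum(language_cost[lang] for lang in used_languages)

    return data + sparsity


# Toy data (assumed values): two characters, two candidate lines.
geometric_error = {"a": {0: 0.1, 1: 2.0}, "b": {0: 0.2, 1: 1.5}}
class_likelihood = {"a": {"en": 0.9, "ko": 0.1}, "b": {"en": 0.8, "ko": 0.2}}
line_language = {0: "en", 1: "ko"}
language_cost = {"en": 0.5, "ko": 0.5}

one_line = {"a": 0, "b": 0}   # both characters on the English line
two_lines = {"a": 0, "b": 1}  # characters split across two lines

energy_one = labeling_energy(one_line, geometric_error, class_likelihood,
                             line_language, 1.0, language_cost)
energy_two = labeling_energy(two_lines, geometric_error, class_likelihood,
                             line_language, 1.0, language_cost)
```

In this toy setup the single-line labeling scores lower, because it fits the characters better and also pays for fewer lines and fewer languages. That preference for compact explanations is the role a label-cost term plays in an MDL-style energy.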

research
01/18/2021

Text line extraction using fully convolutional network and energy minimization

Text lines are important parts of handwritten document images and easier...
research
02/06/2020

Irony Detection in a Multilingual Context

This paper proposes the first multilingual (French, English and Arabic) ...
research
08/19/2023

AltDiffusion: A Multilingual Text-to-Image Diffusion Model

Large Text-to-Image(T2I) diffusion models have shown a remarkable capabi...
research
11/11/2022

MINION: a Large-Scale and Diverse Dataset for Multilingual Event Detection

Event Detection (ED) is the task of identifying and classifying trigger ...
research
01/13/2023

Multilingual Detection of Check-Worthy Claims using World Languages and Adapter Fusion

Check-worthiness detection is the task of identifying claims, worthy to ...
research
02/28/2023

Augmented Transformers with Adaptive n-grams Embedding for Multilingual Scene Text Recognition

While vision transformers have been highly successful in improving the p...
research
10/13/2022

Task Grouping for Multilingual Text Recognition

Most existing OCR methods focus on alphanumeric characters due to the po...
