A machine transliteration tool between Uzbek alphabets

05/19/2022
by   Ulugbek Salaev, et al.
0

Machine transliteration, as defined in this paper, is a process of automatically transforming written script of words from a source alphabet into words of another target alphabet within the same language, while preserving their meaning, as well as pronunciation. The main goal of this paper is to present a machine transliteration tool between three common scripts used in low-resource Uzbek language: the old Cyrillic, currently official Latin, and newly announced New Latin alphabets. The tool has been created using a combination of rule-based and fine-tuning approaches. The created tool is available as an open-source Python package, as well as a web-based application including a public API. To our knowledge, this is the first machine transliteration tool that supports the newly announced Latin alphabet of the Uzbek language.

READ FULL TEXT
research
01/30/2023

UzbekTagger: The rule-based POS tagger for Uzbek language

This research paper presents a part-of-speech (POS) annotated dataset an...
research
04/17/2023

Prak: An automatic phonetic alignment tool for Czech

Labeling speech down to the identity and time boundaries of phones is a ...
research
10/19/2020

PySBD: Pragmatic Sentence Boundary Disambiguation

In this paper, we present a rule-based sentence boundary disambiguation ...
research
10/28/2022

Development of a rule-based lemmatization algorithm through Finite State Machine for Uzbek language

Lemmatization is one of the core concepts in natural language processing...
research
01/06/2017

Online characterization of planetary surfaces: PlanetServer, an open-source analysis and visualization tool

The lack of open-source tools for hyperspectral data visualization and a...
research
01/01/2018

PronouncUR: An Urdu Pronunciation Lexicon Generator

State-of-the-art speech recognition systems rely heavily on three basic ...
research
10/28/2020

PeopleXploit – A hybrid tool to collect public data

This paper introduces the concept of Open Source Intelligence (OSINT) as...

Please sign up or login with your details

Forgot password? Click here to reset