Context based Roman-Urdu to Urdu Script Transliteration System

09/29/2021
by   H Muhammad Shakeel, et al.
0

Now a day computer is necessary for human being and it is very useful in many fields like search engine, text processing, short messaging services, voice chatting and text recognition. Since last many years there are many tools and techniques that have been developed to support the writing of language script. Most of the Asian languages like Arabic, Urdu, Persian, Chains and Korean are written in Roman alphabets. Roman alphabets are the most commonly used for transliteration of languages, which have non-Latin scripts. For writing Urdu characters as an input, there are many layouts which are already exist. Mostly Urdu speaker prefer to use Roman-Urdu for different applications, because mostly user is not familiar with Urdu language keyboard. The objective of this work is to improve the context base transliteration of Roman-Urdu to Urdu script. In this paper, we propose an algorithm which effectively solve the transliteration issues. The algorithm work like, convert the encoding roman words into the words in the standard Urdu script and match it with the lexicon. If match found, then display the word in the text editor. The highest frequency words are displayed if more than one match found in the lexicon. Display the first encoded and converted instance and set it to the default if there is not a single instance of the match is found and then adjust the given ambiguous word to their desire location according to their context. The outcome of this algorithm proved the efficiency and significance as compare to other models and algorithms which work for transliteration of Raman-Urdu to Urdu on context.

READ FULL TEXT
research
01/05/2021

edATLAS: An Efficient Disambiguation Algorithm for Texting in Languages with Abugida Scripts

Abugida refers to a phonogram writing system where each syllable is repr...
research
12/16/2021

Khmer Word Search: Challenges, Solutions, and Semantic-Aware Search

Search is one of the key functionalities in digital platforms and applic...
research
05/05/2020

Digraphie des langues ouest africaines : Latin2Ajami : un algorithme de translitteration automatique

The national languages of Senegal, like those of West Africa country in ...
research
10/21/2022

Doctors Handwritten Prescription Recognition System In Multi Language Using Deep Learning

Doctors typically write in incomprehensible handwriting, making it diffi...
research
05/05/2020

Digraph of Senegal s local languages: issues, challenges and prospects of their transliteration

The local languages in Senegal, like those of West African countries in ...
research
12/19/2022

Words as Gatekeepers: Measuring Discipline-specific Terms and Meanings in Scholarly Publications

Scholarly text is often laden with jargon, or specialized language that ...
research
01/23/2021

ARTH: Algorithm For Reading Text Handily – An AI Aid for People having Word Processing Issues

The objective of this project is to solve one of the major problems face...

Please sign up or login with your details

Forgot password? Click here to reset