A Rule-based Kurdish Text Transliteration System

11/26/2018
by   Sina Ahmadi, et al.
0

In this article, we present a rule-based approach for transliterating two mostly used orthographies in Sorani Kurdish. Our work consists of detecting a character in a word by removing the possible ambiguities and mapping it into the target orthography. We describe different challenges in Kurdish text mining and propose novel ideas concerning the transliteration task for Sorani Kurdish. Our transliteration system, named Wergor, achieves 82.79 more than 99 manually transliterated corpus for Kurdish.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/26/2021

PρLog: a system for rule-based programming

In this paper, we give a brief overview of the rule-based programming sy...
research
08/19/2017

A rule based algorithm for detecting negative words in Persian

In this paper, we present a novel method for detecting negative words in...
research
10/20/2020

RAN Cognitive Controller

Cognitive Autonomous Networks (CAN) deploys learning based Cognitive Fun...
research
09/06/2017

"Having 2 hours to write a paper is fun!": Detecting Sarcasm in Numerical Portions of Text

Sarcasm occurring due to the presence of numerical portions in text has ...
research
08/23/2016

Using Semantic Similarity for Input Topic Identification in Crawling-based Web Application Testing

To automatically test web applications, crawling-based techniques are us...
research
03/29/2022

Seq-2-Seq based Refinement of ASR Output for Spoken Name Capture

Person name capture from human speech is a difficult task in human-machi...
research
11/03/2020

A Benchmark of Rule-Based and Neural Coreference Resolution in Dutch Novels and News

We evaluate a rule-based (Lee et al., 2013) and neural (Lee et al., 2018...

Please sign up or login with your details

Forgot password? Click here to reset