Next word prediction based on the N-gram model for Kurdish Sorani and Kurmanji

07/27/2020
by   Hozan K. Hamarashid, et al.
0

Next word prediction is an input technology that simplifies the process of typing by suggesting the next word to a user to select, as typing in a conversation consumes time. A few previous studies have focused on the Kurdish language, including the use of next word prediction. However, the lack of a Kurdish text corpus presents a challenge. Moreover, the lack of a sufficient number of N-grams for the Kurdish language, for instance, five grams, is the reason for the rare use of next Kurdish word prediction. Furthermore, the improper display of several Kurdish letters in the Rstudio software is another problem. This paper provides a Kurdish corpus, creates five, and presents a unique research work on next word prediction for Kurdish Sorani and Kurmanji. The N-gram model has been used for next word prediction to reduce the amount of time while typing in the Kurdish language. In addition, little work has been conducted on next Kurdish word prediction; thus, the N-gram model is utilized to suggest text accurately. To do so, R programming and RStudio are used to build the application. The model is 96.3

READ FULL TEXT

page 19

page 22

page 23

page 24

page 28

page 29

page 30

page 31

research
01/27/2017

Bangla Word Clustering Based on Tri-gram, 4-gram and 5-gram Language Model

In this paper, we describe a research method that generates Bangla word ...
research
01/08/2022

A comprehensive review and evaluation on text predictive and entertainment systems

One of the most important ways to experience communication and interact ...
research
12/25/2019

N-gram Statistical Stemmer for Bangla Corpus

Stemming is a process that can be utilized to trim inflected words to st...
research
07/05/2021

On Bi-gram Graph Attributes

We propose a new approach to text semantic analysis and general corpus a...
research
07/06/2017

An Embedded Deep Learning based Word Prediction

Recent developments in deep learning with application to language modeli...
research
04/01/2020

An Improved Classification Model for Igbo Text Using N-Gram And K-Nearest Neighbour Approaches

This paper presents an improved classification model for Igbo text using...
research
01/13/2021

On consistency scores in text data with an implementation in R

In this paper, we introduce a reproducible cleaning process for the text...

Please sign up or login with your details

Forgot password? Click here to reset