Survey on Publicly Available Sinhala Natural Language Processing Tools and Research

06/05/2019
by Nisansa de Silva, et al.

Sinhala is the native language of the Sinhalese people, who make up the largest ethnic group of Sri Lanka. The language belongs to the globe-spanning Indo-European language tree. However, due to poverty in both linguistic and economic capital, Sinhala remains, from the perspective of natural language processing tools and research, a resource-poor language that has neither the economic drive of its cousin English nor the sheer push of the law of numbers that a language such as Chinese has. A number of research groups from Sri Lanka have noticed this lack and the dire need for proper tools and research for Sinhala natural language processing. However, for various reasons, these attempts seem to lack coordination and awareness of each other. The objective of this paper is to fill that gap with a comprehensive literature survey of the publicly available Sinhala natural language processing tools and research, so that researchers working in this field can better utilize the contributions of their peers. As such, we shall upload this paper to arXiv and update it periodically to reflect the advances made on the topic.

