Resources for Turkish Natural Language Processing: A critical survey

04/11/2022
by   Çağrı Çöltekin, et al.
0

This paper presents a comprehensive survey of corpora and lexical resources available for Turkish. We review a broad range of resources, focusing on the ones that are publicly available. In addition to providing information about the available linguistic resources, we present a set of recommendations, and identify gaps in the data available for conducting research and building applications in Turkish Linguistics and Natural Language Processing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2023

A Survey of Resources and Methods for Natural Language Processing of Serbian Language

The Serbian language is a Slavic language spoken by over 12 million spea...
research
06/05/2019

Survey on Publicly Available Sinhala Natural Language Processing Tools and Research

Sinhala is the native language of the Sinhalese people who make up the l...
research
03/07/2017

Building a Syllable Database to Solve the Problem of Khmer Word Segmentation

Word segmentation is a basic problem in natural language processing. Wit...
research
02/25/2020

Detecting Asks in SE attacks: Impact of Linguistic and Structural Knowledge

Social engineers attempt to manipulate users into undertaking actions su...
research
02/25/2017

Critical Survey of the Freely Available Arabic Corpora

The availability of corpora is a major factor in building natural langua...
research
04/19/2023

Bridging Natural Language Processing and Psycholinguistics: computationally grounded semantic similarity datasets for Basque and Spanish

We present a computationally-grounded word similarity dataset based on t...
research
06/05/2023

Easy-to-Read in Germany: A Survey on its Current State and Available Resources

Easy-to-Read Language (E2R) is a controlled language variant that makes ...

Please sign up or login with your details

Forgot password? Click here to reset