Turkish Native Language Identification

07/27/2023
by Ahmet Yavuz Uluslu, et al.

In this paper, we present the first application of Native Language Identification (NLI) to the Turkish language. NLI is the task of predicting a writer's first language (L1) from their writing in another language (L2). While most NLI research has focused on English, our study extends its scope to Turkish. We use the recently constructed Turkish Learner Corpus and combine three syntactic features (CFG production rules, part-of-speech n-grams, and function words) extracted from L2 texts to demonstrate their effectiveness for this task.
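
To make two of the three feature types concrete, the following is a minimal sketch of how part-of-speech n-gram counts and function-word counts might be extracted from an already tagged sentence. It is not the authors' pipeline: the function names, the tiny function-word list, the tag labels, and the example sentence are all assumptions chosen for illustration. CFG production rules are left out because they require a constituency parser.

```python
from collections import Counter

# Hypothetical, deliberately tiny function-word list (illustration only).
FUNCTION_WORDS = {"ve", "bir", "bu", "için", "ile", "de", "da"}


def pos_ngrams(tags, n):
    """Count contiguous n-grams over a sequence of POS tags."""
    return Counter(tuple(tags[i:i + n]) for i in range(len(tags) - n + 1))


def function_word_counts(tokens):
    """Count tokens that appear in the function-word list (case-insensitive)."""
    lowered = (t.lower() for t in tokens)
    return Counter(t for t in lowered if t in FUNCTION_WORDS)


def extract_features(tagged, n=2):
    """Merge POS n-gram and function-word counts into one feature dict.

    `tagged` is a list of (token, pos_tag) pairs; the tagging itself is
    assumed to have been done by some external tool.
    """
    tokens = [word for word, _ in tagged]
    tags = [tag for _, tag in tagged]
    features = {}
    for gram, count in pos_ngrams(tags, n).items():
        features["pos:" + "_".join(gram)] = count
    for word, count in function_word_counts(tokens).items():
        features["fw:" + word] = count
    return features


# Toy input: "Bu kitap ve bu kalem" ("this book and this pen"), hand-tagged.
example = [
    ("Bu", "DET"),
    ("kitap", "NOUN"),
    ("ve", "CCONJ"),
    ("bu", "DET"),
    ("kalem", "NOUN"),
]
features = extract_features(example)
```

A feature dictionary of this shape could then be vectorised and passed to a standard classifier. Which classifier and weighting scheme the paper uses is not stated in the abstract.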
