Discriminating Between Similar Nordic Languages

12/11/2020
by   René Haas, et al.
0

Automatic language identification is a challenging problem. Discriminating between closely related languages is especially difficult. This paper presents a machine learning approach for automatic language identification for the Nordic languages, which often suffer miscategorisation by existing state-of-the-art tools. Concretely we will focus on discrimination between six Nordic languages: Danish, Swedish, Norwegian (Nynorsk), Norwegian (Bokmål), Faroese and Icelandic.

READ FULL TEXT

page 2

page 4

page 5

page 6

research
03/26/2018

Automatic Identification of Closely-related Indian Languages: Resources and Experiments

In this paper, we discuss an attempt to develop an automatic language id...
research
09/30/2016

Discriminating Similar Languages: Evaluations and Explorations

We present an analysis of the performance of machine learning classifier...
research
02/25/2023

The 𝖠𝖢^0-Complexity Of Visibly Pushdown Languages

We concern ourselves with the question which visibly pushdown languages ...
research
06/09/2022

Language Identification for Austronesian Languages

This paper provides language identification models for low- and under-re...
research
10/21/2022

AfroLID: A Neural Language Identification Tool for African Languages

Language identification (LID) is a crucial precursor for NLP, especially...
research
02/10/2022

Decomposition Problem in Process of Selective Identification and Localization of Voltage Fluctuations Sources in Power Grids

Voltage fluctuations are common disturbances in power grids, therefore t...
research
07/15/2015

Language discrimination and clustering via a neural network approach

We classify twenty-one Indo-European languages starting from written tex...

Please sign up or login with your details

Forgot password? Click here to reset