German Dialect Identification Using Classifier Ensembles

07/22/2018
by   Alina Maria Ciobanu, et al.

In this paper we present the GDI_classification entry to the second German Dialect Identification (GDI) shared task, organized within the scope of the VarDial Evaluation Campaign 2018. The system is based on SVM classifier ensembles trained on characters and words. It was trained on a collection of speech transcripts of five Swiss-German dialects provided by the organizers. The transcripts in the dataset came from speakers from Basel, Bern, Lucerne, and Zurich. Our entry in the challenge reached an F1-score of 62.03 and was ranked third out of eight teams.
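The abstract gives no implementation details, so the following is only a minimal sketch of what an SVM ensemble over character and word features might look like, assuming scikit-learn. The n-gram ranges, the TF-IDF weighting, the hard-voting rule, the function name `build_ensemble`, and the toy strings and labels are all illustrative assumptions, not the configuration or data used in the paper.

```python
from sklearn.ensemble import VotingClassifier
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC


def build_ensemble():
    """Build a hard-voting ensemble of linear SVMs, one per feature view.

    The three views (character n-grams, word unigrams, word uni+bigrams)
    are assumptions chosen for illustration only.
    """
    char_svm = make_pipeline(
        TfidfVectorizer(analyzer="char", ngram_range=(2, 4)),
        LinearSVC(),
    )
    word_unigram_svm = make_pipeline(
        TfidfVectorizer(analyzer="word", ngram_range=(1, 1)),
        LinearSVC(),
    )
    word_bigram_svm = make_pipeline(
        TfidfVectorizer(analyzer="word", ngram_range=(1, 2)),
        LinearSVC(),
    )
    # Hard voting: each SVM casts one vote and the majority label wins.
    return VotingClassifier(
        estimators=[
            ("char", char_svm),
            ("word1", word_unigram_svm),
            ("word12", word_bigram_svm),
        ],
        voting="hard",
    )


# Synthetic placeholder data (NOT real dialect transcripts).
TOY_TEXTS = [
    "alpha beta alpha gamma",
    "beta alpha alpha beta",
    "gamma alpha beta beta",
    "zeta eta zeta theta",
    "eta zeta theta zeta",
    "theta eta eta zeta",
]
TOY_LABELS = [
    "dialect_a", "dialect_a", "dialect_a",
    "dialect_b", "dialect_b", "dialect_b",
]

if __name__ == "__main__":
    model = build_ensemble().fit(TOY_TEXTS, TOY_LABELS)
    print(model.predict(["alpha gamma beta", "zeta theta eta"]))
```

Using one classifier per feature view lets each SVM specialize (for example, character n-grams can pick up spelling patterns while word features capture vocabulary), and the vote combines them; whether the paper combines its classifiers this way is not stated in the abstract.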

