Spoken dialect identification in Twitter using a multi-filter architecture

06/05/2020
by Mohammadreza Banaei, et al.

This paper presents our approach for SwissText KONVENS 2020 shared task 2: a multi-stage neural model for Swiss German (GSW) identification on Twitter. Our model outputs either GSW or non-GSW and is not meant to be used as a generic language identifier. Our architecture consists of two independent filters, where the first favors recall and the second favors precision (both with respect to GSW). Moreover, we do not use binary models (GSW vs. non-GSW) in our filters but rather multi-class classifiers with GSW being one of the possible labels. Our model reaches an F1-score of 0.982 on the test set of the shared task.
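The two-filter idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the classifier interface, the label names other than GSW, the thresholds `low` and `high`, and the toy stand-in classifiers are all assumptions made for illustration. It only shows how a recall-oriented multi-class filter followed by a precision-oriented one could be combined into a GSW / non-GSW decision.

```python
from typing import Callable, Dict

# Hypothetical interface: a multi-class language classifier maps a text
# to a probability for each label, with "GSW" being one of the labels.
Classifier = Callable[[str], Dict[str, float]]


def is_gsw(
    text: str,
    recall_filter: Classifier,
    precision_filter: Classifier,
    low: float = 0.1,   # assumed permissive threshold (favors recall)
    high: float = 0.9,  # assumed strict threshold (favors precision)
) -> bool:
    """Return True only if both filters accept the text as GSW."""
    # Stage 1: drop texts that are clearly not GSW; keep anything plausible.
    first = recall_filter(text)
    if first.get("GSW", 0.0) < low:
        return False
    # Stage 2: accept only if GSW is the top label with high confidence.
    second = precision_filter(text)
    top_label = max(second, key=second.get)
    return top_label == "GSW" and second.get("GSW", 0.0) >= high


# Toy stand-ins for trained models, used only to make the sketch runnable.
def toy_recall_filter(text: str) -> Dict[str, float]:
    if "gruezi" in text.lower():
        return {"GSW": 0.6, "DE": 0.3, "EN": 0.1}
    return {"GSW": 0.05, "DE": 0.8, "EN": 0.15}


def toy_precision_filter(text: str) -> Dict[str, float]:
    if "gruezi" in text.lower():
        return {"GSW": 0.95, "DE": 0.04, "EN": 0.01}
    return {"GSW": 0.2, "DE": 0.7, "EN": 0.1}
```

Calling `is_gsw("Gruezi mitenand", toy_recall_filter, toy_precision_filter)` runs both stages in sequence. In this sketch the final output is binary even though each filter is multi-class, which mirrors the description above.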


