Challenges of Computational Processing of Code-Switching

10/07/2016
by   Özlem Çetinoğlu, et al.
0

This paper addresses challenges of Natural Language Processing (NLP) on non-canonical multilingual data in which two or more languages are mixed. It refers to code-switching which has become more popular in our daily life and therefore obtains an increasing amount of attention from the research community. We report our experience that cov- ers not only core NLP tasks such as normalisation, language identification, language modelling, part-of-speech tagging and dependency parsing but also more downstream ones such as machine translation and automatic speech recognition. We highlight and discuss the key problems for each of the tasks with supporting examples from different language pairs and relevant previous work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2019

A Survey of Code-switched Speech and Language Processing

Code-switching, the alternation of languages within a conversation or ut...
research
09/27/2017

Replicability Analysis for Natural Language Processing: Testing Significance with Multiple Datasets

With the ever-growing amounts of textual data from a large variety of la...
research
12/04/2019

A Resource for Computational Experiments on Mapudungun

We present a resource for computational experiments on Mapudungun, a pol...
research
02/28/2015

The NLP Engine: A Universal Turing Machine for NLP

It is commonly accepted that machine translation is a more complex task ...
research
06/20/2022

Bilingual by default: Voice Assistants and the role of code-switching in creating a bilingual user experience

Conversational User Interfaces such as Voice Assistants are hugely popul...
research
12/19/2022

The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges

Code-Switching, a common phenomenon in written text and conversation, ha...

Please sign up or login with your details

Forgot password? Click here to reset