CodeSwitch-Reddit: Exploration of Written Multilingual Discourse in Online Discussion Forums

08/30/2019
by   Ella Rabinovich, et al.
0

In contrast to many decades of research on oral code-switching, the study of written multilingual productions has only recently enjoyed a surge of interest. Many open questions remain regarding the sociolinguistic underpinnings of written code-switching, and progress has been limited by a lack of suitable resources. We introduce a novel, large, and diverse dataset of written code-switched productions, curated from topical threads of multiple bilingual communities on the Reddit discussion platform, and explore questions that were mainly addressed in the context of spoken language thus far. We investigate whether findings in oral code-switching concerning content and style, as well as speaker proficiency, are carried over into written code-switching in discussion forums. The released dataset can further facilitate a range of research and practical activities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/19/2022

The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges

Code-Switching, a common phenomenon in written text and conversation, ha...
research
09/24/2018

Hindi-English Code-Switching Speech Corpus

Code-switching refers to the usage of two languages within a sentence or...
research
03/25/2019

A Survey of Code-switched Speech and Language Processing

Code-switching, the alternation of languages within a conversation or ut...
research
10/13/2020

X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models

Language models (LMs) have proven surprisingly successful at capturing f...
research
03/24/2021

Are Multilingual Models Effective in Code-Switching?

Multilingual language models have shown decent performance in multilingu...
research
08/29/2023

Shared Lexical Items as Triggers of Code Switching

Why do bilingual speakers code-switch (mix their two languages)? Among t...
research
04/04/2023

GPT-4 to GPT-3.5: 'Hold My Scalpel' – A Look at the Competency of OpenAI's GPT on the Plastic Surgery In-Service Training Exam

The Plastic Surgery In-Service Training Exam (PSITE) is an important ind...

Please sign up or login with your details

Forgot password? Click here to reset