Acquisition of Translation Lexicons for Historically Unwritten Languages via Bridging Loanwords

06/06/2017
by Michael Bloodgood, et al.

With the advent of informal electronic communications such as social media, colloquial languages that were historically unwritten are being written for the first time in heavily code-switched environments. We present a method for inducing portions of translation lexicons through the use of expert knowledge in these settings where there are approximately zero resources available other than a language informant, potentially not even large amounts of monolingual data. We investigate inducing a Moroccan Darija-English translation lexicon via French loanwords bridging into English and find that a useful lexicon is induced for human-assisted translation and statistical machine translation.
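
The bridging idea in the abstract can be illustrated with a small sketch. The abstract says the mapping relies on expert knowledge; the sketch below swaps in plain string similarity from Python's standard library as a stand-in, so it is not the authors' method. The word lists, the function names `normalize` and `induce_lexicon`, and the `cutoff` value are all hypothetical and chosen only for illustration.

```python
import unicodedata
from difflib import get_close_matches

# Hypothetical toy French-English dictionary (not data from the paper).
FRENCH_TO_ENGLISH = {
    "automobile": "car",
    "téléphone": "telephone",
    "bureau": "office",
}


def normalize(word):
    """Lowercase a word and strip accents so spellings are comparable."""
    decomposed = unicodedata.normalize("NFD", word.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def induce_lexicon(source_words, french_to_english, cutoff=0.6):
    """Map each source word to English via its closest French spelling.

    Assumption: string similarity approximates loanword detection.
    Words with no sufficiently close French candidate are left out.
    """
    bridge = {normalize(fr): en for fr, en in french_to_english.items()}
    candidates = list(bridge)
    lexicon = {}
    for word in source_words:
        matches = get_close_matches(normalize(word), candidates, n=1, cutoff=cutoff)
        if matches:
            lexicon[word] = bridge[matches[0]]
    return lexicon


# Illustrative romanized spellings, assumed for this example only.
lexicon = induce_lexicon(["tomobil", "bzaf"], FRENCH_TO_ENGLISH)
print(lexicon)
```

Only part of the vocabulary is covered this way, which matches the abstract's framing of inducing portions of a lexicon. Surface similarity alone also misses loanwords whose spelling has drifted far from the French form, which is one reason a language informant's knowledge would still be needed.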

research
10/21/2022

Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data

Code-mixed machine translation has become an important task in multiling...
research
03/19/2020

Utilizing Language Relatedness to improve Machine Translation: A Case Study on Languages of the Indian Subcontinent

In this work, we present an extensive study of statistical machine trans...
research
09/10/2023

The Effect of Alignment Objectives on Code-Switching Translation

One of the things that need to change when it comes to machine translati...
research
04/20/2020

PHINC: A Parallel Hinglish Social Media Code-Mixed Corpus for Machine Translation

Code-mixing is the phenomenon of using more than one language in a sente...
research
06/11/2023

Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction

Neural models have drastically advanced state of the art for machine tra...
research
10/19/2018

Mainumby: un Ayudante para la Traducción Castellano-Guaraní

A wide range of applications play an important role in the daily work of...
research
11/27/2019

Sideways Transliteration: How to Transliterate Multicultural Person Names?

In a global setting, texts contain transliterated names from many cultur...
