K-pop Lyric Translation: Dataset, Analysis, and Neural-Modelling

09/20/2023
by   Haven Kim, et al.
0

Lyric translation, a field studied for over a century, is now attracting computational linguistics researchers. We identified two limitations in previous studies. Firstly, lyric translation studies have predominantly focused on Western genres and languages, with no previous study centering on K-pop despite its popularity. Second, the field of lyric translation suffers from a lack of publicly available datasets; to the best of our knowledge, no such dataset exists. To broaden the scope of genres and languages in lyric translation studies, we introduce a novel singable lyric translation dataset, approximately 89% of which consists of K-pop song lyrics. This dataset aligns Korean and English lyrics line-by-line and section-by-section. We leveraged this dataset to unveil unique characteristics of K-pop lyric translation, distinguishing it from other extensively studied genres, and to construct a neural lyric translation model, thereby underscoring the importance of a dedicated dataset for singable lyric translations.

READ FULL TEXT

page 3

page 5

research
08/26/2023

A Computational Evaluation Framework for Singable Lyric Translation

Lyric translation plays a pivotal role in amplifying the global resonanc...
research
07/01/2018

Lost in Translation: Analysis of Information Loss During Machine Translation Between Polysynthetic and Fusional Languages

Machine translation from polysynthetic to fusional languages is a challe...
research
06/14/2020

FFR v1.1: Fon-French Neural Machine Translation

All over the world and especially in Africa, researchers are putting eff...
research
03/26/2020

FFR V1.0: Fon-French Neural Machine Translation

Africa has the highest linguistic diversity in the world. On account of ...
research
06/06/2021

Itihasa: A large-scale corpus for Sanskrit to English translation

This work introduces Itihasa, a large-scale translation dataset containi...
research
10/19/2018

Mainumby: un Ayudante para la Traducción Castellano-Guaraní

A wide range of applications play an important role in the daily work of...
research
02/15/2023

NL2CMD: An Updated Workflow for Natural Language to Bash Commands Translation

Translating natural language into Bash Commands is an emerging research ...

Please sign up or login with your details

Forgot password? Click here to reset