Sudowoodo: a Chinese Lyric Imitation System with Source Lyrics

08/09/2023
by   Yongzhu Chang, et al.
0

Lyrics generation is a well-known application in natural language generation research, with several previous studies focusing on generating accurate lyrics using precise control such as keywords, rhymes, etc. However, lyrics imitation, which involves writing new lyrics by imitating the style and content of the source lyrics, remains a challenging task due to the lack of a parallel corpus. In this paper, we introduce \textbf{\textit{Sudowoodo}}, a Chinese lyrics imitation system that can generate new lyrics based on the text of source lyrics. To address the issue of lacking a parallel training corpus for lyrics imitation, we propose a novel framework to construct a parallel corpus based on a keyword-based lyrics model from source lyrics. Then the pairs \textit{(new lyrics, source lyrics)} are used to train the lyrics imitation model. During the inference process, we utilize a post-processing module to filter and rank the generated lyrics, selecting the highest-quality ones. We incorporated audio information and aligned the lyrics with the audio to form the songs as a bonus. The human evaluation results show that our framework can perform better lyric imitation. Meanwhile, the \textit{Sudowoodo} system and demo video of the system is available at \href{https://Sudowoodo.apps-hp.danlu.netease.com/}{Sudowoodo} and \href{https://youtu.be/u5BBT_j1L5M}{https://youtu.be/u5BBT\_j1L5M}.

READ FULL TEXT
research
11/26/2018

LSICC: A Large Scale Informal Chinese Corpus

Deep learning based natural language processing model is proven powerful...
research
11/22/2022

imitation: Clean Imitation Learning Implementations

imitation provides open-source implementations of imitation and reward l...
research
03/24/2020

Generating Chinese Poetry from Images via Concrete and Abstract Information

In recent years, the automatic generation of classical Chinese poetry ha...
research
05/31/2019

Effective writing style imitation via combinatorial paraphrasing

Stylometry can be used to profile authors based on their written text. T...
research
08/28/2023

TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models

Recently, there has been a growing interest in the field of controllable...
research
05/25/2023

The False Promise of Imitating Proprietary LLMs

An emerging method to cheaply improve a weaker language model is to fine...
research
11/07/2022

CELLS: A Parallel Corpus for Biomedical Lay Language Generation

Recent lay language generation systems have used Transformer models trai...

Please sign up or login with your details

Forgot password? Click here to reset