A General Framework for Learning Prosodic-Enhanced Representation of Rap Lyrics

03/23/2021
by   Hongru Liang, et al.
4

Learning and analyzing rap lyrics is a significant basis for many web applications, such as music recommendation, automatic music categorization, and music information retrieval, due to the abundant source of digital music in the World Wide Web. Although numerous studies have explored the topic, knowledge in this field is far from satisfactory, because critical issues, such as prosodic information and its effective representation, as well as appropriate integration of various features, are usually ignored. In this paper, we propose a hierarchical attention variational autoencoder framework (HAVAE), which simultaneously consider semantic and prosodic features for rap lyrics representation learning. Specifically, the representation of the prosodic features is encoded by phonetic transcriptions with a novel and effective strategy (i.e., rhyme2vec). Moreover, a feature aggregation strategy is proposed to appropriately integrate various features and generate prosodic-enhanced representation. A comprehensive empirical evaluation demonstrates that the proposed framework outperforms the state-of-the-art approaches under various metrics in different rap lyrics learning tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/31/2018

Modeling Melodic Feature Dependency with Modularized Variational Auto-Encoder

Automatic melody generation has been a long-time aspiration for both AI ...
research
08/28/2018

Representation Learning for Image-based Music Recommendation

Image perception is one of the most direct ways to provide contextual in...
research
04/15/2020

Musical Features for Automatic Music Transcription Evaluation

This technical report gives a detailed, formal description of the featur...
research
02/12/2018

One Deep Music Representation to Rule Them All? : A comparative analysis of different representation learning strategies

Inspired by the success of deploying deep learning in the fields of Comp...
research
02/11/2022

The HaMSE Ontology: Using Semantic Technologies to support Music Representation Interoperability and Musicological Analysis

The use of Semantic Technologies - in particular the Semantic Web - has ...
research
11/02/2021

Multi-input Architecture and Disentangled Representation Learning for Multi-dimensional Modeling of Music Similarity

In the context of music information retrieval, similarity-based approach...
research
03/06/2021

ReadNet: A Hierarchical Transformer Framework for Web Article Readability Analysis

Analyzing the readability of articles has been an important sociolinguis...

Please sign up or login with your details

Forgot password? Click here to reset