It's Only Words And Words Are All I Have

01/16/2019
by Manash Pratim Barman, et al.

The central idea of this paper is to demonstrate the strength of lyrics for music mining and natural language processing (NLP) tasks using the distributed representation paradigm. For music mining, we address two prediction tasks for songs: genre and popularity. Existing work on both problems has two major bottlenecks. First, it represents lyrics using handcrafted features that require intricate knowledge of language and music. Second, it treats lyrics as a weak indicator of genre and popularity. We overcome both bottlenecks by representing lyrics with distributed representations. In our work, genre identification is a multi-class classification task, whereas popularity prediction is a binary classification task. We achieve an F1 score of around 0.6 on both tasks using only lyrics. Distributed representations of words are now heavily used in NLP algorithms. We show that lyrics can be used to improve the quality of these representations.
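As a rough illustration of the two ideas the abstract relies on, the sketch below builds a fixed-length lyric vector by averaging word vectors and computes an F1 score for binary and multi-class predictions. It is not the authors' pipeline: the tiny `TOY_VECTORS` table, the averaging scheme, the function names, and the choice of macro-averaged F1 are all assumptions made for illustration, and real embeddings would be learned from a large lyrics corpus.

```python
# Hypothetical 2-dimensional word vectors, hand-picked for illustration only.
TOY_VECTORS = {
    "love": [1.0, 0.0],
    "heart": [0.8, 0.2],
    "guitar": [0.0, 1.0],
    "road": [0.2, 0.8],
}


def lyric_vector(lyrics, vectors=TOY_VECTORS):
    """Average the vectors of known words; words without a vector are skipped."""
    dim = len(next(iter(vectors.values())))
    known = [vectors[w] for w in lyrics.lower().split() if w in vectors]
    if not known:
        return [0.0] * dim  # no known words: fall back to the zero vector
    return [sum(v[i] for v in known) / len(known) for i in range(dim)]


def f1_score(y_true, y_pred, positive):
    """F1 for one class, treating `positive` as the positive label."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # precision and recall are both zero or undefined
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (one possible multi-class average)."""
    labels = sorted(set(y_true) | set(y_pred))
    return sum(f1_score(y_true, y_pred, lab) for lab in labels) / len(labels)


if __name__ == "__main__":
    print(lyric_vector("Love guitar"))
    genres_true = ["pop", "pop", "rock", "rock"]
    genres_pred = ["pop", "rock", "rock", "rock"]
    print(macro_f1(genres_true, genres_pred))
```

In a setup like this, the lyric vectors would be fed to any standard classifier. The binary popularity task would be scored with `f1_score` on the positive class, and the multi-class genre task with an averaged variant such as `macro_f1`.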
