An Analysis of Classification Approaches for Hit Song Prediction using Engineered Metadata Features with Lyrics and Audio Features

01/31/2023
by   Mengyisong Zhao, et al.
0

Hit song prediction, one of the emerging fields in music information retrieval (MIR), remains a considerable challenge. Being able to understand what makes a given song a hit is clearly beneficial to the whole music industry. Previous approaches to hit song prediction have focused on using audio features of a record. This study aims to improve the prediction result of the top 10 hits among Billboard Hot 100 songs using more alternative metadata, including song audio features provided by Spotify, song lyrics, and novel metadata-based features (title topic, popularity continuity and genre class). Five machine learning approaches are applied, including: k-nearest neighbours, Naive Bayes, Random Forest, Logistic Regression and Multilayer Perceptron. Our results show that Random Forest (RF) and Logistic Regression (LR) with all features (including novel features, song audio features and lyrics features) outperforms other models, achieving 89.1 AUC, respectively. Our findings also demonstrate the utility of our novel music metadata features, which contributed most to the models' discriminative performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/16/2021

Context-Based Music Recommendation Algorithm Evaluation

Artificial Intelligence (AI ) has been very successful in creating and p...
research
12/03/2018

Music Popularity: Metrics, Characteristics, and Audio-Based Prediction

Understanding music popularity is important not only for the artists who...
research
12/17/2017

Using Deep learning methods for generation of a personalized list of shuffled songs

The shuffle mode, where songs are played in a randomized order that is d...
research
08/16/2020

Prediction of Homicides in Urban Centers: A Machine Learning Approach

Relevant research has been standing out in the computing community aimin...
research
10/16/2020

Hit Song Prediction Based on Early Adopter Data and Audio Features

Billions of USD are invested in new artists and songs by the music indus...
research
05/03/2022

Detecting Phishing sites Without Visiting them

Now-a-days, cyberattacks are increasing at an unprecedented rate. Phishi...
research
02/28/2020

UKARA 1.0 Challenge Track 1: Automatic Short-Answer Scoring in Bahasa Indonesia

We describe our third-place solution to the UKARA 1.0 challenge on autom...

Please sign up or login with your details

Forgot password? Click here to reset