MusicNet: Compact Convolutional Neural Network for Real-time Background Music Detection

10/08/2021
by   Chandan K A Reddy, et al.
0

With the recent growth of remote and hybrid work, online meetings often encounter challenging audio contexts such as background noise, music, and echo. Accurate real-time detection of music events can help to improve the user experience in such scenarios, e.g., by switching to high-fidelity music-specific codec or selecting the optimal noise suppression model. In this paper, we present MusicNet – a compact high-performance model for detecting background music in the real-time communications pipeline. In online video meetings, which is our main use case, music almost always co-occurs with speech and background noises, making the accurate classification quite challenging. The proposed model is a binary classifier that consists of a compact convolutional neural network core preceded by an in-model featurization layer. It takes 9 seconds of raw audio as input and does not require any model-specific featurization on the client. We train our model on a balanced subset of the AudioSet data and use 1000 crowd-sourced real test clips to validate the model. Finally, we compare MusicNet performance to 20 other state-of-the-art models. Our classifier gives a true positive rate of 81.3 rate, which is significantly better than any other model in the study. Our model is also 10x smaller and has 4x faster inference than the comparable baseline.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/23/2019

Automatic Lyrics Transcription in Polyphonic Music: Does Background Music Help?

Background music affects lyrics intelligibility of singing vocals in a m...
research
09/23/2019

Automatic Lyrics Alignment and Transcription in Polyphonic Music: Does Background Music Help?

Background music affects lyrics intelligibility of singing vocals in a m...
research
12/01/2018

SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation

Speech, Music and Noise classification/segmentation is an important prep...
research
08/07/2015

An End-to-End Neural Network for Polyphonic Piano Music Transcription

We present a supervised neural network model for polyphonic piano music ...
research
10/11/2022

ConchShell: A Generative Adversarial Networks that Turns Pictures into Piano Music

We present ConchShell, a multi-modal generative adversarial framework th...
research
12/21/2022

Audio Denoising for Robust Audio Fingerprinting

Music discovery services let users identify songs from short mobile reco...
research
10/06/2022

Feasibility on Detecting Door Slamming towards Monitoring Early Signs of Domestic Violence

By using low-cost microcontrollers and TinyML, we investigate the feasib...

Please sign up or login with your details

Forgot password? Click here to reset