Improving automated segmentation of radio shows with audio embeddings

02/12/2020
by   Oberon Berlage, et al.
0

Audio features have been proven useful for increasing the performance of automated topic segmentation systems. This study explores the novel task of using audio embeddings for automated, topically coherent segmentation of radio shows. We created three different audio embedding generators using multi-class classification tasks on three datasets from different domains. We evaluate topic segmentation performance of the audio embeddings and compare it against a text-only baseline. We find that a set-up including audio embeddings generated through a non-speech sound event classification task significantly outperforms our text-only baseline by 32.3 different classification tasks yield audio embeddings that vary in segmentation performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2019

Improving performance and inference on audio classification tasks using capsule networks

Classification of audio samples is an important part of many auditory sy...
research
08/30/2021

Unsupervised Learning of Deep Features for Music Segmentation

Music segmentation refers to the dual problem of identifying boundaries ...
research
05/30/2023

Audio classification using ML methods

Machine Learning systems have achieved outstanding performance in differ...
research
06/21/2023

A Multimodal Prototypical Approach for Unsupervised Sound Classification

In the context of environmental sound classification, the adaptability o...
research
02/19/2021

Artificially Synthesising Data for Audio Classification and Segmentation to Improve Speech and Music Detection in Radio Broadcast

Segmenting audio into homogeneous sections such as music and speech help...
research
06/24/2021

Unsupervised Topic Segmentation of Meetings with BERT Embeddings

Topic segmentation of meetings is the task of dividing multi-person meet...
research
03/20/2022

A Study on Robustness to Perturbations for Representations of Environmental Sound

Audio applications involving environmental sound analysis increasingly u...

Please sign up or login with your details

Forgot password? Click here to reset