Domestic activities clustering from audio recordings using convolutional capsule autoencoder network

05/08/2021
by   Ziheng Lin, et al.
0

Recent efforts have been made on domestic activities classification from audio recordings, especially the works submitted to the challenge of DCASE (Detection and Classification of Acoustic Scenes and Events) since 2018. In contrast, few studies were done on domestic activities clustering, which is a newly emerging problem. Domestic activities clustering from audio recordings aims at merging audio clips which belong to the same class of domestic activity into a single cluster. Domestic activities clustering is an effective way for unsupervised estimation of daily activities performed in home environment. In this study, we propose a method for domestic activities clustering using a convolutional capsule autoencoder network (CCAN). In the method, the deep embeddings are learned by the autoencoder in the CCAN, while the deep embeddings which belong to the same class of domestic activities are merged into a single cluster by a clustering layer in the CCAN. Evaluated on a public dataset adopted in DCASE-2018 Task 5, the results show that the proposed method outperforms state-of-the-art methods in terms of the metrics of clustering accuracy and normalized mutual information.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/04/2022

Domestic Activity Clustering from Audio via Depthwise Separable Convolutional Autoencoder Network

Automatic estimation of domestic activities from audio can be used to so...
research
06/09/2023

Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration

Recent efforts have been made on acoustic scene classification in the au...
research
06/09/2023

Domestic Activities Classification from Audio Recordings Using Multi-scale Dilated Depthwise Separable Convolutional Network

Domestic activities classification (DAC) from audio recordings aims at c...
research
07/01/2021

Audiovisual Singing Voice Separation

Separating a song into vocal and accompaniment components is an active r...
research
08/20/2021

Suspicious ARP Activity Detection and Clustering Based on Autoencoder Neural Networks

The rapidly increasing number of smart devices on the Internet necessita...
research
07/30/2018

DCASE 2018 Challenge - Task 5: Monitoring of domestic activities based on multi-channel acoustics

The DCASE 2018 Challenge consists of five tasks related to automatic cla...
research
05/15/2020

An Auto Encoder For Audio Dolphin Communication

Research in dolphin communication and cognition requires detailed inspec...

Please sign up or login with your details

Forgot password? Click here to reset