Improved feature extraction for CRNN-based multiple sound source localization

05/05/2021
by   Pierre-Amaury Grumiaux, et al.
0

In this work, we propose to extend a state-of-the-art multi-source localization system based on a convolutional recurrent neural network and Ambisonics signals. We significantly improve the performance of the baseline network by changing the layout between convolutional and pooling layers. We propose several configurations with more convolutional layers and smaller pooling sizes in-between, so that less information is lost across the layers, leading to a better feature extraction. In parallel, we test the system's ability to localize up to 3 sources, in which case the improved feature extraction provides the most significant boost in accuracy. We evaluate and compare these improved configurations on synthetic and real-world data. The obtained results show a quite substantial improvement of the multiple sound source localization performance over the baseline network.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/10/2020

Learning Multiple Sound Source 2D Localization

In this paper, we propose novel deep learning based algorithms for multi...
research
05/14/2015

A PCA-Based Convolutional Network

In this paper, we propose a novel unsupervised deep learning model, call...
research
07/23/2021

SALADnet: Self-Attentive multisource Localization in the Ambisonics Domain

In this work, we propose a novel self-attention based neural network for...
research
02/18/2016

Convolutional RNN: an Enhanced Model for Extracting Features from Sequential Data

Traditional convolutional layers extract features from patches of data b...
research
09/20/2023

CNN-based local features for navigation near an asteroid

This article addresses the challenge of vision-based proximity navigatio...
research
11/17/2021

Exploring Unsupervised Learning Methods for Automated Protocol Analysis

The ability to analyse and differentiate network protocol traffic is cru...
research
10/18/2022

Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection

In this technical report, the systems we submitted for subtask 4 of the ...

Please sign up or login with your details

Forgot password? Click here to reset