On Filter Generalization for Music Bandwidth Extension Using Deep Neural Networks

11/14/2020
by   Serkan Sulun, et al.
0

In this paper, we address a sub-topic of the broad domain of audio enhancement, namely musical audio bandwidth extension. We formulate the bandwidth extension problem using deep neural networks, where a band-limited signal is provided as input to the network, with the goal of reconstructing a full-bandwidth output. Our main contribution centers on the impact of the choice of low pass filter when training and subsequently testing the network. For two different state of the art deep architectures, ResNet and U-Net, we demonstrate that when the training and testing filters are matched, improvements in signal-to-noise ratio (SNR) of up to 7dB can be obtained. However, when these filters differ, the improvement falls considerably and under some training conditions results in a lower SNR than the band-limited input. To circumvent this apparent overfitting to filter shape, we propose a data augmentation strategy which utilizes multiple low pass filters during training and leads to improved generalization to unseen filtering conditions at test time.

READ FULL TEXT

page 1

page 4

page 9

research
09/29/2019

FaSNet: Low-latency Adaptive Beamforming for Multi-microphone Audio Processing

Beamforming has been extensively investigated for multi-channel audio pr...
research
07/10/2021

Beyond Low-pass Filtering: Graph Convolutional Networks with Automatic Filtering

Graph convolutional networks are becoming indispensable for deep learnin...
research
12/26/2019

Efficient Training of Deep Classifiers for Wireless Source Identification using Test SNR Estimates

We investigate the potential of training time reduction for deep learnin...
research
04/07/2020

SNR-Based Features and Diverse Training Data for Robust DNN-Based Speech Enhancement

This paper analyzes the generalization of speech enhancement algorithms ...
research
06/06/2022

Continuous-Time Analog Filters for Audio Edge Intelligence: Review and Analysis on Design Techniques

Silicon cochlea designs capture the functionality of the biological coch...
research
03/17/2023

Configurable EBEN: Extreme Bandwidth Extension Network to enhance body-conducted speech capture

This paper presents a configurable version of Extreme Bandwidth Extensio...
research
03/21/2018

Efficient Bandwidth Estimation in Two-dimensional Filtered Backprojection Reconstruction

A method to efficiently estimate the bandwidth of the reconstruction fil...

Please sign up or login with your details

Forgot password? Click here to reset