Similarity-and-Independence-Aware Beamformer: Method for Target Source Extraction using Magnitude Spectrogram as Reference

06/01/2020
by Atsuo Hiroe, et al.

This study presents a novel method for source extraction called the similarity-and-independence-aware beamformer (SIBF). The SIBF extracts the target signal using a rough magnitude spectrogram of the target as the reference signal. Its advantage is that the extracted signal can be more accurate than the spectrogram generated by target-enhancing methods, such as speech enhancement based on deep neural networks (DNNs). To realize such extraction, we extend the framework of deflationary independent component analysis by considering the similarity between the reference and the extracted target, as well as the mutual independence among all potential sources. To solve this extraction problem by maximum-likelihood estimation, we introduce two types of source models that can reflect the similarity. Experimental results on the CHiME3 dataset show that the SIBF can extract the target signal more accurately than the reference generated by the DNN.
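
The abstract does not spell out the two source models, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' implementation. It assumes a time-varying Gaussian source model whose variance is the squared reference. Under that assumption, the maximum-likelihood filter on whitened observations reduces to the eigenvector with the smallest eigenvalue of a reference-weighted covariance matrix. The function name `extract_with_reference`, the floor `eps`, the single-frequency-bin setup, and the synthetic data are all hypothetical and chosen for illustration.

```python
import numpy as np


def extract_with_reference(x, r, eps=1e-3):
    """Extract one source from x (mics, frames) given a rough magnitude r (frames,).

    Assumption (not from the abstract): a time-varying Gaussian source model
    with variance r**2, applied to a single frequency bin.
    """
    n_frames = x.shape[1]
    # 1) Whiten the observations so the remaining unknown is a unit-norm filter.
    cov = x @ x.conj().T / n_frames
    evals, evecs = np.linalg.eigh(cov)
    whitener = evecs @ np.diag(evals ** -0.5) @ evecs.conj().T
    u = whitener @ x
    # 2) Covariance weighted by 1 / r^2: frames where the reference is small
    #    penalize any output energy heavily (this encodes the similarity).
    weights = 1.0 / np.maximum(r, eps) ** 2
    wcov = (u * weights) @ u.conj().T / n_frames
    # 3) eigh returns eigenvalues in ascending order, so column 0 minimizes
    #    w^H wcov w subject to ||w|| = 1.
    _, vecs = np.linalg.eigh(wcov)
    w = vecs[:, 0]
    return w.conj() @ u


def abs_corr(a, b):
    """Scale-invariant similarity between two sequences (uncentered)."""
    return np.abs(np.vdot(a, b)) / (np.linalg.norm(a) * np.linalg.norm(b))


rng = np.random.default_rng(0)
T, M = 4000, 3  # frames and microphones (toy sizes)


def cgauss(*shape):
    """Circular complex Gaussian samples with unit variance."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)


# Target with an on/off envelope, plus two stationary interferers.
envelope = np.where((np.arange(T) // 200) % 2 == 0, 1.0, 0.1)
target = envelope * cgauss(T)
sources = np.vstack([target, cgauss(M - 1, T)])
x = cgauss(M, M) @ sources  # random instantaneous mixing in one bin

# Rough reference: the true envelope with multiplicative jitter.
reference = envelope * np.exp(0.3 * rng.standard_normal(T))

y = extract_with_reference(x, reference)
print("similarity of extracted signal to target:", abs_corr(y, target))
print("magnitude similarity, extracted vs. target:", abs_corr(np.abs(y), np.abs(target)))
print("magnitude similarity, reference vs. target:", abs_corr(reference, np.abs(target)))
```

In this toy setting the reference only needs to follow the target's activity pattern roughly. The independence assumption handles the rest, which is why the extracted magnitude can end up closer to the target than the reference itself. The output scale is arbitrary here, so the comparison uses a scale-invariant similarity.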


