Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information

06/28/2023
by Jiuxin Lin, et al.

Target Speaker Extraction (TSE) has yielded outstanding performance in certain application scenarios for speech enhancement and source separation. However, obtaining auxiliary speaker-related information remains challenging in noisy environments with significant reverberation. Inspired by the recently proposed distance-based sound separation, we propose the near sound (NS) extractor, which leverages distance information for TSE. Through a mechanism called speaker embedding self-enrollment (SESE), it reliably extracts speaker information without requiring prior speaker enrollment. Full-band and sub-band modeling is introduced to improve the NS-Extractor's adaptability to environments with significant reverberation. Experimental results in several cross-dataset settings demonstrate the effectiveness of our improvements and the excellent performance of the proposed NS-Extractor in different application scenarios.


research
11/06/2019

The sound of my voice: speaker representation loss for target voice separation

Research on content and style representations has been widely studied in...
research
09/14/2023

Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models

Background noise considerably reduces the accuracy and reliability of sp...
research
10/25/2020

Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain

This paper introduces an improved target speaker extractor, referred to ...
research
06/28/2023

MC-SpEx: Towards Effective Speaker Extraction with Multi-Scale Interfusion and Conditional Speaker Modulation

The previous SpEx+ has yielded outstanding performance in speaker extrac...
research
06/27/2022

Extended U-Net for Speaker Verification in Noisy Environments

Background noise is a well-known factor that deteriorates the accuracy a...
research
07/01/2022

Distance-Based Sound Separation

We propose the novel task of distance-based sound separation, where soun...
research
09/17/2021

Speaker Placement Agnosticism: Improving the Distance-based Amplitude Panning Algorithm

Lossius et al. introduced the distance-based amplitude panning algorithm...
