Frame-level multi-channel speaker verification with large-scale ad-hoc microphone arrays

10/12/2021
by Chengdong Liang, et al.

Ad-hoc microphone arrays, in which the number and arrangement of microphones are unknown, have received attention. Traditional multi-channel processing methods cannot be applied directly to ad-hoc arrays. To address this problem, an utterance-level automatic speaker verification (ASV) method with ad-hoc microphone arrays was recently proposed: it first extracts an utterance-level speaker embedding from each channel of the array and then fuses the embeddings for the final verification. However, because fusion happens only after each channel has been reduced to a single embedding, this method cannot make full use of cross-channel information. In this paper, we present a novel multi-channel ASV model that operates at the frame level. Specifically, we add spatio-temporal processing blocks (STB) before the pooling layer; these blocks model the contextual relationships within and between channels and across time. The channel-attended outputs of the STB are sent to the pooling layer to obtain an utterance-level speaker representation. Experimental results demonstrate the effectiveness of the proposed method.
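The frame-level idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' model: the abstract does not specify the internals of the STB, so the parameter-free dot-product self-attention, the residual connections, the mean fusion over channels, and the mean-plus-standard-deviation pooling used here are all assumptions, and every function name is hypothetical. The sketch only shows where cross-channel and cross-time processing sit relative to pooling.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(x):
    """Parameter-free scaled dot-product self-attention (an assumption,
    standing in for whatever learned attention the paper uses).

    x has shape (..., n, d); attention runs over the n axis.
    """
    d = x.shape[-1]
    scores = x @ np.swapaxes(x, -1, -2) / np.sqrt(d)
    return softmax(scores, axis=-1) @ x


def spatio_temporal_block(feats):
    """Hypothetical STB applied to frame-level features.

    feats has shape (channels, frames, dim).
    """
    # Cross-channel step: for every frame, attend over the channels.
    per_frame = np.swapaxes(feats, 0, 1)            # (frames, channels, dim)
    per_frame = per_frame + self_attention(per_frame)
    feats = np.swapaxes(per_frame, 0, 1)            # (channels, frames, dim)
    # Temporal step: for every channel, attend over the frames.
    return feats + self_attention(feats)


def pool_utterance(feats):
    """Fuse channels by averaging, then apply mean + std pooling over frames."""
    fused = feats.mean(axis=0)                      # (frames, dim)
    return np.concatenate([fused.mean(axis=0), fused.std(axis=0)])


rng = np.random.default_rng(0)
x = rng.standard_normal((5, 20, 8))                 # 5 channels, 20 frames, 8 dims
emb = pool_utterance(spatio_temporal_block(x))
print(emb.shape)                                    # (16,)
```

Because attention and mean fusion do not depend on a fixed channel count or ordering, the same sketch accepts any number of microphones, which is the property an ad-hoc array requires.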

research
07/01/2021

Attention-based multi-channel speaker verification with ad-hoc microphone arrays

Recently, ad-hoc microphone array has been widely studied. Unlike tradit...
research
12/01/2020

Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation

Recently, the research on ad-hoc microphone arrays with deep learning ha...
research
07/03/2023

Spatial-temporal Graph Based Multi-channel Speaker Verification With Ad-hoc Microphone Arrays

The performance of speaker verification degrades significantly in advers...
research
06/25/2020

Will Dynamic Arrays finally change the way Models are built?

Spreadsheets offer a supremely successful and intuitive means of process...
research
03/29/2021

Scaling sparsemax based channel selection for speech recognition with ad-hoc microphone arrays

Recently, speech recognition with ad-hoc microphone arrays has received ...
research
08/31/2023

Excel as a Turing-complete Functional Programming Environment

Since the calculation engine of Excel was the subject of a major upgrade...
research
05/20/2023

Joining the Conversation: Towards Language Acquisition for Ad Hoc Team Play

In this paper, we propose and consider the problem of cooperative langua...
