Fully Supervised Speaker Diarization

10/10/2018
by   Aonan Zhang, et al.
0

In this paper, we propose a fully supervised speaker diarization approach, named unbounded interleaved-state recurrent neural networks (UIS-RNN). Given extracted speaker-discriminative embeddings (a.k.a. d-vectors) from input utterances, each individual speaker is modeled by a parameter-sharing RNN, while the RNN states for different speakers interleave in the time domain. This RNN is naturally integrated with a distance-dependent Chinese restaurant process (ddCRP) to accommodate an unknown number of speakers. Our system is fully supervised and is able to learn from examples where time-stamped speaker labels are annotated. We achieved a 7.6 2000 CALLHOME, which is better than the state-of-the-art method using spectral clustering. Moreover, our method decodes in an online fashion while most state-of-the-art systems rely on offline clustering.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/04/2019

Supervised online diarization with sample mean loss for multi-domain data

Recently, a fully supervised speaker diarization approach was proposed (...
research
10/22/2019

Discriminative Neural Clustering for Speaker Diarisation

This paper proposes a novel method for supervised data clustering. The c...
research
06/10/2020

Speaker Diarization: Using Recurrent Neural Networks

Speaker Diarization is the problem of separating speakers in an audio. T...
research
08/09/2017

Speaker Diarization using Deep Recurrent Convolutional Neural Networks for Speaker Embeddings

In this paper we propose a new method of speaker diarization that employ...
research
08/30/2019

Enhancements for Audio-only Diarization Systems

In this paper two different approaches to enhance the performance of the...
research
09/12/2017

Addressee and Response Selection in Multi-Party Conversations with Speaker Interaction RNNs

In this paper, we study the problem of addressee and response selection ...
research
11/09/2022

Absolute decision corrupts absolutely: conservative online speaker diarisation

Our focus lies in developing an online speaker diarisation framework whi...

Please sign up or login with your details

Forgot password? Click here to reset