Speaker Clustering Using Dominant Sets

05/21/2018
by Feliks Hibraj, et al.

Speaker clustering is the task of forming speaker-specific groups from a set of utterances. In this paper, we address this task using Dominant Sets (DS), a graph-based clustering algorithm whose properties suit the problem well and which has not previously been applied to speaker clustering. We report a comprehensive set of experiments on the TIMIT dataset, comparing against standard clustering techniques and against methods designed specifically for speaker clustering. We also compare performance under different features: features learned by a deep neural network trained directly on TIMIT, and features extracted from a pre-trained VGGVox network. To assess stability, we perform a sensitivity analysis on the free parameters of our method, which shows that performance is stable under parameter changes. The experiments confirm the validity of the proposed method, which achieves state-of-the-art results under three standard metrics. We also report, for the first time, reference baseline results for speaker clustering on the entire TIMIT dataset.
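
As a rough illustration of how Dominant Sets clustering operates, the sketch below extracts clusters from a pairwise affinity matrix with replicator dynamics, the standard procedure for this algorithm, and then "peels off" each cluster found before repeating on the remaining points. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `dominant_set` and `cluster_by_peeling`, the `cutoff` and `tol` values, and the toy affinity matrix are all hypothetical, and the abstract does not say how affinities between utterance features are computed.

```python
import numpy as np

def dominant_set(A, tol=1e-6, max_iter=1000):
    """Replicator dynamics on a symmetric, non-negative affinity
    matrix A with zero diagonal. Returns a weight vector on the
    simplex; its support approximates one dominant set."""
    n = A.shape[0]
    x = np.full(n, 1.0 / n)          # start from the simplex barycenter
    for _ in range(max_iter):
        Ax = A @ x
        denom = x @ Ax
        if denom <= 0:               # no positive affinities left
            break
        x_new = x * Ax / denom       # discrete replicator update
        converged = np.abs(x_new - x).sum() < tol
        x = x_new
        if converged:
            break
    return x

def cluster_by_peeling(A, cutoff=1e-4):
    """Assumed peeling strategy: extract a dominant set, label its
    members, remove them, and repeat until no points remain."""
    remaining = np.arange(A.shape[0])
    labels = -np.ones(A.shape[0], dtype=int)
    k = 0
    while remaining.size > 0:
        sub = A[np.ix_(remaining, remaining)]
        x = dominant_set(sub)
        members = x > cutoff * x.max()   # points with non-negligible weight
        labels[remaining[members]] = k
        remaining = remaining[~members]
        k += 1
    return labels

# Toy example (hypothetical data): two groups of sizes 4 and 3 with
# high within-group and low between-group affinity.
groups = np.array([0, 0, 0, 0, 1, 1, 1])
A = np.where(groups[:, None] == groups[None, :], 0.9, 0.05)
np.fill_diagonal(A, 0.0)
labels = cluster_by_peeling(A)
```

Note that the number of clusters is not passed in; it falls out of the peeling loop, which is one of the properties that makes the approach attractive when the number of speakers is unknown.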

