DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team

02/23/2020
by Qingjian Lin, et al.

In this paper, we present the system submitted by the DKU-LENOVO team to the second DIHARD Speech Diarization Challenge. Our diarization system comprises multiple modules: voice activity detection (VAD), segmentation, speaker embedding extraction, similarity scoring, clustering, resegmentation, and overlap detection. For each module, we explore different techniques to enhance performance. Our final submission employs ResNet-LSTM based VAD, Deep ResNet based speaker embedding, LSTM based similarity scoring, and spectral clustering. Variational Bayes (VB) diarization is applied in the resegmentation stage, and overlap detection also brings a slight improvement. Our proposed system achieves a diarization error rate (DER) of 18.84%. Although our systems have reduced the DERs by 27.5% relative to the baselines, we believe that the diarization task is still very difficult.
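The abstract reports results as DER figures and as a reduction against baselines. As a minimal sketch of how such figures are commonly computed, the Python below uses the standard DER definition (missed speech, false alarm, and speaker confusion time divided by total reference speech time) together with a relative-reduction helper. The function names and the example durations are hypothetical and are not taken from the paper; the challenge's own scoring tool may differ in details such as overlap handling.

```python
def der(missed: float, false_alarm: float, confusion: float, total_speech: float) -> float:
    """Diarization error rate in percent: summed error time over reference speech time."""
    if total_speech <= 0:
        raise ValueError("total_speech must be positive")
    return 100.0 * (missed + false_alarm + confusion) / total_speech


def relative_reduction(baseline_der: float, system_der: float) -> float:
    """Relative DER reduction in percent, measured against the baseline DER."""
    if baseline_der <= 0:
        raise ValueError("baseline_der must be positive")
    return 100.0 * (baseline_der - system_der) / baseline_der


# Hypothetical durations in seconds, chosen only to illustrate the arithmetic.
example_der = der(missed=30.0, false_alarm=20.0, confusion=50.0, total_speech=1000.0)

# Hypothetical baseline and system DERs (percent), not the paper's figures.
example_gain = relative_reduction(baseline_der=20.0, system_der=15.0)
```

A relative reduction is expressed as a fraction of the baseline DER, so it is not the same as the difference in percentage points between the two systems.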


research
02/06/2021

The DKU-Duke-Lenovo System Description for the Third DIHARD Speech Diarization Challenge

In this paper, we present the submitted system for the third DIHARD Spee...
research
07/23/2019

LSTM based Similarity Measurement with Spectral Clustering for Speaker Diarization

More and more neural network approaches have achieved considerable impro...
research
02/06/2022

Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge

In this paper, we present the speaker diarization system for the Multi-c...
research
12/02/2019

Speaker detection in the wild: Lessons learned from JSALT 2019

This paper presents the problems and solutions addressed at the JSALT wo...
research
06/08/2021

End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection

In this paper, we present a conditional multitask learning method for en...
research
11/05/2020

Multi-class Spectral Clustering with Overlaps for Speaker Diarization

This paper describes a method for overlap-aware speaker diarization. Giv...
research
05/27/2019

UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge

In this paper, we present our system developed by the team from the New ...
