Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction

10/28/2022
by   Ming Cheng, et al.
0

Target-speaker voice activity detection is currently a promising approach for speaker diarization in complex acoustic environments. This paper presents a novel Sequence-to-Sequence Target-Speaker Voice Activity Detection (Seq2Seq-TSVAD) method that can efficiently address the joint modeling of large-scale speakers and predict high-resolution voice activities. Experimental results show that larger speaker capacity and higher output resolution can significantly reduce the diarization error rate (DER), which achieves the new state-of-the-art performance of 4.55 Track 1 of the DIHARD-III evaluation set under the widely-used evaluation metrics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/05/2021

The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge

This report describes the submission of the DKU-DukeECE-Lenovo team to t...
research
05/14/2020

Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario

Speaker diarization for real-life scenarios is an extremely challenging ...
research
12/06/2022

BC-VAD: A Robust Bone Conduction Voice Activity Detection

Voice Activity Detection (VAD) is a fundamental module in many audio app...
research
03/07/2023

TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings

Since diarization and source separation of meeting data are closely rela...
research
06/18/2020

Adversarially Trained Multi-Singer Sequence-To-Sequence Singing Synthesizer

This paper presents a high quality singing synthesizer that is able to m...
research
07/15/2023

Single and Multi-Speaker Cloned Voice Detection: From Perceptual to Learned Features

Synthetic-voice cloning technologies have seen significant advances in r...
research
11/02/2016

The Intelligent Voice 2016 Speaker Recognition System

This paper presents the Intelligent Voice (IV) system submitted to the N...

Please sign up or login with your details

Forgot password? Click here to reset