Clova Baseline System for the VoxCeleb Speaker Recognition Challenge 2020

09/29/2020
by   Hee-Soo Heo, et al.
0

This report describes our submission to the VoxCeleb Speaker Recognition Challenge (VoxSRC) at Interspeech 2020. We perform a careful analysis of speaker recognition models based on the popular ResNet architecture, and train a number of variants using a range of loss functions. Our results show significant improvements over most existing works without the use of model ensemble or post-processing. We release the training code and pre-trained models as unofficial baselines for this year's challenge.

READ FULL TEXT

page 1

page 2

page 3

research
10/29/2020

The ins and outs of speaker recognition: lessons from VoxSRC 2020

The VoxCeleb Speaker Recognition Challenge (VoxSRC) at Interspeech 2020 ...
research
06/25/2019

Naver at ActivityNet Challenge 2019 -- Task B Active Speaker Detection (AVA)

This report describes our submission to the ActivityNet Challenge at CVP...
research
10/16/2020

Tongji University Team for the VoxCeleb Speaker Recognition Challenge 2020

In this report, we describe the submission of Tongji University team to ...
research
11/04/2020

Query Expansion System for the VoxCeleb Speaker Recognition Challenge 2020

In this report, we describe our submission to the VoxCeleb Speaker Recog...
research
10/23/2020

EML System Description for VoxCeleb Speaker Diarization Challenge 2020

This technical report describes the EML submission to the first VoxCeleb...
research
01/25/2019

LOCATA challenge: speaker localization with a planar array

This document describes our submission to the 2018 LOCalization And TrAc...
research
11/03/2020

ShaneRun System Description to VoxCeleb Speaker Recognition Challenge 2020

In this report, we describe the submission of ShaneRun's team to the Vox...

Please sign up or login with your details

Forgot password? Click here to reset