Multi-task learning of speech and speaker recognition

02/24/2023
by   Nik Vaessen, et al.
0

We study multi-task learning for two orthogonal speech technology tasks: speech and speaker recognition. We use wav2vec2 as a base architecture with two task-specific output heads. We experiment with different methods to mix speaker and speech information in the output embedding sequence, and propose a simple dynamic approach to balance the speech and speaker recognition loss functions. Our multi-task learning networks can produce a shared speaker and speech embedding, which are evaluated on the LibriSpeech and VoxCeleb test sets, and achieve a performance comparable to separate single-task models. Code is available at https://github.com/nikvaessen/2022-repo-mt-w2v2.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/31/2016

Multi-task Recurrent Model for Speech and Speaker Recognition

Although highly correlated, speech and speaker recognition have been reg...
research
09/30/2021

Fine-tuning wav2vec2 for speaker recognition

This paper explores applying the wav2vec2 framework to speaker recogniti...
research
11/15/2022

Cross-Stitched Multi-task Dual Recursive Networks for Unified Single Image Deraining and Desnowing

We present the Cross-stitched Multi-task Unified Dual Recursive Network ...
research
11/18/2019

Multi-Task Learning of Height and Semantics from Aerial Images

Aerial or satellite imagery is a great source for land surface analysis,...
research
10/26/2019

Sum-Product Networks for Robust Automatic Speaker Recognition

The performance of a speaker recognition system degrades considerably in...
research
01/26/2020

Multi-task Learning for Speaker Verification and Voice Trigger Detection

Automatic speech transcription and speaker recognition are usually treat...
research
05/07/2021

SpeechNet: A Universal Modularized Model for Speech Processing Tasks

There is a wide variety of speech processing tasks ranging from extracti...

Please sign up or login with your details

Forgot password? Click here to reset