Lightweight Speaker Verification for Online Identification of New Speakers with Short Segments

03/06/2020
by Ivette Vélez, et al.

Verifying whether two audio segments belong to the same speaker has recently been put forward as a flexible way to carry out speaker identification, since the verifier does not need to be re-trained when new speakers appear in the auditory scene. However, many current techniques use a considerable amount of memory and require a minimum audio segment length to perform well. This limits their applicability in areas such as service robots, the Internet of Things and virtual assistants. In this work we propose a BLSTM-based model that reaches a level of performance comparable to the current state of the art when using short input audio segments, while requiring considerably less memory. Further, as far as we know, a complete speaker identification system using this verification paradigm has not been reported. Thus, we present a complete online speaker identifier based on a simple voting system, and show that the proposed BLSTM-based model and the current state of the art are similarly accurate at identifying speakers online.
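The abstract does not spell out the voting scheme, so the following is only a minimal sketch of how verification-based identification with voting could work, not the authors' implementation. It assumes each audio segment has already been turned into a fixed-length vector, and it uses cosine similarity as a stand-in for the BLSTM verifier. The names `cosine_score`, `VotingIdentifier` and the `threshold` value are hypothetical.

```python
import math
from collections import Counter


def cosine_score(a, b):
    """Stand-in verifier (assumption): cosine similarity of two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class VotingIdentifier:
    """Hypothetical online identifier built on a pairwise verifier."""

    def __init__(self, verify=cosine_score, threshold=0.8):
        self.verify = verify        # returns a same-speaker score for two segments
        self.threshold = threshold  # assumed decision threshold
        self.references = {}        # speaker label -> stored reference segments

    def identify(self, segment):
        # Every stored reference that the verifier accepts casts one vote
        # for the speaker it belongs to.
        votes = Counter()
        for speaker, refs in self.references.items():
            for ref in refs:
                if self.verify(segment, ref) >= self.threshold:
                    votes[speaker] += 1

        if votes:
            speaker = votes.most_common(1)[0][0]
        else:
            # No reference matched: enroll a new speaker without re-training.
            speaker = f"speaker_{len(self.references)}"
            self.references[speaker] = []

        self.references[speaker].append(segment)
        return speaker


identifier = VotingIdentifier(threshold=0.9)
first = identifier.identify([1.0, 0.0])   # nothing enrolled yet
second = identifier.identify([0.0, 1.0])  # dissimilar to the first segment
third = identifier.identify([0.9, 0.1])   # close to the first segment
```

In this sketch a new speaker is added simply by storing a reference segment, which is the property the verification paradigm is meant to provide. A real system would also need to bound the number of stored references per speaker to keep memory use low.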


