The goal of universal audio representation learning is to obtain foundat...
In recent years, self-supervised learning (SSL) models have produced
pro...
Domain gaps are among the most relevant roadblocks in the clinical
trans...
Self-supervised representations of speech are currently being widely use...
A number of different performance metrics are commonly used in the machi...
Speaker verification (SV) systems are currently being used to make sensi...
Spoken language recognition (SLR) refers to the automatic process used t...
This work aims to understand the impact of class imbalance on the perfor...
Phone-level pronunciation scoring is a challenging task, with performanc...
Transformers have revolutionized the world of deep learning, especially i...
Emotion recognition datasets are relatively small, making the use of the...
Out of a hundred trials, how many errors does your speaker verifier make...
Research has shown that trust is an essential aspect of human-computer
i...
In this paper, we address the problem of speaker verification in conditi...
Research has shown that trust is an essential aspect of human-computer
i...
This paper describes a novel protocol for collecting speech data from
su...
In a recent work, we presented a discriminative backend for speaker
veri...
We present a scoring approach for speaker verification that mimics the
s...
Probabilistic linear discriminant analysis (PLDA) is a method used for
b...
The joint PLDA model is a generalization of PLDA where the nuisance var...
Standard probabilistic linear discriminant analysis (PLDA) for speaker recognit...