Laureano Moro-Velázquez

research

∙ 09/08/2023

Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning

Visually grounded speech systems learn from paired images and their spok...

0 Saurabhchand Bhati, et al. ∙

research

∙ 03/07/2023

Stabilized training of joint energy-based models and their practical applications

The recently proposed Joint Energy-based Model (JEM) interprets discrimi...

0 Martin Sustek, et al. ∙

research

∙ 08/10/2022

Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech

In recent studies, self-supervised pre-trained models tend to outperform...

0 Jaejin Cho, et al. ∙

research

∙ 08/10/2022

Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations

Considering the abundance of unlabeled speech data and the high labeling...

0 Jaejin Cho, et al. ∙

research

∙ 03/30/2022

Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification

Speech systems developed for a particular choice of acoustic domain and ...

0 Saurabh Kataria, et al. ∙

research

∙ 01/26/2022

Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition

The high cost of data acquisition makes Automatic Speech Recognition (AS...

6 Piotr Żelasko, et al. ∙

research

∙ 10/05/2021

Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding

Typically, unsupervised segmentation of speech into the phone and word-l...

0 Saurabhchand Bhati, et al. ∙

research

∙ 09/13/2021

Beyond Isolated Utterances: Conversational Emotion Recognition

Speech emotion recognition is the task of recognizing the speaker's emot...

0 Raghavendra Pappagari, et al. ∙

research

∙ 06/15/2021

Pathological voice adaptation with autoencoder-based voice conversion

In this paper, we propose a new approach to pathological speech synthesi...

0 Marc Illa, et al. ∙

research

∙ 06/03/2021

Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation

Automatic detection of phoneme or word-like units is one of the core obj...

0 Saurabhchand Bhati, et al. ∙

research

∙ 04/02/2021

Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation

This paper tackles automatically discovering phone-like acoustic units (...

0 Siyuan Feng, et al. ∙

research

∙ 01/22/2021

Adversarial Attacks and Defenses for Speaker Identification Systems

Research in automatic speaker recognition (SR) has been undertaken for s...

0 Sonal Joshi, et al. ∙

research

∙ 11/29/2020

Artificial Intelligence applied to chest X-Ray images for the automatic detection of COVID-19. A thoughtful evaluation approach

Current standard protocols used in the clinic for diagnosing COVID-19 in...

0 Julian D. Arias-Londoño, et al. ∙

research

∙ 10/27/2020

CopyPaste: An Augmentation Method for Speech Emotion Recognition

Data augmentation is a widely used strategy for training robust machine ...

0 Raghavendra Pappagari, et al. ∙

research

∙ 10/22/2020

How Phonotactics Affect Multilingual and Zero-shot ASR Performance

The idea of combining multiple languages' recordings to train a single a...

0 Siyuan Feng, et al. ∙

research

∙ 05/16/2020

That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages

Only a handful of the world's languages are abundant with the resources ...

0 Piotr Żelasko, et al. ∙

Laureano Moro-Velázquez

Featured Co-authors

Sign in with Google

Consider DeepAI Pro