Kushal Lakhotia

research

∙ 03/14/2022

SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities

Transfer learning has proven to be crucial in advancing the state of spe...

0 Hsiang-Sheng Tsai, et al. ∙

research

∙ 02/15/2022

textless-lib: a Library for Textless Spoken Language Processing

Textless spoken language processing research aims to extend the applicab...

11 Eugene Kharitonov, et al. ∙

research

∙ 01/05/2022

Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction

Video recordings of speech contain correlated audio and visual informati...

0 Bowen Shi, et al. ∙

research

∙ 11/17/2021

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale

This paper presents XLS-R, a large-scale model for cross-lingual speech ...

0 Arun Babu, et al. ∙

research

∙ 10/13/2021

Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?

Despite their recent popularity and well known advantages, dense retriev...

0 Xilun Chen, et al. ∙

research

∙ 09/07/2021

Text-Free Prosody-Aware Generative Spoken Language Modeling

Speech pre-training has primarily demonstrated efficacy on classificatio...

14 Eugene Kharitonov, et al. ∙

research

∙ 07/28/2021

Domain-matched Pre-training Tasks for Dense Retrieval

Pre-training on larger datasets with ever increasing model size is now a...

0 Barlas Oguz, et al. ∙

research

∙ 06/14/2021

HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units

Self-supervised approaches for speech representation learning are challe...

0 Wei-Ning Hsu, et al. ∙

research

∙ 05/03/2021

SUPERB: Speech processing Universal PERformance Benchmark

Self-supervised learning (SSL) has proven vital for advancing research i...

0 Shu-wen Yang, et al. ∙

research

∙ 04/01/2021

Speech Resynthesis from Discrete Disentangled Self-Supervised Representations

We propose using self-supervised discrete representations for the task o...

10 Adam Polyak, et al. ∙

research

∙ 02/01/2021

Generative Spoken Language Modeling from Raw Audio

Generative spoken language modeling involves learning jointly the acoust...

11 Kushal Lakhotia, et al. ∙

research

∙ 12/31/2020

FiD-Ex: Improving Sequence-to-Sequence Models for Extractive Rationale Generation

Natural language (NL) explanations of model predictions are gaining popu...

0 Kushal Lakhotia, et al. ∙

research

∙ 12/12/2018

PyText: A Seamless Path from NLP research to production

We introduce PyText - a deep learning based NLP modeling framework built...

0 Ahmed Aly, et al. ∙

Kushal Lakhotia

Featured Co-authors

Sign in with Google

Consider DeepAI Pro