Peter Wu

research

∙ 09/14/2023

CiwaGAN: Articulatory information exchange

Humans encode information into sounds by controlling articulators and de...

0 Gašper Beguš, et al. ∙

research

∙ 02/14/2023

Speaker-Independent Acoustic-to-Articulatory Speech Inversion

To build speech processing methods that can handle speech as naturally a...

0 Peter Wu, et al. ∙

research

∙ 10/27/2022

A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution

Estimation of fundamental frequency (F0) in voiced segments of speech si...

0 Yisi Liu, et al. ∙

research

∙ 10/27/2022

Articulation GAN: Unsupervised modeling of articulatory learning

Generative deep neural networks are widely used for speech synthesis, bu...

0 Gašper Beguš, et al. ∙

research

∙ 10/21/2022

Evidence of Vocal Tract Articulation in Self-Supervised Learning of Speech

Recent self-supervised learning (SSL) models have proven to learn rich r...

0 Cheol Jun Cho, et al. ∙

research

∙ 09/13/2022

Deep Speech Synthesis from Articulatory Representations

In the articulatory synthesis task, speech is synthesized from input fea...

0 Peter Wu, et al. ∙

research

∙ 03/21/2022

PACS: A Dataset for Physical Audiovisual CommonSense Reasoning

In order for AI to be safely deployed in real-world scenarios such as ho...

4 Samuel Yu, et al. ∙

research

∙ 11/02/2021

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

Speech processing systems currently do not support the vast majority of ...

6 Peter Wu, et al. ∙

research

∙ 10/15/2021

ESPnet2-TTS: Extending the Edge of TTS Research

This paper describes ESPnet2-TTS, an end-to-end text-to-speech (E2E-TTS)...

0 Tomoki Hayashi, et al. ∙

research

∙ 07/15/2021

MultiBench: Multiscale Benchmarks for Multimodal Representation Learning

Learning multimodal representations involves integrating information fro...

0 Paul Pu Liang, et al. ∙

research

∙ 01/22/2021

Understanding the Tradeoffs in Client-Side Privacy for Speech Recognition

Existing approaches to ensuring privacy of user speech data primarily fo...

0 Peter Wu, et al. ∙

research

∙ 12/04/2020

Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment

The natural world is abundant with concepts expressed via visual, acoust...

6 Paul Pu Liang, et al. ∙

research

∙ 12/01/2020

Automatically Identifying Language Family from Acoustic Examples in Low Resource Scenarios

Existing multilingual speech NLP works focus on a relatively small subse...

0 Peter Wu, et al. ∙

research

∙ 11/19/2020

Oblivious DNS over HTTPS (ODoH): A Practical Privacy Enhancement to DNS

The Domain Name System (DNS) is the foundation of a human-usable Interne...

0 Sudheesh Singanamalla, et al. ∙

research

∙ 12/03/2018

LEAF: A Benchmark for Federated Settings

Modern federated networks, such as those comprised of wearable devices, ...

0 Sebastian Caldas, et al. ∙

research

∙ 04/30/2018

Machine Learning for Exam Triage

In this project, we extend the state-of-the-art CheXNet (Rajpurkar et al...

0 Xinyu Guan, et al. ∙
