Zeyu Jin

research

∙ 06/02/2023

Efficient Spoken Language Recognition via Multilabel Classification

Spoken language recognition (SLR) is the task of automatically identifyi...

Oriol Nieto, et al.

research

∙ 06/27/2022

Audio Similarity is Unreliable as a Proxy for Audio Quality

Many audio processing tasks require perceptual assessment. However, the ...

Pranay Manocha, et al.

research

∙ 04/28/2022

Music Enhancement via Image Translation and Vocoding

Consumer-grade music recordings such as those captured by mobile devices...

Nikhil Kandpal, et al.

research

∙ 03/06/2022

HEAR 2021: Holistic Evaluation of Audio Representations

What audio embedding approach generalizes best to a wide range of downst...

Joseph Turian, et al.

research

∙ 10/05/2021

Neural Pitch-Shifting and Time-Stretching with Controllable LPCNet

Modifying the pitch and timing of an audio signal are fundamental audio ...

Max Morrison, et al.

research

∙ 09/02/2021

Controllable deep melody generation via hierarchical music structure representation

Recent advances in deep learning have expanded possibilities to generate...

Shuqi Dai, et al.

research

∙ 02/16/2021

Context-Aware Prosody Correction for Text-Based Speech Editing

Text-based speech editors expedite the process of editing speech recordi...

Max Morrison, et al.

research

∙ 02/09/2021

CDPAM: Contrastive learning for perceptual audio similarity

Many speech processing methods based on deep learning require an automat...

Pranay Manocha, et al.

research

∙ 02/06/2021

High Order Numerical Homogenization for Dissipative Ordinary Differential Equations

We propose a high order numerical homogenization method for dissipative ...

Zeyu Jin, et al.

research

∙ 08/09/2020

Metric Learning vs Classification for Disentangled Music Representation Learning

Deep representation learning offers a powerful paradigm for mapping inpu...

Jongpil Lee, et al.

research

∙ 08/09/2020

Disentangled Multidimensional Metric Learning for Music Similarity

Music similarity search is useful for a variety of creative tasks such a...

Jongpil Lee, et al.

research

∙ 08/07/2020

Controllable Neural Prosody Synthesis

Speech synthesis has recently seen significant improvements in fidelity,...

Max Morrison, et al.

research

∙ 06/10/2020

HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Real-world audio recordings are often degraded by factors such as noise,...

Jiaqi Su, et al.

research

∙ 04/15/2020

F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder

Non-parallel many-to-many voice conversion remains an interesting but ch...

Kaizhi Qian, et al.

research

∙ 01/13/2020

A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

Assessment of many audio processing tasks relies on subjective evaluatio...

Pranay Manocha, et al.

research

∙ 06/04/2019

Text-based Editing of Talking-head Video

Editing talking-head video to change the speech content or to remove fil...

Ohad Fried, et al.

