Cheng Yu

research

∙ 09/19/2023

Using fine-tuning and min lookahead beam search to improve Whisper

The performance of Whisper in low-resource languages is still far from p...

0 Andrea Do, et al. ∙

research

∙ 06/19/2023

Learning operators for identifying weak solutions to the Navier-Stokes equations

This paper focuses on investigating the learning operators for identifyi...

0 Dixi Wang, et al. ∙

research

∙ 06/08/2023

Matrix GARCH Model: Inference and Application

Matrix-variate time series data are largely available in applications. H...

0 Cheng Yu, et al. ∙

research

∙ 03/31/2022

Perceptual Contrast Stretching on Target Feature for Speech Enhancement

Speech enhancement (SE) performance has improved considerably since the ...

0 Rong Chao, et al. ∙

research

∙ 02/10/2022

Conditional Diffusion Probabilistic Model for Speech Enhancement

Speech enhancement is a critical component of many user-oriented audio a...

0 Yen-Ju Lu, et al. ∙

research

∙ 11/10/2021

OSSEM: one-shot speaker adaptive speech enhancement using meta learning

Although deep learning (DL) has achieved notable progress in speech enha...

0 Cheng Yu, et al. ∙

research

∙ 11/10/2021

HASA-net: A non-intrusive hearing-aid speech assessment network

Without the need of a clean reference, non-intrusive speech assessment m...

0 Hsin-Tien Chiang, et al. ∙

research

∙ 11/08/2021

SEOFP-NET: Compression and Acceleration of Deep Neural Networks for Speech Enhancement Using Sign-Exponent-Only Floating-Points

Numerous compression and acceleration strategies have achieved outstandi...

0 Yu-Chen Lin, et al. ∙

research

∙ 10/12/2021

MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech

Most of the deep learning-based speech enhancement models are learned in...

49 Szu-Wei Fu, et al. ∙

research

∙ 08/23/2021

Adaptable GAN Encoders for Image Reconstruction via Multi-type Latent Vectors with Two-scale Attentions

Although current deep generative adversarial networks (GANs) could synth...

16 Cheng Yu, et al. ∙

research

∙ 06/09/2021

Intermittent Speech Recovery

A large number of Internet of Things (IoT) devices today are powered by ...

0 Yu-Chen Lin, et al. ∙

research

∙ 04/08/2021

MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement

The discrepancy between the cost function used for training a speech enh...

30 Szu-Wei Fu, et al. ∙

research

∙ 01/07/2021

Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario

Multi-task learning (MTL) and attention mechanism have been proven to ef...

0 Chiang-Jen Peng, et al. ∙

research

∙ 10/28/2020

Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement

Speech enhancement (SE) aims to improve speech quality and intelligibili...

0 Tsun-An Hsieh, et al. ∙

research

∙ 10/13/2020

Multi-Scale Networks for 3D Human Pose Estimation with Inference Stage Optimization

Estimating 3D human poses from a monocular video is still a challenging ...

4 Cheng Yu, et al. ∙

research

∙ 10/08/2020

HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation

This work describes the speaker verification system developed by Human L...

0 Rohan Kumar Das, et al. ∙

research

∙ 06/18/2020

Boosting Objective Scores of Speech Enhancement Model through MetricGAN Post-Processing

The Transformer architecture has shown its superior ability than recurre...

0 Szu-Wei Fu, et al. ∙

research

∙ 01/06/2020

Speech Enhancement based on Denoising Autoencoder with Multi-branched Encoders

Deep learning-based models have greatly advanced the performance of spee...

0 Cheng Yu, et al. ∙

research

∙ 11/22/2019

Time-Domain Multi-modal Bone/air Conducted Speech Enhancement

Integrating modalities, such as video signals with speech, has been show...

0 Cheng Yu, et al. ∙

research

∙ 05/31/2019

Increasing Compactness Of Deep Learning Based Speech Enhancement Models With Parameter Pruning And Quantization Techniques

Most recent studies on deep learning based speech enhancement (SE) focus...

0 Jyun-Yi Wu, et al. ∙

Cheng Yu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro