Wei-Qiang Zhang

research

∙ 06/02/2023

Task-Agnostic Structured Pruning of Speech Representation Models

Self-supervised pre-trained models such as Wav2vec2, Hubert, and WavLM h...

0 Haoyu Wang, et al. ∙

research

∙ 06/02/2023

DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model

Multilingual self-supervised speech representation models have greatly e...

0 Haoyu Wang, et al. ∙

research

∙ 03/31/2023

Unsupervised Anomaly Detection and Localization of Machine Audio: A GAN-based Approach

Automatic detection of machine anomaly remains challenging for machine l...

0 Anbai Jiang, et al. ∙

research

∙ 03/14/2023

Cross-lingual Alzheimer's Disease detection based on paralinguistic and pre-trained features

We present our submission to the ICASSP-SPGC-2023 ADReSS-M Challenge Tas...

0 Xuchu Chen, et al. ∙

research

∙ 01/28/2023

MVKT-ECG: Efficient Single-lead ECG Classification on Multi-Label Arrhythmia by Multi-View Knowledge Transferring

The widespread emergence of smart devices for ECG has sparked demand for...

0 Yuzhen Qin, et al. ∙

research

∙ 01/05/2023

Expressive Speech-driven Facial Animation with controllable emotions

It is in high demand to generate facial animation with high realism, but...

0 Yutong Chen, et al. ∙

research

∙ 12/20/2022

Exploring Effective Fusion Algorithms for Speech Based Self-Supervised Learning Models

Self-supervised learning (SSL) has achieved great success in various are...

0 Changli Tang, et al. ∙

research

∙ 11/02/2022

LMD: A Learnable Mask Network to Detect Adversarial Examples for Speaker Verification

Although the security of automatic speaker verification (ASV) is serious...

0 Xing Chen, et al. ∙

research

∙ 10/30/2022

Symmetric Saliency-based Adversarial Attack To Speaker Identification

Adversarial attack approaches to speaker identification either need high...

0 Jiadi Yao, et al. ∙

research

∙ 10/27/2022

Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition

Recent years have witnessed great strides in self-supervised learning (S...

0 Yujin Wang, et al. ∙

research

∙ 10/13/2022

Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models

Labeled audio data is insufficient to build satisfying speech recognitio...

0 Haoyu Wang, et al. ∙

research

∙ 10/12/2022

Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge

Code-switching automatic speech recognition becomes one of the most chal...

0 Shuhao Deng, et al. ∙

research

∙ 06/29/2022

The THUEE System Description for the IARPA OpenASR21 Challenge

This paper describes the THUEE team's speech recognition system for the ...

0 Jing Zhao, et al. ∙

research

∙ 03/01/2022

BERT-LID: Leveraging BERT to Improve Spoken Language Identification

Language identification is a task of automatically determining the ident...

0 Yuting Nie, et al. ∙

research

∙ 08/27/2021

Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement

As the cornerstone of other important technologies, such as speech recog...

0 Yuzi Yan, et al. ∙

research

∙ 07/06/2021

AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style

While recent text to speech (TTS) models perform very well in synthesizi...

9 Yuzi Yan, et al. ∙

research

∙ 07/05/2021

DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling

Rap generation, which aims to produce lyrics and corresponding singing b...

0 Lanqing Xue, et al. ∙

research

∙ 06/13/2021

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

This paper introduces GigaSpeech, an evolving, multi-domain English spee...

0 Guoguo Chen, et al. ∙

research

∙ 12/25/2019

THUEE system description for NIST 2019 SRE CTS Challenge

This paper describes the systems submitted by the department of electron...

0 YI LIU, et al. ∙

research

∙ 03/28/2019

Hierarchical Pooling Structure for Weakly Labeled Sound Event Detection

Sound event detection with weakly labeled data is considered as a proble...

0 Ke-Xin He, et al. ∙

research

∙ 10/29/2018

Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection

In this paper, we propose a temporal-frequential attention model for sou...

0 Yu-Han Shen, et al. ∙

research

∙ 10/03/2018

SAM-GCNN: A Gated Convolutional Neural Network with Segment-Level Attention Mechanism for Home Activity Monitoring

In this paper, we propose a method for home activity monitoring. We demo...

0 Yu-Han Shen, et al. ∙

Wei-Qiang Zhang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro