Xiao-Lei Zhang

research

∙ 07/03/2023

Spatial-temporal Graph Based Multi-channel Speaker Verification With Ad-hoc Microphone Arrays

The performance of speaker verification degrades significantly in advers...

0 Yijiang Chen, et al. ∙

research

∙ 02/21/2023

Interpretable Spectrum Transformation Attacks to Speaker Recognition

The success of adversarial attacks to speaker recognition is mainly in w...

0 Jiadi Yao, et al. ∙

research

∙ 11/02/2022

Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames

Recently, the unified streaming and non-streaming two-pass (U2/U2++) end...

0 Chengdong Liang, et al. ∙

research

∙ 11/02/2022

LMD: A Learnable Mask Network to Detect Adversarial Examples for Speaker Verification

Although the security of automatic speaker verification (ASV) is serious...

0 Xing Chen, et al. ∙

research

∙ 10/30/2022

Symmetric Saliency-based Adversarial Attack To Speaker Identification

Adversarial attack approaches to speaker identification either need high...

0 Jiadi Yao, et al. ∙

research

∙ 10/30/2022

WeKws: A production first small-footprint end-to-end Keyword Spotting Toolkit

Keyword spotting (KWS) enables speech-based user interaction and gradual...

0 Jie Wang, et al. ∙

research

∙ 10/19/2022

Deep Learning Based Two-dimensional Speaker Localization With Large Ad-hoc Microphone Arrays

Deep learning based speaker localization has shown its advantage in reve...

0 Shupei Liu, et al. ∙

research

∙ 10/16/2022

End-to-end Two-dimensional Sound Source Localization With Ad-hoc Microphone Arrays

Conventional sound source localization methods are mostly based on a sin...

0 Yijun Gong, et al. ∙

research

∙ 07/25/2022

Improving Pseudo Labels With Intra-Class Similarity for Unsupervised Domain Adaptation

Unsupervised domain adaptation (UDA) transfers knowledge from a label-ri...

4 Jie Wang, et al. ∙

research

∙ 10/12/2021

Frame-level multi-channel speaker verification with large-scale ad-hoc microphone arrays

Ad-hoc microphone arrays have received attention, in which the number and...

0 Chengdong Liang, et al. ∙

research

∙ 07/13/2021

Conformer-based End-to-end Speech Recognition With Rotary Position Embedding

Transformer-based end-to-end speech recognition models have received con...

0 Shengqiang Li, et al. ∙

research

∙ 07/13/2021

AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data

Deep neural networks provide effective solutions to small-footprint keyw...

0 Menglong Xu, et al. ∙

research

∙ 07/05/2021

Unsupervised Ensemble Selection for Multilayer Bootstrap Networks

Multilayer bootstrap network (MBN), which is a recent simple unsupervise...

0 Xiao-Lei Zhang, et al. ∙

research

∙ 07/01/2021

Attention-based multi-channel speaker verification with ad-hoc microphone arrays

Recently, ad-hoc microphone arrays have been widely studied. Unlike tradit...

0 Chengdong Liang, et al. ∙

research

∙ 04/14/2021

Efficient conformer-based speech recognition with linear attention

Recently, conformer-based end-to-end automatic speech recognition, which...

0 Shengqiang Li, et al. ∙

research

∙ 03/29/2021

Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention

Self-attention (SA), which encodes vector sequences according to their p...

0 Chengdong Liang, et al. ∙

research

∙ 03/29/2021

Scaling sparsemax based channel selection for speech recognition with ad-hoc microphone arrays

Recently, speech recognition with ad-hoc microphone arrays has received ...

0 Junqi Chen, et al. ∙

research

∙ 02/24/2021

Deep NMF Topic Modeling

Nonnegative matrix factorization (NMF) based topic modeling methods do n...

0 Jianyu Wang, et al. ∙

research

∙ 01/16/2021

Minimum-volume Multichannel Nonnegative matrix factorization for blind source separation

Multichannel blind source separation aims to recover the latent sources ...

0 Jianyu Wang, et al. ∙

research

∙ 12/01/2020

Deep Ad-hoc Beamforming Based on Speaker Extraction for Target-Dependent Speech Separation

Recently, the research on ad-hoc microphone arrays with deep learning ha...

0 Ziye Yang, et al. ∙

research

∙ 11/29/2020

A comparison of handcrafted, parameterized, and learnable features for speech separation

The design of acoustic features is important for speech separation. It c...

0 Wenbo Zhu, et al. ∙

research

∙ 10/23/2020

Speech enhancement aided end-to-end multi-task learning for voice activity detection

Robust voice activity detection (VAD) is a challenging task in low signa...

8 Xu Tan, et al. ∙

research

∙ 10/23/2020

Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention

Recently, several studies reported that dot-product self-attention (SA) m...

0 Menglong Xu, et al. ∙

research

∙ 04/25/2020

Depthwise Separable Convolutional ResNet with Squeeze-and-Excitation Blocks for Small-footprint Keyword Spotting

One difficult problem of keyword spotting is how to miniaturize its memo...

0 Menglong Xu, et al. ∙

research

∙ 03/31/2020

Augmented Q Imitation Learning (AQIL)

The study of unsupervised learning can be generally divided into two cat...

0 Xiao-Lei Zhang, et al. ∙

research

∙ 11/19/2019

Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification

Deep embedding based text-independent speaker verification has demonstra...

0 Zhongxin Bai, et al. ∙

research

∙ 10/24/2019

Deep topic modeling by multilayer bootstrap network and lasso

Topic modeling is widely studied for the dimension reduction and analysi...

0 Jianyu Wang, et al. ∙

research

∙ 10/24/2019

Multi-channel Speech Separation Using Deep Embedding Model with Multilayer Bootstrap Networks

Recently, deep clustering (DPCL) based speaker-independent speech separa...

0 Ziye Yang, et al. ∙

research

∙ 11/03/2018

Deep Ad-hoc Beamforming

Deep learning based speech enhancement methods face two problems. First,...

0 Xiao-Lei Zhang, et al. ∙

research

∙ 02/12/2018

Linear Regression for Speaker Verification

This paper presents a linear regression based back-end for speaker verif...

0 Xiao-Lei Zhang, et al. ∙

research

∙ 03/22/2015

Unsupervised model compression for multilayer bootstrap networks

Recently, multilayer bootstrap network (MBN) has demonstrated promising ...

0 Xiao-Lei Zhang, et al. ∙

research

∙ 12/03/2014

Deep Distributed Random Samplings for Supervised Learning: An Alternative to Random Forests?

In (zhang2014nonlinear,zhang2014nonlinear2), we have viewed machine lear...

0 Xiao-Lei Zhang, et al. ∙

research

∙ 08/05/2014

Multilayer bootstrap networks

Multilayer bootstrap network builds a gradually narrowed multilayer nonl...

0 Xiao-Lei Zhang, et al. ∙

research

∙ 08/22/2013

Learning Deep Representation Without Parameter Inference for Nonlinear Dimensionality Reduction

Unsupervised deep learning is one of the most powerful representation le...

0 Xiao-Lei Zhang, et al. ∙

research

∙ 03/04/2013

Denoising Deep Neural Networks Based Voice Activity Detection

Recently, the deep-belief-networks (DBN) based voice activity detection ...

0 Xiao-Lei Zhang, et al. ∙
