b'Xiong Xiao'

research

∙ 03/08/2023

A robust method for reliability updating with equality information using sequential adaptive importance sampling

Reliability updating refers to a problem that integrates Bayesian updati...

0 Xiong Xiao, et al. ∙

research

∙ 12/24/2022

A Bayesian Robust Regression Method for Corrupted Data Reconstruction

Because of the widespread existence of noise and data corruption, recove...

0 Fan Zheyi, et al. ∙

research

∙ 08/27/2022

Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization

This paper describes a speaker diarization model based on target speaker...

0 Dongmei Wang, et al. ∙

research

∙ 03/30/2022

Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings

This paper presents a streaming speaker-attributed automatic speech reco...

0 Naoyuki Kanda, et al. ∙

research

∙ 02/02/2022

Streaming Multi-Talker ASR with Token-Level Serialized Output Training

This paper proposes a token-level serialized output training (t-SOT), a ...

0 Naoyuki Kanda, et al. ∙

research

∙ 01/17/2022

Optimal monitoring location for risk tracking of geotechnical systems: theory and application to tunneling excavation risks

The maturity of structural health monitoring technology brings ever-incr...

0 Zeyu Wang, et al. ∙

research

∙ 10/27/2021

Separating Long-Form Speech with Group-Wise Permutation Invariant Training

Multi-talker conversational speech processing has drawn many interests f...

0 Wangyou Zhang, et al. ∙

research

∙ 10/26/2021

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

Self-supervised learning (SSL) achieves great success in speech recognit...

0 Sanyuan Chen, et al. ∙

research

∙ 10/07/2021

Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR

This paper presents Transcribe-to-Diarize, a new approach for neural spe...

0 Naoyuki Kanda, et al. ∙

research

∙ 09/22/2021

Diarisation using location tracking with agglomerative clustering

Previous works have shown that spatial location information can be compl...

0 Jeremy H. M. Wong, et al. ∙

research

∙ 07/06/2021

A Comparative Study of Modular and Joint Approaches for Speaker-Attributed ASR on Monaural Long-Form Audio

Speaker-attributed automatic speech recognition (SA-ASR) is a task to re...

0 Naoyuki Kanda, et al. ∙

research

∙ 02/06/2021

Speaker attribution with voice profiles by graph-based semi-supervised learning

Speaker attribution is required in many real-world applications, such as...

0 Jixuan Wang, et al. ∙

research

∙ 10/22/2020

Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020

This paper describes the Microsoft speaker diarization system for monaur...

0 Xiong Xiao, et al. ∙

research

∙ 05/22/2020

Speaker diarization with session-level speaker embedding refinement using graph neural networks

Deep speaker embedding models have been commonly used as a building bloc...

6 Jixuan Wang, et al. ∙

research

∙ 12/10/2019

Advances in Online Audio-Visual Meeting Transcription

This paper describes a system that generates speaker-annotated transcrip...

15 Takuya Yoshioka, et al. ∙

research

∙ 07/12/2019

Pykaldi2: Yet another speech toolkit based on Kaldi and Pytorch

We introduce PyKaldi2 speech recognition toolkit implemented based on Ka...

0 Liang Lu, et al. ∙

research

∙ 04/13/2019

Low-Latency Speaker-Independent Continuous Speech Separation

Speaker independent continuous speech separation (SI-CSS) is a task of c...

0 Takuya Yoshioka, et al. ∙

research

∙ 10/08/2018

Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks

The goal of this work is to develop a meeting transcription system that ...

0 Takuya Yoshioka, et al. ∙

research

∙ 04/14/2018

Developing Far-Field Speaker System Via Teacher-Student Learning

In this study, we develop the keyword spotting (KWS) and acoustic model ...

0 Jinyu Li, et al. ∙

research

∙ 03/29/2018

Cracking the cocktail party problem by multi-beam deep attractor network

While recent progresses in neural network approaches to single-channel s...

0 Zhuo Chen, et al. ∙

research

∙ 04/12/2016

Noise Robust Speech Recognition Using Multi-Channel Based Channel Selection And ChannelWeighting

In this paper, we study several microphone channel selection and weighti...

0 Zhaofeng Zhang, et al. ∙

research

∙ 02/05/2016

Fantastic 4 system for NIST 2015 Language Recognition Evaluation

This article describes the systems jointly submitted by Institute for In...

0 Kong Aik Lee, et al. ∙

Xiong Xiao

Featured Co-authors

Sign in with Google

Consider DeepAI Pro