
-
Continuous Speech Separation with Ad Hoc Microphone Arrays
Speech separation has been shown effective for multi-talker speech recog...
-
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings
An end-to-end (E2E) speaker-attributed automatic speech recognition (SA-...
-
Exploring End-to-End Multi-channel ASR with Bias Information for Meeting Transcription
Joint optimization of multi-channel front-end and automatic speech recog...
-
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR
Recently, an end-to-end speaker-attributed automatic speech recognition ...
-
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
Multi-speaker speech recognition of unsegmented recordings has diverse a...
-
Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
With its strong modeling capacity that comes from a multi-head and multi...
-
Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020
This paper describes the Microsoft speaker diarization system for monaur...
-
An End-to-end Architecture of Online Multi-channel Speech Separation
Multi-speaker speech recognition has been one of the key challenges in co...
-
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings
Recently, an end-to-end (E2E) speaker-attributed automatic speech recogn...
-
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers
We propose an end-to-end speaker-attributed automatic speech recognition...
-
Neural Speech Separation Using Spatially Distributed Microphones
This paper proposes a neural network based speech separation method usin...
-
Serialized Output Training for End-to-End Overlapped Speech Recognition
This paper proposes serialized output training (SOT), a novel framework ...
-
Continuous speech separation: dataset and analysis
This paper describes a dataset and protocols for evaluating continuous s...
-
Advances in Online Audio-Visual Meeting Transcription
This paper describes a system that generates speaker-annotated transcrip...
-
End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
An important problem in ad-hoc microphone speech separation is how to gu...
-
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation
Recent studies in deep learning-based speech separation have proven the ...
-
DOVER: A Method for Combining Diarization Outputs
Speech recognition and other natural language tasks have long benefited ...
-
Meeting Transcription Using Virtual Microphone Arrays
We describe a system that generates speaker-annotated transcripts of mee...
-
Low-Latency Speaker-Independent Continuous Speech Separation
Speaker independent continuous speech separation (SI-CSS) is a task of c...
-
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
The goal of this work is to develop a meeting transcription system that ...
-
Cracking the cocktail party problem by multi-beam deep attractor network
While recent progresses in neural network approaches to single-channel s...