
-
Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020
This paper describes the Microsoft speaker diarization system for monaur...
read it
-
Speaker diarization with session-level speaker embedding refinement using graph neural networks
Deep speaker embedding models have been commonly used as a building bloc...
read it
-
Advances in Online Audio-Visual Meeting Transcription
This paper describes a system that generates speaker-annotated transcrip...
read it
-
Pykaldi2: Yet another speech toolkit based on Kaldi and Pytorch
We introduce PyKaldi2 speech recognition toolkit implemented based on Ka...
read it
-
Low-Latency Speaker-Independent Continuous Speech Separation
Speaker independent continuous speech separation (SI-CSS) is a task of c...
read it
-
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks
The goal of this work is to develop a meeting transcription system that ...
read it
-
Developing Far-Field Speaker System Via Teacher-Student Learning
In this study, we develop the keyword spotting (KWS) and acoustic model ...
read it
-
Cracking the cocktail party problem by multi-beam deep attractor network
While recent progresses in neural network approaches to single-channel s...
read it
-
Noise Robust Speech Recognition Using Multi-Channel Based Channel Selection And ChannelWeighting
In this paper, we study several microphone channel selection and weighti...
read it
-
Fantastic 4 system for NIST 2015 Language Recognition Evaluation
This article describes the systems jointly submitted by Institute for In...
read it