This paper introduces the proposed automatic minuting system of the Hita...
When people try to influence others to do something, they subconsciously...
In this paper, we present a semi-supervised training technique using
pse...
In this paper, we present a conditional multitask learning method for
en...
This paper proposes an online end-to-end diarization that can handle
ove...
In this paper we propose a new cross-lingual Voice Conversion (VC) appro...
This paper investigates the utilization of an end-to-end diarization mod...
We propose a block-online algorithm of guided source separation (GSS). G...
A novel framework for meeting transcription using asynchronous microphon...
End-to-end speaker diarization using a fully supervised self-attention
m...
Speaker diarization is an essential step for processing multi-speaker au...
End-to-end speaker diarization for an unknown number of speakers is addr...
The most common approach to speaker diarization is clustering of speaker...
Emotion labels in emotion recognition corpora are highly noisy and ambig...
This paper investigates the use of target-speaker automatic speech
recog...
Speaker diarization has been mainly developed based on the clustering of...
In this paper, we propose a novel end-to-end neural-network-based speake...
In this paper, we propose a novel auxiliary loss function for target-spe...
In this paper, we present Hitachi and Paderborn University's joint effor...