Kenji Nagamatsu

research

∙ 12/06/2021

Team Hitachi @ AutoMin 2021: Reference-free Automatic Minuting Pipeline with Argument Structure Construction over Topic-based Summarization

This paper introduces the proposed automatic minuting system of the Hita...

Atsuki Yamaguchi, et al.

research

∙ 09/27/2021

Emotional Speech Synthesis for Companion Robot to Imitate Professional Caregiver Speech

When people try to influence others to do something, they subconsciously...

Takeshi Homma, et al.

research

∙ 06/09/2021

Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural Diarization

In this paper, we present a semi-supervised training technique using pse...

Yuki Takashima, et al.

research

∙ 06/08/2021

End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection

In this paper, we present a conditional multitask learning method for en...

Yuki Takashima, et al.

research

∙ 01/21/2021

Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers

This paper proposes an online end-to-end diarization that can handle ove...

Yawen Xue, et al.

research

∙ 12/28/2020

Building Multilingual TTS using Cross-Lingual Voice Conversion

In this paper we propose a new cross-lingual Voice Conversion (VC) appro...

Qinghua Sun, et al.

research

∙ 12/18/2020

End-to-End Speaker Diarization as Post-Processing

This paper investigates the utilization of an end-to-end diarization mod...

Shota Horiguchi, et al.

research

∙ 11/16/2020

Block-Online Guided Source Separation

We propose a block-online algorithm of guided source separation (GSS). G...

Shota Horiguchi, et al.

research

∙ 07/31/2020

Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones

A novel framework for meeting transcription using asynchronous microphon...

Shota Horiguchi, et al.

research

∙ 06/04/2020

Online End-to-End Neural Diarization with Speaker-Tracing Buffer

End-to-end speaker diarization using a fully supervised self-attention m...

Yawen Xue, et al.

research

∙ 06/02/2020

Neural Speaker Diarization with Speaker-Wise Chain Rule

Speaker diarization is an essential step for processing multi-speaker au...

Yusuke Fujita, et al.

research

∙ 05/20/2020

End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors

End-to-end speaker diarization for an unknown number of speakers is addr...

Shota Horiguchi, et al.

research

∙ 02/24/2020

End-to-End Neural Diarization: Reformulating Speaker Diarization as Simple Multi-label Classification

The most common approach to speaker diarization is clustering of speaker...

Yusuke Fujita, et al.

research

∙ 11/06/2019

Addressing Ambiguity of Emotion Labels Through Meta-learning

Emotion labels in emotion recognition corpora are highly noisy and ambig...

Takuya Fujioka, et al.

research

∙ 09/17/2019

Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models

This paper investigates the use of target-speaker automatic speech recog...

Naoyuki Kanda, et al.

research

∙ 09/13/2019

End-to-End Neural Speaker Diarization with Self-attention

Speaker diarization has been mainly developed based on the clustering of...

Yusuke Fujita, et al.

research

∙ 09/12/2019

End-to-End Neural Speaker Diarization with Permutation-Free Objectives

In this paper, we propose a novel end-to-end neural-network-based speake...

Yusuke Fujita, et al.

research

∙ 06/26/2019

Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition

In this paper, we propose a novel auxiliary loss function for target-spe...

Naoyuki Kanda, et al.

research

∙ 05/29/2019

Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR

In this paper, we present Hitachi and Paderborn University's joint effor...

Naoyuki Kanda, et al.
