Hirokazu Kameoka

research

∙ 08/14/2023

iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN

The inverse short-time Fourier transform network (iSTFTNet) has garnered...

0 Takuhiro Kaneko, et al. ∙

research

∙ 03/24/2023

Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis

In speech synthesis, a generative adversarial network (GAN), training a ...

0 Takuhiro Kaneko, et al. ∙

research

∙ 10/20/2022

DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion

Voice conversion is a task to convert a non-linguistic feature of a give...

0 Chihiro Watanabe, et al. ∙

research

∙ 06/09/2022

Speak Like a Dog: Human to Non-human creature Voice Conversion

This paper proposes a new voice conversion (VC) task from human speech t...

0 Kohei Suzuki, et al. ∙

research

∙ 03/04/2022

iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform

In recent text-to-speech synthesis and voice conversion systems, a mel-s...

6 Takuhiro Kaneko, et al. ∙

research

∙ 09/28/2021

FastMVAE2: On improving and accelerating the fast variational autoencoder-based source separation algorithm for determined mixtures

This paper proposes a new source model and training scheme to improve th...

0 Li Li, et al. ∙

research

∙ 08/10/2021

StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition

Preserving the linguistic content of input speech is essential during vo...

0 Shoki Sakamoto, et al. ∙

research

∙ 04/14/2021

FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion

This paper proposes a non-autoregressive extension of our previously pro...

0 Hirokazu Kameoka, et al. ∙

research

∙ 04/05/2021

StarGAN-based Emotional Voice Conversion for Japanese Phrases

This paper shows that StarGAN-VC, a spectral envelope transformation met...

0 Asuka Moritani, et al. ∙

research

∙ 02/25/2021

MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames

Non-parallel voice conversion (VC) is a technique for training voice con...

0 Takuhiro Kaneko, et al. ∙

research

∙ 10/22/2020

CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion

Non-parallel voice conversion (VC) is a technique for learning mappings ...

0 Takuhiro Kaneko, et al. ∙

research

∙ 10/06/2020

VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics

In this paper, we propose a non-parallel any-to-many voice conversion (V...

0 Hirokazu Kameoka, et al. ∙

research

∙ 09/18/2020

X-DC: Explainable Deep Clustering based on Learnable Spectrogram Templates

Deep neural networks (DNNs) have achieved substantial predictive perform...

0 Chihiro Watanabe, et al. ∙

research

∙ 08/27/2020

Non-Parallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks

We have previously proposed a method that allows for non-parallel voice ...

3 Hirokazu Kameoka, et al. ∙

research

∙ 08/07/2020

Pretraining Techniques for Sequence-to-Sequence Voice Conversion

Sequence-to-sequence (seq2seq) voice conversion (VC) models are attracti...

0 Wen-Chin Huang, et al. ∙

research

∙ 05/18/2020

Many-to-Many Voice Transformer Network

This paper proposes a voice conversion (VC) method based on a sequence-t...

6 Hirokazu Kameoka, et al. ∙

research

∙ 12/14/2019

Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining

We introduce a novel sequence-to-sequence (seq2seq) voice conversion (VC...

0 Wen-Chin Huang, et al. ∙

research

∙ 11/05/2019

The ASVspoof 2019 database

Automatic speaker verification (ASV) is one of the most natural and conv...

0 Xin Wang, et al. ∙

research

∙ 07/29/2019

StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion

Non-parallel multi-domain voice conversion (VC) is a technique for learn...

11 Takuhiro Kaneko, et al. ∙

research

∙ 04/09/2019

CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion

Non-parallel voice conversion (VC) is a technique for learning the mappi...

0 Takuhiro Kaneko, et al. ∙

research

∙ 04/09/2019

Crossmodal Voice Conversion

Humans are able to imagine a person's voice from the person's appearance...

0 Hirokazu Kameoka, et al. ∙

research

∙ 04/05/2019

WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation

WaveCycleGAN has recently been proposed to bridge the gap between natura...

0 Kou Tanaka, et al. ∙

research

∙ 03/29/2019

Training a Neural Speech Waveform Model using Spectral Losses of Short-Time Fourier Transform and Continuous Wavelet Transform

Recently, we proposed short-time Fourier transform (STFT)-based loss fun...

0 Shinji Takaki, et al. ∙

research

∙ 12/16/2018

Fast MVAE: Joint separation and classification of mixed sources based on multichannel variational autoencoder with auxiliary classifier

This paper proposes an alternative algorithm for multichannel variationa...

0 Li Li, et al. ∙

research

∙ 11/09/2018

AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms

This paper describes a method based on a sequence-to-sequence learning (...

0 Kou Tanaka, et al. ∙

research

∙ 11/05/2018

ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion

This paper proposes a voice conversion method based on fully convolution...

0 Hirokazu Kameoka, et al. ∙

research

∙ 09/29/2018

Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation

This paper deals with a multichannel audio source separation problem und...

0 Shogo Seki, et al. ∙

research

∙ 09/25/2018

WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks

We propose a learning-based filter that allows us to directly modify a s...

0 Kou Tanaka, et al. ∙

research

∙ 08/13/2018

ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary classifier variational autoencoder

This paper proposes a non-parallel many-to-many voice conversion (VC) me...

0 Hirokazu Kameoka, et al. ∙

research

∙ 08/02/2018

Semi-blind source separation with multichannel variational autoencoder

This paper proposes a multichannel source separation method called the m...

8 Hirokazu Kameoka, et al. ∙

research

∙ 06/06/2018

StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks

This paper proposes a method that allows for non-parallel many-to-many v...

2 Hirokazu Kameoka, et al. ∙

research

∙ 04/06/2018

Generative adversarial network-based approach to signal reconstruction from magnitude spectrograms

In this paper, we address the problem of reconstructing a time-domain si...

0 Keisuke Oyamada, et al. ∙

research

∙ 04/03/2018

Speech waveform synthesis from MFCC sequences with generative adversarial networks

This paper proposes a method for generating speech from filterbank mel f...

0 Lauri Juvela, et al. ∙

research

∙ 11/30/2017

Parallel-Data-Free Voice Conversion Using Cycle-Consistent Adversarial Networks

We propose a parallel-data-free voice conversion (VC) method that can le...

0 Takuhiro Kaneko, et al. ∙

research

∙ 07/16/2012

Designing various component analysis at will

This paper provides a generic framework of component analysis (CA) metho...

0 Akisato Kimura, et al. ∙
