Kun Song

research

∙ 07/10/2023

The NPU-MSXF Speech-to-Speech Translation System for IWSLT 2023 Speech-to-Speech Translation Task

This paper describes the NPU-MSXF system for the IWSLT 2023 speech-to-sp...

Kun Song, et al.

research

∙ 05/28/2023

StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation

Direct speech-to-speech translation (S2ST) has gradually become popular ...

Kun Song, et al.

research

∙ 02/08/2023

Gestalt-Guided Image Understanding for Few-Shot Learning

Due to the scarcity of available data, deep learning does not perform we...

Kun Song, et al.

research

∙ 01/17/2023

Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval

Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) is a challenging cross-...

Yuchen Wu, et al.

research

∙ 11/19/2022

Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling

This paper aims to synthesize target speaker's speech with desired speak...

Xinfa Zhu, et al.

research

∙ 11/02/2022

DSPGAN: a GAN-based universal vocoder for high-fidelity TTS by time-frequency domain supervision from DSP

Recent development of neural vocoders based on the generative adversaria...

Kun Song, et al.

research

∙ 10/31/2022

Robust MelGAN: A robust universal neural vocoder for high-fidelity TTS

In current two-stage neural text-to-speech (TTS) paradigm, it is ideal t...

Kun Song, et al.

research

∙ 06/01/2022

AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation

Speaker adaptation in text-to-speech synthesis (TTS) is to finetune a pr...

Kun Song, et al.

research

∙ 01/20/2022

Adaptive neighborhood Metric learning

In this paper, we reveal that metric learning would suffer from serious ...

Kun Song, et al.

research

∙ 11/22/2019

Adaptive Nearest Neighbor: A General Framework for Distance Metric Learning

K-NN classifier is one of the most famous classification algorithms, who...

Kun Song, et al.

