Dacheng Yin | DeepAI

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Nanning Zheng
97 publications
Wenjun Zeng
95 publications
Zheng-Jun Zha
76 publications
Yan Lu
70 publications
Zhiwei Xiong
42 publications
Zhizheng Zhang
39 publications
Sheng Zhao
36 publications
Lijun Wu
32 publications
Chong Luo
30 publications
Xiaoqiang Wang
28 publications
Yuwang Wang
18 publications

research

∙ 04/25/2023

Learning Trajectories are Generalization Indicators

The aim of this paper is to investigate the connection between learning ...

0 Jingwen Fu, et al. ∙

research

∙ 04/12/2023

Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss

Filler words like “um" or “uh" are common in spontaneous speech. It is d...

0 Zhiyuan Zhao, et al. ∙

research

∙ 10/24/2022

TridentSE: Guiding Speech Enhancement with 32 Global Tokens

In this paper, we present TridentSE, a novel architecture for speech enh...

0 Dacheng Yin, et al. ∙

research

∙ 06/28/2022

RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion

This paper proposes a new "decompose-and-edit" paradigm for the text-bas...

0 Dacheng Yin, et al. ∙

research

∙ 02/24/2022

Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph

This paper addresses the unsupervised learning of content-style decompos...

2 Dacheng Yin, et al. ∙

research

∙ 09/12/2021

Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Given a piece of speech and its transcript text, text-based speech editi...

0 Chuanxin Tang, et al. ∙

research

∙ 02/03/2021

General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework

This paper presents a self-supervised learning framework, named MGF, for...

0 Yucheng Zhao, et al. ∙

research

∙ 11/12/2019

PHASEN: A Phase-and-Harmonics-Aware Speech Enhancement Network

Time-frequency (T-F) domain masking is a mainstream approach for single-...

0 Dacheng Yin, et al. ∙