The aim of this paper is to investigate the connection between learning
...
Filler words like “um" or “uh" are common in spontaneous speech. It is
d...
In this paper, we present TridentSE, a novel architecture for speech
enh...
This paper proposes a new "decompose-and-edit" paradigm for the text-bas...
This paper addresses the unsupervised learning of content-style decompos...
Given a piece of speech and its transcript text, text-based speech editi...
This paper presents a self-supervised learning framework, named MGF, for...
Time-frequency (T-F) domain masking is a mainstream approach for
single-...