For fine-grained generation and recognition tasks such as
minimally-supe...
Recently, there has been a growing interest in text-to-speech (TTS) meth...
Self-supervised speech models are a rapidly developing research topic in...
The rapid advancement of spoofing algorithms necessitates the developmen...
Audio deepfake detection is an emerging topic in the artificial intellig...
Current fake audio detection relies on hand-crafted features, which lose...
Text-to-speech (TTS) and voice conversion (VC) are two different tasks b...
Text-based speech editing allows users to edit speech by intuitively cut...
Previous databases have been designed to further the development of fake...
Current end-to-end code-switching Text-to-Speech (TTS) can already gener...
Speech is the fundamental mode of human communication, and its synthesis...
Many effective attempts have been made for deepfake audio detection. How...
Many effective attempts have been made for fake audio detection. However...
The existing fake audio detection systems often rely on expert experienc...
Fake audio detection is a growing concern and some relevant datasets hav...
The traditional vocoders have the advantages of high synthesis efficienc...
The text-based speech editor allows the editing of speech through intuit...
Audio deepfake detection is an emerging topic, which was included in the...
End-to-end singing voice synthesis (SVS) is attractive due to the avoida...
Diverse promising datasets have been designed to hold back the developme...