Spoken semantic parsing (SSP) involves generating machine-comprehensible...
Large-scale generative models such as GPT and DALL-E have revolutionized...
This paper presents a method for selecting appropriate synthetic speech
...
The awareness for biased ASR datasets or models has increased notably in...
Self-supervised learning via masked prediction pre-training (MPPT) has s...
It is well known that many machine learning systems demonstrate bias tow...
Although speaker verification has conventionally been an audio-only task...
Spoken language understanding (SLU) datasets, like many other machine
le...
In scenarios where multiple speakers talk at the same time, it is import...
We propose an unsupervised speaker adaptation method inspired by the neu...