Large Language Models (LLMs) demonstrate impressive capabilities, yet in...
Recent innovations in self-supervised representation learning have led t...
Intent classifiers are vital to the successful operation of virtual agen...
Self-supervised pre-trained features have consistently delivered state-o...
Large speech emotion recognition datasets are hard to obtain, and small ...
We present a comprehensive study on building and adapting RNN transducer...
Training an end-to-end (E2E) neural network speech-to-intent (S2I) syste...
An essential component of spoken language understanding (SLU) is slot fi...
With the rise of voice-activated applications, the need for speaker reco...
We present a lightweight adaptable neural TTS system with high quality o...