Enabling large language models to effectively utilize real-world tools i...
On-device end-to-end (E2E) models have shown improvements over a convent...
In voice-enabled applications, a predetermined hotword isusually used to...
While a streaming voice assistant system has been used in many applicati...
In this paper, we propose a dynamic cascaded encoder Automatic Speech
Re...
Personalization of on-device speech recognition (ASR) has seen explosive...
VoiceFilter-Lite is a speaker-conditioned voice separation model that pl...
In this paper, we propose a solution to allow speaker conditioned speech...
In this paper, we introduce a streaming keyphrase detection system that ...
Analysis of online reviews has attracted great attention with broad
appl...
End-to-end (E2E) models have shown to outperform state-of-the-art
conven...
Knowledge Distillation is an effective method of transferring knowledge ...
In automatic speech recognition (ASR), model pruning is a widely adopted...
Thus far, end-to-end (E2E) models have not been shown to outperform
stat...
The requirements for many applications of state-of-the-art speech recogn...
Lingvo is a Tensorflow framework offering a complete solution for
collab...
End-to-end (E2E) models, which directly predict output character sequenc...
Augmented reality (AR) displays become more and more popular recently,
b...