Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification

08/04/2021

∙

Running automatic speech recognition (ASR) on edge devices is non-trivial due to resource constraints, especially in scenarios that require supporting multiple languages. We propose a new approach to enable multilingual speech recognition on edge devices. This approach uses both language identification and accent identification to select one of multiple monolingual ASR models on-the-fly, each fine-tuned for a particular accent. Initial results for both recognition performance and resource usage are promising with our approach using less than 1/12th of the memory consumed by other solutions.

READ FULL TEXT

Dyn-ASR: Compact, Multilingual Speech Recognition via Spoken Language and Accent Identification

Sign in with Google

Consider DeepAI Pro