Bunched LPCNet2: Efficient Neural Vocoders Covering Devices from Cloud to Edge

03/27/2022
by   Sangjun Park, et al.

Text-to-Speech (TTS) services that run on edge devices have many advantages over cloud TTS, e.g., in terms of latency and privacy. However, neural vocoders with low complexity and a small model footprint inevitably generate annoying sounds. This study proposes Bunched LPCNet2, an improved LPCNet architecture that provides efficient, high-quality synthesis on cloud servers and low-complexity synthesis on low-resource edge devices. A single logistic distribution achieves computational efficiency, and insightful tricks reduce the model footprint while maintaining speech quality. A DualRate architecture, which generates speech at a lower sampling rate from a prosody model, is also proposed to reduce maintenance costs. The experiments demonstrate that Bunched LPCNet2 generates satisfactory speech quality with a model footprint of 1.1 MB while operating faster than real time on a Raspberry Pi 3B. Our audio samples are available at https://srtts.github.io/bunchedLPCNet2.


research
07/03/2020

CacheNet: A Model Caching Framework for Deep Learning Inference on the Edge

The success of deep neural networks (DNN) in machine perception applicat...
research
11/25/2020

FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge

Nowadays more and more applications can benefit from edge-based text-to-...
research
12/08/2022

Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity

GAN vocoders are currently one of the state-of-the-art methods for build...
research
05/23/2023

EfficientSpeech: An On-Device Text to Speech Model

State of the art (SOTA) neural text to speech (TTS) models can generate ...
research
08/31/2023

LightGrad: Lightweight Diffusion Probabilistic Model for Text-to-Speech

Recent advances in neural text-to-speech (TTS) models bring thousands of...
research
03/13/2023

AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments

Deep learning models are increasingly deployed to edge devices for real-...
research
11/27/2020

Rethinking Generalization in American Sign Language Prediction for Edge Devices with Extremely Low Memory Footprint

Due to the boom in technical compute in the last few years, the world ha...
