Federated Domain Adaptation for ASR with Full Self-Supervision

03/30/2022
by Junteng Jia, et al.

Cross-device federated learning (FL) protects user privacy by collaboratively training a model on user devices, thereby eliminating the need to collect, store, and manually label user data. While important topics such as the FL training algorithm, non-IID-ness, and differential privacy have been well studied in the literature, this paper focuses on two challenges of practical importance for improving on-device ASR: the lack of ground-truth transcriptions, and the scarcity of compute resources and network bandwidth on edge devices. First, we propose an FL system for on-device ASR domain adaptation with full self-supervision, which uses self-labeling together with data augmentation and filtering techniques. The system can improve a strong Emformer-Transducer based ASR model pretrained on out-of-domain data, using in-domain audio without any ground-truth transcriptions. Second, to reduce the training cost, we propose a self-restricted RNN Transducer (SR-RNN-T) loss, a variant of alignment-restricted RNN-T that uses Viterbi alignments from self-supervision. To further reduce the compute and network cost, we systematically explore adapting only a subset of weights in the Emformer-Transducer. Our best training recipe achieves a 12.9% relative WER reduction over the strong out-of-domain baseline, which equals 70% of the reduction achievable with full human supervision and centralized training.
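To make the closing figures concrete, the arithmetic can be sketched as follows. This is a minimal illustration, assuming the "70%" is a ratio of relative WER reductions; under that reading, the fully supervised, centralized setup would correspond to roughly an 18.4% relative WER reduction. The function names and the baseline WER of 10.0 are hypothetical and are not taken from the paper.

```python
def relative_wer_reduction(baseline_wer: float, adapted_wer: float) -> float:
    """Return the relative WER reduction as a fraction of the baseline WER."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - adapted_wer) / baseline_wer


def implied_supervised_reduction(self_supervised_reduction: float,
                                 fraction_recovered: float) -> float:
    """Infer the fully supervised reduction from the fraction recovered.

    Assumes fraction_recovered = self_supervised / fully_supervised.
    """
    if not 0 < fraction_recovered <= 1:
        raise ValueError("fraction_recovered must be in (0, 1]")
    return self_supervised_reduction / fraction_recovered


if __name__ == "__main__":
    # Hypothetical baseline WER, chosen only to illustrate the definition.
    baseline = 10.0
    adapted = baseline * (1 - 0.129)  # WER after a 12.9% relative reduction
    print(f"relative reduction: {relative_wer_reduction(baseline, adapted):.3f}")

    # Figures quoted in the abstract: 12.9% reduction, 70% of the supervised gain.
    supervised = implied_supervised_reduction(0.129, 0.70)
    print(f"implied supervised reduction: {supervised:.3f}")
```

Note that the relative reduction is independent of the absolute baseline WER, which is why a placeholder baseline is enough for the illustration.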


