Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition

02/17/2022
by   Chao-Han Huck Yang, et al.

In this work, we aim to enhance the robustness of end-to-end automatic speech recognition (ASR) against adversarially noisy speech examples. We focus on a rigorous and empirical "closed-model adversarial robustness" setting (e.g., on-device or cloud applications), in which the adversarial noise is generated only by closed-model optimization (e.g., evolutionary and zeroth-order estimation), without direct access to the gradient information of the targeted ASR model. We propose an advanced Bayesian neural network (BNN) based adversarial detector, which can model latent distributions against adaptive adversarial perturbation with divergence measurement. We further simulate deployment scenarios of RNN Transducer, Conformer, and wav2vec-2.0 based ASR systems with the proposed adversarial detection system. Leveraging the proposed BNN based detection system, we improve the detection rate by +2.77 to +5.42 (relative +3.03 to +6.26) on LibriSpeech datasets compared to current model enhancement methods against adversarial speech examples.
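
To make the closed-model setting concrete, the sketch below shows one common form of zeroth-order estimation: a coordinate-wise symmetric finite difference that approximates a gradient from score queries alone. The abstract does not say which estimator the authors use, so this is only an illustration under assumptions. The names `zeroth_order_gradient`, `perturb`, and `toy_score` are hypothetical, and `toy_score` is a toy quadratic standing in for the scalar output of a queried ASR model.

```python
from typing import Callable, List


def zeroth_order_gradient(
    score_fn: Callable[[List[float]], float],
    x: List[float],
    h: float = 1e-4,
) -> List[float]:
    """Approximate the gradient of score_fn at x using only function queries.

    Each coordinate uses a symmetric difference quotient, so the model's
    internal gradients are never accessed (two queries per coordinate).
    """
    grad = []
    for i in range(len(x)):
        plus = list(x)
        minus = list(x)
        plus[i] += h
        minus[i] -= h
        grad.append((score_fn(plus) - score_fn(minus)) / (2.0 * h))
    return grad


def perturb(
    score_fn: Callable[[List[float]], float],
    x: List[float],
    step: float = 0.1,
    h: float = 1e-4,
) -> List[float]:
    """Take one ascent step along the estimated gradient to raise the score."""
    grad = zeroth_order_gradient(score_fn, x, h)
    return [xi + step * gi for xi, gi in zip(x, grad)]


def toy_score(x: List[float]) -> float:
    # Hypothetical stand-in for a closed model's scalar loss; not an ASR system.
    return sum((xi - 1.0) ** 2 for xi in x)


if __name__ == "__main__":
    clean = [0.0, 0.0, 0.0]
    adversarial = perturb(toy_score, clean)
    print(toy_score(clean), toy_score(adversarial))
```

The estimator costs two queries per input dimension, which is one reason practical closed-model attacks on audio typically estimate only a subset of coordinates per step or fall back on evolutionary search.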

research
08/31/2021

Adversarial Example Devastation and Detection on Speech Recognition System by Adding Random Noise

An automatic speech recognition (ASR) system based on a deep neural netw...
research
09/04/2018

HASP: A High-Performance Adaptive Mobile Security Enhancement Against Malicious Speech Recognition

Nowadays, machine learning based Automatic Speech Recognition (ASR) tech...
research
03/31/2020

Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement

Recent studies have highlighted adversarial examples as ubiquitous threa...
research
08/02/2023

Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time

Automatic speech recognition (ASR) systems have been shown to be vulnera...
research
10/26/2022

There is more than one kind of robustness: Fooling Whisper with adversarial examples

Whisper is a recent Automatic Speech Recognition (ASR) model displaying ...
research
07/25/2020

MP3 Compression To Diminish Adversarial Noise in End-to-End Speech Recognition

Audio Adversarial Examples (AAE) represent specially created inputs mean...
research
12/18/2019

A Cycle-GAN Approach to Model Natural Perturbations in Speech for ASR Applications

Naturally introduced perturbations in audio signal, caused by emotional ...
