EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement

02/14/2022
by Kuan-Chen Wang, et al.

Multimodal learning has proven effective at improving speech enhancement (SE) performance, especially in challenging conditions such as low signal-to-noise ratios, speech noise, or unseen noise types. Previous studies have built multimodal SE systems on several types of auxiliary data, such as lip images, electropalatography, or electromagnetic midsagittal articulography. In this paper, we propose a novel EMGSE framework for multimodal SE, which integrates audio and facial electromyography (EMG) signals. Facial EMG is a biological signal that carries articulatory movement information and can be measured non-invasively. Experimental results show that the proposed EMGSE system outperforms the audio-only SE system. The benefits of fusing EMG signals with acoustic signals for SE are notable under challenging circumstances. Furthermore, this study reveals that cheek EMG is sufficient for SE.
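
The abstract names low signal-to-noise ratio (SNR) as one of the challenging conditions. As a rough illustration of what that means, the sketch below mixes a noise signal into clean speech at a chosen SNR using the standard definition SNR = 10 log10(P_speech / P_noise). It is not taken from the paper: the function names `mix_at_snr` and `measured_snr_db`, the synthetic sine "speech", and the Gaussian noise are all assumptions made for illustration only.

```python
import numpy as np


def mix_at_snr(speech, noise, snr_db):
    """Return speech + scaled noise so that the mixture has the requested SNR in dB.

    Hypothetical helper for illustration; not the paper's data pipeline.
    """
    speech = np.asarray(speech, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[: len(speech)]

    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    if speech_power == 0.0 or noise_power == 0.0:
        raise ValueError("speech and noise must both have non-zero power")

    # Solve SNR_dB = 10 * log10(P_speech / P_scaled_noise) for the noise power.
    target_noise_power = speech_power / (10.0 ** (snr_db / 10.0))
    scale = np.sqrt(target_noise_power / noise_power)
    return speech + scale * noise


def measured_snr_db(speech, noisy):
    """Compute the SNR in dB of `noisy`, treating `noisy - speech` as the noise."""
    speech = np.asarray(speech, dtype=np.float64)
    residual = np.asarray(noisy, dtype=np.float64) - speech
    return 10.0 * np.log10(np.mean(speech ** 2) / np.mean(residual ** 2))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    t = np.arange(16000) / 16000.0               # one second at an assumed 16 kHz rate
    clean = np.sin(2.0 * np.pi * 220.0 * t)      # synthetic stand-in for speech
    noise = rng.standard_normal(len(clean))      # synthetic stand-in for noise
    noisy = mix_at_snr(clean, noise, snr_db=-5.0)
    print(round(measured_snr_db(clean, noisy), 3))
```

At negative SNR values the noise carries more power than the speech, which is the kind of condition where an auxiliary signal that acoustic noise does not corrupt, such as EMG, would be expected to help most.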

research
09/01/2017

Audio-Visual Speech Enhancement based on Multimodal Deep Convolutional Neural Network

Speech enhancement (SE) aims to reduce noise in speech signals. Most SE ...
research
10/27/2022

Audio Signal Enhancement with Learning from Positive and Unlabelled Data

Supervised learning is a mainstream approach to audio signal enhancement...
research
11/22/2019

Time-Domain Multi-modal Bone/air Conducted Speech Enhancement

Integrating modalities, such as video signals with speech, has been show...
research
11/08/2018

Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform

In this paper, we propose a novel speech enhancement (SE) method by expl...
research
12/17/2020

Speech Enhancement with Zero-Shot Model Selection

Recent research on speech enhancement (SE) has seen the emergence of dee...
research
08/13/2020

Incorporating Broad Phonetic Information for Speech Enhancement

In noisy conditions, knowing speech contents facilitates listeners to mo...
research
04/30/2019

Incorporating Symbolic Sequential Modeling for Speech Enhancement

In a noisy environment, a lossy speech signal can be automatically resto...
