Atss-Net: Target Speaker Separation via Attention-based Neural Network

05/19/2020
by   Tingle Li, et al.
0

Recently, Convolutional Neural Network (CNN) and Long short-term memory (LSTM) based models have been introduced to deep learning-based target speaker separation. In this paper, we propose an Attention-based neural network (Atss-Net) in the spectrogram domain for the task. It allows the network to compute the correlation between each feature parallelly, and using shallower layers to extract more features, compared with the CNN-LSTM architecture. Experimental results show that our Atss-Net yields better performance than the VoiceFilter, although it only contains half of the parameters. Furthermore, our proposed model also demonstrates promising performance in speech enhancement.

READ FULL TEXT
research
05/31/2021

Noise Classification Aided Attention-Based Neural Network for Monaural Speech Enhancement

This paper proposes an noise type classification aided attention-based n...
research
03/22/2017

Hierarchical RNN with Static Sentence-Level Attention for Text-Based Speaker Change Detection

Traditional speaker change detection in dialogues is typically based on ...
research
01/21/2023

Estimation of Sea State Parameters from Ship Motion Responses Using Attention-based Neural Networks

On-site estimation of sea state parameters is crucial for ship navigatio...
research
01/07/2021

Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario

Multi-task learning (MTL) and attention mechanism have been proven to ef...
research
06/07/2014

Application and Verification of Algorithm Learning Based Neural Network

This paper has been withdrawn by the author due to a crucial accuracy er...
research
02/08/2021

Extracting the Locus of Attention at a Cocktail Party from Single-Trial EEG using a Joint CNN-LSTM Model

Human brain performs remarkably well in segregating a particular speaker...
research
08/06/2022

Detecting Algorithmically Generated Domains Using a GCNN-LSTM Hybrid Neural Network

Domain generation algorithm (DGA) is used by botnets to build a stealthy...

Please sign up or login with your details

Forgot password? Click here to reset