Novel Cascaded Gaussian Mixture Model-Deep Neural Network Classifier for Speaker Identification in Emotional Talking Environments

10/11/2018
by   Ismail Shahin, et al.
0

This research is an effort to present an effective approach to enhance text-independent speaker identification performance in emotional talking environments based on novel classifier called cascaded Gaussian Mixture Model-Deep Neural Network (GMM-DNN). Our current work focuses on proposing, implementing and evaluating a new approach for speaker identification in emotional talking environments based on cascaded Gaussian Mixture Model-Deep Neural Network as a classifier. The results point out that the cascaded GMM-DNN classifier improves speaker identification performance at various emotions using two distinct speech databases: Emirati speech database (Arabic United Arab Emirates dataset) and Speech Under Simulated and Actual Stress (SUSAS) English dataset. The proposed classifier outperforms classical classifiers such as Multilayer Perceptron (MLP) and Support Vector Machine (SVM) in each dataset. Speaker identification performance that has been attained based on the cascaded GMM-DNN is similar to that acquired from subjective assessment by human listeners.

READ FULL TEXT

page 24

page 26

research
02/11/2021

CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions

This work aims at intensifying text-independent speaker identification p...
research
12/26/2021

Novel Hybrid DNN Approaches for Speaker Verification in Emotional and Stressful Talking Environments

In this work, we conducted an empirical comparative study of the perform...
research
04/02/2019

Experiments on Open-Set Speaker Identification with Discriminatively Trained Neural Networks

This paper presents a study on discriminative artificial neural network ...
research
12/25/2017

Leveraging Native Language Speech for Accent Identification using Deep Siamese Networks

The problem of automatic accent identification is important for several ...
research
10/28/2017

Investigation of Frame Alignments for GMM-based Text-prompted Speaker Verification

The frame alignment acts as an important role in GMM-based speaker verif...
research
12/11/2018

A cascaded multiple-speaker localization and tracking system

This paper presents an online multiple-speaker localization and tracking...
research
03/05/2022

Language vs Speaker Change: A Comparative Study

Spoken language change detection (LCD) refers to detecting language swit...

Please sign up or login with your details

Forgot password? Click here to reset