Feature Joint-State Posterior Estimation in Factorial Speech Processing Models using Deep Neural Networks

07/10/2017
by   Mahdi Khademian, et al.
0

This paper proposes a new method for calculating joint-state posteriors of mixed-audio features using deep neural networks to be used in factorial speech processing models. The joint-state posterior information is required in factorial models to perform joint-decoding. The novelty of this work is its architecture which enables the network to infer joint-state posteriors from the pairs of state posteriors of stereo features. This paper defines an objective function to solve an underdetermined system of equations, which is used by the network for extracting joint-state posteriors. It develops the required expressions for fine-tuning the network in a unified way. The experiments compare the proposed network decoding results to those of the vector Taylor series method and show 2.3 speech separation and recognition challenge. This achievement is substantial when we consider the simplicity of joint-state posterior extraction provided by deep neural networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2016

Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models

A Pascal challenge entitled monaural multi-talker speech recognition was...
research
07/29/2015

EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding

The performance of automatic speech recognition (ASR) has improved treme...
research
06/14/2022

Reconstructing vehicles from orthographic drawings using deep neural networks

This paper explores the current state-of-the-art of object reconstructio...
research
07/31/2016

Automatic 3D Point Set Reconstruction from Stereo Laparoscopic Images using Deep Neural Networks

In this paper, an automatic approach to predict 3D coordinates from ster...
research
09/14/2018

Automatic Catchphrase Extraction from Legal Case Documents via Scoring using Deep Neural Networks

In this paper, we present a method of automatic catchphrase extracting f...
research
12/31/2017

Using Deep Neural Network Approximate Bayesian Network

We present a new method to approximate posterior probabilities of Bayesi...
research
03/09/2015

Modeling State-Conditional Observation Distribution using Weighted Stereo Samples for Factorial Speech Processing Models

This paper investigates the effectiveness of factorial speech processing...

Please sign up or login with your details

Forgot password? Click here to reset