Real to H-space Encoder for Speech Recognition

06/17/2019
by   Titouan Parcollet, et al.
0

Deep neural networks (DNNs) and more precisely recurrent neural networks (RNNs) are at the core of modern automatic speech recognition systems, due to their efficiency to process input sequences. Recently, it has been shown that different input representations, based on multidimensional algebras, such as complex and quaternion numbers, are able to bring to neural networks a more natural, compressive and powerful representation of the input signal by outperforming common real-valued NNs. Indeed, quaternion-valued neural networks (QNNs) better learn both internal dependencies, such as the relation between the Mel-filter-bank value of a specific time frame and its time derivatives, and global dependencies, describing the relations that exist between time frames. Nonetheless, QNNs are limited to quaternion-valued input signals, and it is difficult to benefit from this powerful representation with real-valued input data. This paper proposes to tackle this weakness by introducing a real-to-quaternion encoder that allows QNNs to process any one dimensional input features, such as traditional Mel-filter-banks for automatic speech recognition.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/21/2018

Speech recognition with quaternion neural networks

Neural network architectures are at the core of powerful automatic speec...
research
06/20/2018

Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition

Recently, the connectionist temporal classification (CTC) model coupled ...
research
05/18/2020

Quaternion Neural Networks for Multi-channel Distant Speech Recognition

Despite the significant progress in automatic speech recognition (ASR), ...
research
10/31/2018

Quaternion Convolutional Neural Networks for Heterogeneous Image Processing

Convolutional neural networks (CNN) have recently achieved state-of-the-...
research
09/14/2023

Understanding Vector-Valued Neural Networks and Their Relationship with Real and Hypercomplex-Valued Neural Networks

Despite the many successful applications of deep learning models for mul...
research
12/22/2014

Learning linearly separable features for speech recognition using convolutional neural networks

Automatic speech recognition systems usually rely on spectral-based feat...
research
05/30/2017

Recurrent Estimation of Distributions

This paper presents the recurrent estimation of distributions (RED) for ...

Please sign up or login with your details

Forgot password? Click here to reset