Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition

06/20/2018
by   Titouan Parcollet, et al.
2

Recently, the connectionist temporal classification (CTC) model coupled with recurrent (RNN) or convolutional neural networks (CNN), made it easier to train speech recognition systems in an end-to-end fashion. However in real-valued models, time frame components such as mel-filter-bank energies and the cepstral coefficients obtained from them, together with their first and second order derivatives, are processed as individual elements, while a natural alternative is to process such components as composed entities. We propose to group such elements in the form of quaternions and to process these quaternions using the established quaternion algebra. Quaternion numbers and quaternion neural networks have shown their efficiency to process multidimensional inputs as entities, to encode internal dependencies, and to solve many tasks with less learning parameters than real-valued models. This paper proposes to integrate multiple feature views in quaternion-valued convolutional neural network (QCNN), to be used for sequence-to-sequence mapping with the CTC model. Promising results are reported using simple QCNNs in phoneme recognition experiments with the TIMIT corpus. More precisely, QCNNs obtain a lower phoneme error rate (PER) with less learning parameters than a competing model based on real-valued CNNs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/21/2018

Speech recognition with quaternion neural networks

Neural network architectures are at the core of powerful automatic speec...
research
06/17/2019

Real to H-space Encoder for Speech Recognition

Deep neural networks (DNNs) and more precisely recurrent neural networks...
research
10/31/2018

Quaternion Convolutional Neural Networks for Heterogeneous Image Processing

Convolutional neural networks (CNN) have recently achieved state-of-the-...
research
05/26/2022

Acute Lymphoblastic Leukemia Detection Using Hypercomplex-Valued Convolutional Neural Networks

This paper features convolutional neural networks defined on hypercomple...
research
05/07/2020

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context

Convolutional neural networks (CNN) have shown promising results for end...
research
05/30/2017

Recurrent Estimation of Distributions

This paper presents the recurrent estimation of distributions (RED) for ...
research
02/29/2016

On Complex Valued Convolutional Neural Networks

Convolutional neural networks (CNNs) are the cutting edge model for supe...

Please sign up or login with your details

Forgot password? Click here to reset