Persian phonemes recognition using PPNet

12/17/2018
by   Saber Malekzadeh, et al.
0

In this paper a new approach for recognition of Persian phonemes on the PCVC speech dataset is proposed. Nowadays deep neural networks are playing main rule in classification tasks. However the best results in speech recognition are not as good as human recognition rate yet. Deep learning techniques are shown their outstanding performance over so many classification tasks like image classification, document classification, etc. Also in some tasks their performance were even better than human. So the reason why ASR (automatic speech recognition) systems are not as good as the human speech recognition system is mostly depend on features of data is fed to deep neural networks. In this research first sound samples are cut for exact extraction of phoneme sounds in 50ms samples. Then phonemes are grouped in 30 groups; Containing 23 consonants, 6 vowels and a silence phoneme. STFT (Short time Fourier transform) is applied on them and Then STFT results are given to PPNet (A new deep convolutional neural network architecture) classifier and a total average of 75.87 algorithms on Separated Persian phonemes (Like in PCVC speech dataset).

READ FULL TEXT
research
04/30/2019

English Broadcast News Speech Recognition by Humans and Machines

With recent advances in deep learning, considerable attention has been g...
research
10/04/2018

Deep Learning Approaches for Understanding Simple Speech Commands

Automatic classification of sound commands is becoming increasingly impo...
research
02/19/2020

Gradient-Adjusted Neuron Activation Profiles for Comprehensive Introspection of Convolutional Speech Recognition Models

Deep Learning based Automatic Speech Recognition (ASR) models are very s...
research
07/29/2018

Towards Automatic Speech Identification from Vocal Tract Shape Dynamics in Real-time MRI

Vocal tract configurations play a vital role in generating distinguishab...
research
04/10/2022

Deep Embeddings for Robust User-Based Amateur Vocal Percussion Classification

Vocal Percussion Transcription (VPT) is concerned with the automatic det...
research
06/10/2022

Training Neural Networks using SAT solvers

We propose an algorithm to explore the global optimization method, using...
research
06/16/2017

One Model To Learn Them All

Deep learning yields great results across many fields, from speech recog...

Please sign up or login with your details

Forgot password? Click here to reset