Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments

08/27/2022
by   Muskan Garg, et al.
0

This research work is about recent development made in speech recognition. In this research work, analysis of isolated digit recognition in the presence of different bit rates and at different noise levels has been performed. This research work has been carried using audacity and HTK toolkit. Hidden Markov Model (HMM) is the recognition model which was used to perform this experiment. The feature extraction techniques used are Mel Frequency Cepstrum coefficient (MFCC), Linear Predictive Coding (LPC), perceptual linear predictive (PLP), mel spectrum (MELSPEC), filter bank (FBANK). There were three types of different noise levels which have been considered for testing of data. These include random noise, fan noise and random noise in real time environment. This was done to analyse the best environment which can used for real time applications. Further, five different types of commonly used bit rates at different sampling rates were considered to find out the most optimum bit rate.

READ FULL TEXT

page 6

page 7

research
07/19/2013

Speaker Independent Continuous Speech to Text Converter for Mobile Application

An efficient speech to text converter for mobile application is presente...
research
07/12/2019

Pykaldi2: Yet another speech toolkit based on Kaldi and Pytorch

We introduce PyKaldi2 speech recognition toolkit implemented based on Ka...
research
09/09/2016

An empirical study on the effects of different types of noise in image classification tasks

Image classification is one of the main research problems in computer vi...
research
07/09/2014

Online Stroke and Akshara Recognition GUI in Assamese Language Using Hidden Markov Model

The work describes the development of Online Assamese Stroke & Akshara R...
research
04/15/2016

Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding

Most current very low bit rate (VLBR) speech coding systems use hidden M...
research
04/30/2020

A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applications

Auditory models are commonly used as feature extractors for automatic sp...
research
07/14/2015

Feature Normalisation for Robust Speech Recognition

Speech recognition system performance degrades in noisy environments. If...

Please sign up or login with your details

Forgot password? Click here to reset