Feature extraction with mel scale separation method on noise audio recordings

12/30/2021
by   Roy Rudolf Huizen, et al.
0

This paper focuses on improving the accuracy of noise audio recordings. High-quality audio recording, extraction using the mel frequency cepstral coefficients (MFCC) method produces high accuracy. While the low-quality is because of noise, the accuracy is low. Improved accuracy by investigating the effect of bandwidth on the mel scale. The proposed improvement uses the mel scale separation methods into two frequency channels (MFCC dual channel). For the comparison method using the mel scale bandwidth without separation (MFCC single-channel). Feature analysis using k-mean clustering. The data uses a noise variance of up to -16 dB. Testing on the MFCC single channel method for -16 dB noise has an accuracy of 47.5 an accuracy better of 76.25 to reduce noise before extraction. The result is that the MFCC single-channel method has an accuracy of 82.5 accuracy better of 83.75 single-channel method has an accuracy of 92.5 has an accuracy better of 97.5 bandwidth to increase accuracy. The MFCC dual-channel method has higher accuracy.

READ FULL TEXT

page 4

page 10

research
05/17/2019

A comprehensive study of speech separation: spectrogram vs waveform separation

Speech separation has been studied widely for single-channel close-talk ...
research
03/22/2018

Speaker Clustering With Neural Networks And Audio Processing

Speaker clustering is the task of differentiating speakers in a recordin...
research
06/05/2021

Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication

Nowadays, there is a strong need to deploy the target speaker separation...
research
11/21/2017

Quantifying Performance of Bipedal Standing with Multi-channel EMG

Spinal cord stimulation has enabled humans with motor complete spinal co...
research
12/02/2020

Improved MVDR Beamforming Using LSTM Speech Models to Clean Spatial Clustering Masks

Spatial clustering techniques can achieve significant multi-channel nois...
research
06/29/2022

Decision Forest Based EMG Signal Classification with Low Volume Dataset Augmented with Random Variance Gaussian Noise

Electromyography signals can be used as training data by machine learnin...
research
10/25/2022

Enhanced Fuzzy Decomposition of Sound Into Sines, Transients, and Noise

The decomposition of sounds into sines, transients, and noise is a long-...

Please sign up or login with your details

Forgot password? Click here to reset