Implementation of an Automatic Syllabic Division Algorithm from Speech Files in Portuguese Language

01/29/2015
by E. L. F. Da Silva, et al.

A new algorithm for automatic syllabic splitting of speech in the Portuguese language is proposed, based on the envelope of the speech signal in the input audio file. A computational implementation in MATLAB is presented and made available at the URL http://www2.ee.ufpe.br/codec/divisao_silabica.html. Because of its simplicity, the proposed method is attractive for embedded systems (e.g. iPhones). It can also serve as a pre-screening stage to assist more sophisticated methods. Voice excerpts that contain more than one syllable but are identified by a single envelope are named super-syllables, and they are subsequently separated. The results indicate which samples correspond to the beginning and end of each detected syllable. Preliminary tests were performed on fifty words, with an identification rate of circa 70% (… incorporated to treat particular phonemes). The algorithm is also useful in voice command systems, as a tool for teaching the Portuguese language, or for patients with speech pathologies.
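To make the idea of envelope-based splitting concrete, the following is a minimal Python sketch, not the authors' MATLAB implementation. It assumes a simple envelope (full-wave rectification followed by a moving average) and a relative threshold; the function names `envelope` and `segment_by_envelope` and the parameters `win` and `rel_threshold` are hypothetical choices made for illustration. It returns the first and last sample index of each detected segment, and it does not attempt the super-syllable separation step described in the abstract.

```python
import numpy as np


def envelope(signal, win):
    """Amplitude envelope: full-wave rectification, then a moving average."""
    kernel = np.ones(win) / win
    return np.convolve(np.abs(signal), kernel, mode="same")


def segment_by_envelope(signal, win=200, rel_threshold=0.2):
    """Return (start, end) sample indices of regions whose envelope
    exceeds rel_threshold times the envelope maximum.

    Assumed illustration only: win and rel_threshold are arbitrary values,
    not parameters taken from the paper.
    """
    env = envelope(signal, win)
    active = env > rel_threshold * env.max()
    # Pad with False so every active run has a rising and a falling edge.
    padded = np.concatenate(([False], active, [False]))
    edges = np.diff(padded.astype(int))
    starts = np.flatnonzero(edges == 1)
    ends = np.flatnonzero(edges == -1) - 1  # inclusive end index
    return list(zip(starts.tolist(), ends.tolist()))
```

On real speech, adjacent syllables often share one continuous envelope lobe, so a fixed threshold like this would tend to merge them into a single segment; that is the situation the abstract calls a super-syllable, which needs a further separation step.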


