Structural Analysis of Hindi Phonetics and A Method for Extraction of Phonetically Rich Sentences from a Very Large Hindi Text Corpus

01/30/2017
by   Shrikant Malviya, et al.
0

Automatic speech recognition (ASR) and Text to speech (TTS) are two prominent area of research in human computer interaction nowadays. A set of phonetically rich sentences is in a matter of importance in order to develop these two interactive modules of HCI. Essentially, the set of phonetically rich sentences has to cover all possible phone units distributed uniformly. Selecting such a set from a big corpus with maintaining phonetic characteristic based similarity is still a challenging problem. The major objective of this paper is to devise a criteria in order to select a set of sentences encompassing all phonetic aspects of a corpus with size as minimum as possible. First, this paper presents a statistical analysis of Hindi phonetics by observing the structural characteristics. Further a two stage algorithm is proposed to extract phonetically rich sentences with a high variety of triphones from the EMILLE Hindi corpus. The algorithm consists of a distance measuring criteria to select a sentence in order to improve the triphone distribution. Moreover, a special preprocessing method is proposed to score each triphone in terms of inverse probability in order to fasten the algorithm. The results show that the approach efficiently build uniformly distributed phonetically-rich corpus with optimum number of sentences.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/11/2022

BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm

The performance of speech-processing models is heavily influenced by the...
research
08/13/2020

MASRI-HEADSET: A Maltese Corpus for Speech Recognition

Maltese, the national language of Malta, is spoken by approximately 500,...
research
07/09/2018

Detecting Levels of Depression in Text Based on Metrics

Depression is one of the most common and a major concern for society. Pr...
research
05/17/2020

LiSSS: A toy corpus of Spanish Literary Sentences for Emotions detection

In this work we present a new small data-set in Computational Creativity...
research
09/19/2019

A Corpus for Automatic Readability Assessment and Text Simplification of German

In this paper, we present a corpus for use in automatic readability asse...
research
06/13/2021

Cross-sentence Neural Language Models for Conversational Speech Recognition

An important research direction in automatic speech recognition (ASR) ha...
research
10/06/2017

The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments

This paper introduces the contents and the possible usage of the DIRHA-E...

Please sign up or login with your details

Forgot password? Click here to reset