Deep Learning for Prominence Detection in Children's Read Speech

04/12/2021
by Kamini Sabu, et al.

Expressive reading, considered the defining attribute of oral reading fluency, comprises the prosodic realization of phrasing and prominence. In the context of evaluating oral reading, it helps to establish the speaker's comprehension of the text. We consider a labeled dataset of children's reading recordings for the speaker-independent detection of prominent words using acoustic-prosodic and lexico-syntactic features. A previous well-tuned random forest ensemble predictor is replaced by an RNN sequence classifier to exploit potential context dependency across the longer utterance. Further, deep learning is applied to obtain word-level features from low-level acoustic contours of fundamental frequency, intensity and spectral shape in an end-to-end fashion. Performance comparisons are presented across the different feature types and across different feature learning architectures for prominent word prediction to draw insights wherever possible.
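The pipeline described above, word-level features derived from frame-level contours and then passed to a recurrent sequence classifier over the utterance, can be illustrated with a minimal sketch. This is not the authors' architecture: the function names, the four pooled features, the hidden size, and the random untrained weights are all assumptions made for illustration, and the hand-written pooling stands in for the learned feature extractor the abstract mentions.

```python
import math
import random


def word_features(f0, intensity, word_spans):
    """Pool frame-level contours into one feature vector per word.

    f0, intensity: per-frame values (hypothetical inputs, e.g. one per 10 ms).
    word_spans: (start_frame, end_frame) pairs, end exclusive (assumed format).
    """
    feats = []
    for start, end in word_spans:
        f0_seg = f0[start:end]
        en_seg = intensity[start:end]
        feats.append([
            sum(f0_seg) / len(f0_seg),   # mean F0 over the word
            max(f0_seg) - min(f0_seg),   # F0 range over the word
            sum(en_seg) / len(en_seg),   # mean intensity over the word
            float(end - start),          # word duration in frames
        ])
    return feats


def rnn_prominence(feats, hidden_size=3, seed=0):
    """Forward pass of a tiny Elman RNN with one sigmoid output per word.

    Weights are random and untrained, so the outputs only demonstrate the
    data flow. A real system would normalise the features and train the
    weights on labeled prominence data.
    """
    rng = random.Random(seed)
    n_in = len(feats[0])
    w_in = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
            for _ in range(hidden_size)]
    w_rec = [[rng.uniform(-0.1, 0.1) for _ in range(hidden_size)]
             for _ in range(hidden_size)]
    w_out = [rng.uniform(-0.1, 0.1) for _ in range(hidden_size)]

    h = [0.0] * hidden_size  # hidden state carries context from earlier words
    probs = []
    for x in feats:
        h = [
            math.tanh(
                sum(w * xi for w, xi in zip(w_in[j], x))
                + sum(w * hi for w, hi in zip(w_rec[j], h))
            )
            for j in range(hidden_size)
        ]
        logit = sum(w * hj for w, hj in zip(w_out, h))
        probs.append(1.0 / (1.0 + math.exp(-logit)))
    return probs


# Toy utterance of three words; the contour values are invented.
f0 = [100, 110, 120, 200, 220, 210, 105, 100]
intensity = [60, 61, 62, 70, 72, 71, 59, 58]
spans = [(0, 3), (3, 6), (6, 8)]
feats = word_features(f0, intensity, spans)
probs = rnn_prominence(feats)
```

Because the hidden state is carried from word to word, each word's score can depend on the words that precede it in the utterance, which is the kind of context dependency a per-word classifier such as a random forest does not model directly.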


