A Novel Interpretable and Generalizable Re-synchronization Model for Cued Speech based on a Multi-Cuer Corpus

06/05/2023
by Lufei Gao, et al.
Cued Speech (CS) is a multi-modal visual coding system that combines lip reading with several hand cues at the phonetic level to make spoken language visible to the hearing impaired. Previous studies addressed the asynchrony between lip and hand movements with a cuer-dependent piecewise linear model for English and French CS (a cuer is a person who performs Cued Speech). In this work, we propose three statistical measures on the lip stream to build an interpretable and generalizable model for predicting hand preceding time (HPT), which achieves cuer independence through proper normalization. In particular, we build the first Mandarin CS corpus, comprising annotated videos from five speakers: three normal-hearing and two hearing-impaired individuals. Using this corpus, we show that the hand preceding phenomenon exists in Mandarin CS production, with significant differences between normal-hearing and hearing-impaired cuers. Extensive experiments demonstrate that our model outperforms the baseline and the previous state-of-the-art methods.
