Visual speech recognition: aligning terminologies for better understanding

10/03/2017
by   Helen L Bear, et al.
0

We are at an exciting time for machine lipreading. Traditional research stemmed from the adaptation of audio recognition systems. But now, the computer vision community is also participating. This joining of two previously disparate areas with different perspectives on computer lipreading is creating opportunities for collaborations, but in doing so the literature is experiencing challenges in knowledge sharing due to multiple uses of terms and phrases and the range of methods for scoring results. In particular we highlight three areas with the intention to improve communication between those researching lipreading; the effects of interchanging between speech reading and lipreading; speaker dependence across train, validation, and test splits; and the use of accuracy, correctness, errors, and varying units (phonemes, visemes, words, and sentences) to measure system performance. We make recommendations as to how we can be more consistent.

READ FULL TEXT

page 2

page 3

page 4

research
09/06/2018

Deep Audio-Visual Speech Recognition

The goal of this work is to recognise phrases and sentences being spoken...
research
11/16/2016

Lip Reading Sentences in the Wild

The goal of this work is to recognise phrases and sentences being spoken...
research
10/03/2017

Which phoneme-to-viseme maps best improve visual-only computer lip-reading?

A critical assumption of all current visual speech recognition systems i...
research
09/03/2014

Visual Speech Recognition

Lip reading is used to understand or interpret speech without hearing it...
research
10/03/2017

Decoding visemes: improving machine lipreading (PhD thesis)

Machine lipreading (MLR) is speech recognition from visual cues and a ni...
research
10/03/2017

Finding phonemes: improving machine lip-reading

In machine lip-reading there is continued debate and research around the...
research
06/14/2023

Automated Speaker Independent Visual Speech Recognition: A Comprehensive Survey

Speaker-independent VSR is a complex task that involves identifying spok...

Please sign up or login with your details

Forgot password? Click here to reset