Word-level Persian Lipreading Dataset

04/08/2023
by   Javad Peymanfard, et al.
0

Lip-reading has made impressive progress in recent years, driven by advances in deep learning. Nonetheless, the prerequisite such advances is a suitable dataset. This paper provides a new in-the-wild dataset for Persian word-level lipreading containing 244,000 videos from approximately 1,800 speakers. We evaluated the state-of-the-art method in this field and used a novel approach for word-level lip-reading. In this method, we used the AV-HuBERT model for feature extraction and obtained significantly better performance on our dataset.

READ FULL TEXT
research
02/27/2022

A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning

Large datasets as required for deep learning of lip reading do not exist...
research
06/21/2021

ROPE: Reading Order Equivariant Positional Encoding for Graph-based Document Information Extraction

Natural reading orders of words are crucial for information extraction f...
research
10/16/2018

LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild

Large-scale datasets have successively proven their fundamental importan...
research
12/28/2020

Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention

In this paper, we propose a novel deep learning architecture to improvin...
research
08/14/2019

A Cascade Sequence-to-Sequence Model for Chinese Mandarin Lip Reading

Lip reading aims at decoding texts from the movement of a speaker's mout...
research
02/19/2018

Deep Echo State Networks for Diagnosis of Parkinson's Disease

In this paper, we introduce a novel approach for diagnosis of Parkinson'...
research
10/03/2017

Finding phonemes: improving machine lip-reading

In machine lip-reading there is continued debate and research around the...

Please sign up or login with your details

Forgot password? Click here to reset