Lip-reading with Densely Connected Temporal Convolutional Networks

09/29/2020
by   Pingchuan Ma, et al.

In this work, we present the Densely Connected Temporal Convolutional Network (DC-TCN) for lip-reading of isolated words. Although Temporal Convolutional Networks (TCNs) have recently demonstrated great potential in many vision tasks, their receptive fields are not dense enough to model the complex temporal dynamics in lip-reading scenarios. To address this problem, we introduce dense connections into the network to capture more robust temporal features. Moreover, our approach utilises the Squeeze-and-Excitation block, a lightweight attention mechanism, to further enhance the model's classification power. Without bells and whistles, our DC-TCN method achieves 88.36% accuracy on the Lip Reading in the Wild (LRW) dataset and 43.65% on the LRW-1000 dataset, surpassing all baseline methods and setting a new state of the art on both datasets.
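
The two ingredients named in the abstract, dense connections over dilated temporal convolutions and a Squeeze-and-Excitation gate, can be illustrated with a small pure-Python sketch. This is not the authors' implementation: the kernel size, dilations, growth rate, reduction ratio, constant weights, and all function names (`dilated_conv1d`, `dense_block`, `squeeze_excite`, `receptive_field`) are assumptions chosen only to keep the example short and runnable.

```python
import math

def dilated_conv1d(x, weight, dilation):
    """Zero-padded ('same') dilated 1D convolution followed by ReLU.

    x:      C_in lists of T floats
    weight: C_out x C_in x K nested lists (K odd)
    """
    T = len(x[0])
    half = len(weight[0][0]) // 2
    out = []
    for w_out in weight:
        row = []
        for t in range(T):
            acc = 0.0
            for c_in, w_k in enumerate(w_out):
                for k, w in enumerate(w_k):
                    src = t + (k - half) * dilation
                    if 0 <= src < T:  # positions outside the sequence count as zero
                        acc += w * x[c_in][src]
            row.append(max(0.0, acc))
        out.append(row)
    return out

def dense_block(x, layer_weights, dilations):
    """Each layer sees the channel-wise concatenation of all earlier outputs."""
    features = list(x)
    for weight, d in zip(layer_weights, dilations):
        features = features + dilated_conv1d(features, weight, d)
    return features

def squeeze_excite(x, w1, w2):
    """Channel gating: w1 is (C // r) x C, w2 is C x (C // r)."""
    z = [sum(ch) / len(ch) for ch in x]  # squeeze: average over time
    h = [max(0.0, sum(w * v for w, v in zip(row, z))) for row in w1]
    s = [1.0 / (1.0 + math.exp(-sum(w * v for w, v in zip(row, h)))) for row in w2]
    return [[g * v for v in ch] for g, ch in zip(s, x)]  # rescale each channel

def receptive_field(kernel_size, dilations):
    """Receptive field of stacked stride-1 dilated convolutions."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)

# Hypothetical sizes: 2 input channels, 8 time steps, growth rate 2, kernel 3.
T, C_IN, GROWTH, K = 8, 2, 2, 3
DILATIONS = [1, 2, 4]
x = [[float(t) for t in range(T)], [1.0] * T]

# Layer i takes C_IN + i * GROWTH input channels because of the concatenation.
layer_weights = [
    [[[0.1] * K for _ in range(C_IN + i * GROWTH)] for _ in range(GROWTH)]
    for i in range(len(DILATIONS))
]
features = dense_block(x, layer_weights, DILATIONS)

C = len(features)
REDUCTION = 2
w1 = [[0.05] * C for _ in range(C // REDUCTION)]
w2 = [[0.1] * (C // REDUCTION) for _ in range(C)]
gated = squeeze_excite(features, w1, w2)

print(len(features), "channels after the dense block")
print("receptive field:", receptive_field(K, DILATIONS))
```

Because every layer's output is concatenated rather than replaced, the final feature map keeps the responses computed at each intermediate dilation, which is one way to read the abstract's point about denser temporal coverage; the gate then reweights those channels with a single scalar per channel.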

research
09/03/2022

Training Strategies for Improved Lip-reading

Several training strategies and temporal models have been recently propo...
research
01/23/2020

Lipreading using Temporal Convolutional Networks

Lip-reading has attracted a lot of research attention lately thanks to a...
research
11/10/2018

Densely Connected Attention Propagation for Reading Comprehension

We propose DecaProp (Densely Connected Attention Propagation), a new den...
research
09/13/2017

Reading Scene Text with Attention Convolutional Sequence Modeling

Reading text in the wild is a challenging task in the field of computer ...
research
12/19/2021

Investigation of Densely Connected Convolutional Networks with Domain Adversarial Learning for Noise Robust Speech Recognition

We investigate densely connected convolutional networks (DenseNets) and ...
research
01/08/2019

Stable Electromyographic Sequence Prediction During Movement Transitions using Temporal Convolutional Networks

Transient muscle movements influence the temporal structure of myoelectr...
research
06/05/2017

Submanifold Sparse Convolutional Networks

Convolutional networks are the de-facto standard for analysing spatio-tem...