Recurrence without Recurrence: Stable Video Landmark Detection with Deep Equilibrium Models

04/02/2023
by   Paul Micaelli, et al.
0

Cascaded computation, whereby predictions are recurrently refined over several stages, has been a persistent theme throughout the development of landmark detection models. In this work, we show that the recently proposed Deep Equilibrium Model (DEQ) can be naturally adapted to this form of computation. Our Landmark DEQ (LDEQ) achieves state-of-the-art performance on the challenging WFLW facial landmark dataset, reaching 3.92 NME with fewer parameters and a training memory cost of 𝒪(1) in the number of recurrent modules. Furthermore, we show that DEQs are particularly suited for landmark detection in videos. In this setting, it is typical to train on still images due to the lack of labelled videos. This can lead to a “flickering” effect at inference time on video, whereby a model can rapidly oscillate between different plausible solutions across consecutive frames. By rephrasing DEQs as a constrained optimization, we emulate recurrence at inference time, despite not having access to temporal data at training time. This Recurrence without Recurrence (RwR) paradigm helps in reducing landmark flicker, which we demonstrate by introducing a new metric, normalized mean flicker (NMF), and contributing a new facial landmark video dataset (WFLW-V) targeting landmark uncertainty. On the WFLW-V hard subset made up of 500 videos, our LDEQ with RwR improves the NME and NMF by 10 and 13% respectively, compared to the strongest previously published model using a hand-tuned conventional filter.

READ FULL TEXT

page 2

page 4

page 5

page 6

page 14

research
10/26/2019

FAB: A Robust Facial Landmark Detection Framework for Motion-Blurred Videos

Recently, facial landmark detection algorithms have achieved remarkable ...
research
11/26/2016

Convolutional Experts Constrained Local Model for Facial Landmark Detection

Constrained Local Models (CLMs) are a well-established family of methods...
research
02/08/2023

Neonatal Face and Facial Landmark Detection from Video Recordings

This paper explores automated face and facial landmark detection of neon...
research
02/01/2021

Landmark Breaker: Obstructing DeepFake By Disturbing Landmark Extraction

The recent development of Deep Neural Networks (DNN) has significantly i...
research
02/06/2018

Every Smile is Unique: Landmark-Guided Diverse Smile Generation

Each smile is unique: one person surely smiles in different ways (e.g., ...
research
02/02/2021

U-LanD: Uncertainty-Driven Video Landmark Detection

This paper presents U-LanD, a framework for joint detection of key frame...
research
09/29/2022

Facial Landmark Predictions with Applications to Metaverse

This research aims to make metaverse characters more realistic by adding...

Please sign up or login with your details

Forgot password? Click here to reset