Supervision-by-Registration: An Unsupervised Approach to Improve the Precision of Facial Landmark Detectors

07/03/2018
by   Xuanyi Dong, et al.

In this paper, we present supervision-by-registration, an unsupervised approach to improve the precision of facial landmark detectors on both images and video. Our key observation is that the detections of the same landmark in adjacent frames should be coherent with registration, i.e., optical flow. Interestingly, the coherency of optical flow is a source of supervision that does not require manual labeling, and can be leveraged during detector training. For example, we can enforce in the training loss function that a detected landmark at frame_{t-1}, followed by optical flow tracking from frame_{t-1} to frame_t, should coincide with the location of the detection at frame_t. Essentially, supervision-by-registration augments the training loss function with a registration loss, thus training the detector to produce output that is not only close to the annotations in labeled images, but also consistent with registration on large amounts of unlabeled video. End-to-end training with the registration loss is made possible by a differentiable Lucas-Kanade operation, which computes optical flow registration in the forward pass, and back-propagates gradients that encourage temporal coherency in the detector. The output of our method is a more precise image-based facial landmark detector, which can be applied to single images or video. With supervision-by-registration, we demonstrate (1) improvements in facial landmark detection on both images (300W, AFLW) and video (300VW, YouTube-Celebrities), and (2) significant reduction of jittering in video detections.
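
The registration loss described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the mean Euclidean distance, and the weighting factor are assumptions made for illustration, and the hypothetical `flow_at` callable stands in for the differentiable Lucas-Kanade operation, which a real training setup would need in order to back-propagate gradients.

```python
import math
from typing import Callable, List, Tuple

Point = Tuple[float, float]

def registration_loss(
    prev_landmarks: List[Point],
    curr_landmarks: List[Point],
    flow_at: Callable[[float, float], Point],
) -> float:
    """Mean distance between tracked and detected landmarks.

    prev_landmarks: detections at frame t-1.
    curr_landmarks: detections at frame t.
    flow_at: hypothetical stand-in for optical flow; returns the (dx, dy)
             displacement from frame t-1 to frame t at a given location.
    """
    if len(prev_landmarks) != len(curr_landmarks) or not prev_landmarks:
        raise ValueError("landmark lists must be non-empty and equal in length")
    total = 0.0
    for (px, py), (cx, cy) in zip(prev_landmarks, curr_landmarks):
        dx, dy = flow_at(px, py)
        # Track the frame t-1 detection forward to frame t.
        tracked_x, tracked_y = px + dx, py + dy
        # Penalize disagreement between the tracked point and the detection.
        total += math.hypot(cx - tracked_x, cy - tracked_y)
    return total / len(prev_landmarks)

def total_loss(detection_loss: float, reg_loss: float, weight: float = 0.5) -> float:
    """Supervised detection loss augmented with the registration loss.

    The weight value is an assumption, not taken from the paper.
    """
    return detection_loss + weight * reg_loss

# Toy usage: a constant flow field that shifts every point by (2, -1).
constant_flow = lambda x, y: (2.0, -1.0)
prev = [(10.0, 10.0), (20.0, 5.0)]
coherent = [(12.0, 9.0), (22.0, 4.0)]   # agrees with the flow
jittery = [(15.0, 13.0), (22.0, 4.0)]   # first landmark is off by (3, 4)
loss_coherent = registration_loss(prev, coherent, constant_flow)
loss_jittery = registration_loss(prev, jittery, constant_flow)
```

Detections that move consistently with the flow incur no penalty, while a detection that jitters away from its tracked position increases the loss. No manual labels are involved in either case.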

research
01/25/2021

Supervision by Registration and Triangulation for Landmark Detection

We present Supervision by Registration and Triangulation (SRT), an unsup...
research
11/27/2018

Multiview Supervision By Registration

This paper presents a semi-supervised learning framework to train a keyp...
research
08/20/2016

Back to Basics: Unsupervised Learning of Optical Flow via Brightness Constancy and Motion Smoothness

Recently, convolutional networks (convnets) have proven useful for predi...
research
11/26/2016

Convolutional Experts Constrained Local Model for Facial Landmark Detection

Constrained Local Models (CLMs) are a well-established family of methods...
research
03/26/2016

Video Interpolation using Optical Flow and Laplacian Smoothness

Non-rigid video interpolation is a common computer vision task. In this ...
research
10/28/2020

GloFlow: Global Image Alignment for Creation of Whole Slide Images for Pathology from Video

The application of deep learning to pathology assumes the existence of d...
research
08/02/2019

Adaloss: Adaptive Loss Function for Landmark Localization

Landmark localization is a challenging problem in computer vision with a...
