Learning Landmarks from Unaligned Data using Image Translation

07/03/2019
by   Tomas Jakab, et al.
2

We introduce a method for learning landmark detectors from unlabelled video frames and unpaired labels. This allows us to learn a detector from a large collection of raw videos given only a few example annotations harvested from existing data or motion capture. We achieve this by formulating the landmark detection task as one of image translation, learning to map an image of the object to an image of its landmarks, represented as a skeleton. The advantage is that this translation problem can then be tackled by CycleGAN. However, we show that a naive application of CycleGAN confounds appearance and pose information, with suboptimal keypoint detection performance. We solve this problem by introducing an analytical and differentiable renderer for the skeleton image so that no appearance information can be leaked in the skeleton. Then, since cycle consistency requires to reconstruct the input image from the skeleton, we supply the appearance information thus removed by conditioning the generator with a second image of the same object (e.g. another frame from a video). Furthermore, while CycleGAN uses two cycle consistency constraints, we show that the second one is detrimental in this application and we discard it, significantly simplifying the model. We show that these modifications improve the quality of the learned detector leading to state-of-the-art unsupervised landmark detection performance in a number of challenging human pose and facial landmark detection benchmarks.

READ FULL TEXT

page 13

page 14

page 15

page 16

page 17

page 18

page 19

page 20

research
01/26/2020

Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos

Unsupervised landmark learning is the task of learning semantic keypoint...
research
04/08/2021

Generative Landmarks

We propose a general purpose approach to detect landmarks with improved ...
research
06/20/2018

Conditional Image Generation for Learning the Structure of Visual Objects

In this paper, we consider the problem of learning landmarks for object ...
research
06/29/2020

Unsupervised Landmark Learning from Unpaired Data

Recent attempts for unsupervised landmark learning leverage synthesized ...
research
02/02/2021

U-LanD: Uncertainty-Driven Video Landmark Detection

This paper presents U-LanD, a framework for joint detection of key frame...
research
05/31/2022

From Keypoints to Object Landmarks via Self-Training Correspondence: A novel approach to Unsupervised Landmark Discovery

This paper proposes a novel paradigm for the unsupervised learning of ob...
research
12/02/2020

Mutual Information Maximization on Disentangled Representations for Differential Morph Detection

In this paper, we present a novel differential morph detection framework...

Please sign up or login with your details

Forgot password? Click here to reset