Weak Multi-View Supervision for Surface Mapping Estimation

05/04/2021
by Nishant Rai, et al.
We propose a weakly-supervised multi-view learning approach to learn category-specific surface mapping without dense annotations. We learn the underlying surface geometry of common categories, such as human faces, cars, and airplanes, given instances from those categories. While traditional approaches solve this problem using extensive supervision in the form of pixel-level annotations, we take advantage of the fact that pixel-level UV and mesh predictions can be combined with 3D reprojections to form consistency cycles. Exploiting these cycles, we establish a dense correspondence mapping between image pixels and the mesh, which acts as a self-supervisory signal and in turn improves our overall estimates. Our approach leverages information from multiple views of the object to establish additional consistency cycles, thus improving surface mapping understanding without the need for explicit annotations. We also propose the use of deformation fields to predict an instance-specific mesh. Given the lack of datasets providing multiple images of similar object instances from different viewpoints, we generate and release a multi-view ShapeNet Cars and Airplanes dataset created by rendering ShapeNet meshes using a 360-degree camera trajectory around the mesh. For the human faces category, we process and adapt an existing dataset to a multi-view setup. Through experimental evaluations, we show that, at test time, our method can generate accurate variations away from the mean shape, is multi-view consistent, and performs comparably to fully supervised approaches.
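The consistency cycle described in the abstract (pixel, predicted UV, 3D surface point, reprojected pixel) can be illustrated with a small sketch. This is not the authors' implementation: the unit sphere standing in for the category mesh, the pinhole camera orbiting on a 360-degree circle, and every function name here (`uv_to_sphere`, `project`, `cycle_loss`, `multi_view_loss`) are assumptions chosen only to make the idea concrete. Visibility and occlusion are ignored.

```python
import math

def uv_to_sphere(u, v):
    """Map UV in [0, 1]^2 to a point on a unit sphere (stand-in for a template mesh)."""
    theta = 2.0 * math.pi * u   # azimuth around the y-axis
    phi = math.pi * v           # polar angle measured from +y
    return (math.sin(phi) * math.cos(theta),
            math.cos(phi),
            math.sin(phi) * math.sin(theta))

def project(point, azimuth_deg, radius=3.0, focal=1.0):
    """Pinhole projection from a camera on a circle of given radius, looking at the origin."""
    a = math.radians(azimuth_deg)
    cam = (radius * math.sin(a), 0.0, radius * math.cos(a))
    right = (math.cos(a), 0.0, -math.sin(a))
    up = (0.0, 1.0, 0.0)
    forward = (-math.sin(a), 0.0, -math.cos(a))
    d = tuple(p - c for p, c in zip(point, cam))
    dot = lambda x, y: sum(i * j for i, j in zip(x, y))
    depth = dot(d, forward)  # positive for any point on the unit sphere when radius > 1
    return (focal * dot(d, right) / depth, focal * dot(d, up) / depth)

def cycle_loss(pixel, uv, azimuth_deg):
    """Squared distance between a pixel and the reprojection of its predicted UV."""
    x, y = project(uv_to_sphere(*uv), azimuth_deg)
    return (pixel[0] - x) ** 2 + (pixel[1] - y) ** 2

def multi_view_loss(pixels_by_view, uv, azimuths_deg):
    """Sum the cycle loss over views that observe the same surface point."""
    return sum(cycle_loss(p, uv, a) for p, a in zip(pixels_by_view, azimuths_deg))

# Hypothetical usage: 8 views evenly spaced on a 360-degree trajectory.
azimuths = [i * 360.0 / 8 for i in range(8)]
true_uv = (0.25, 0.5)
observed = [project(uv_to_sphere(*true_uv), a) for a in azimuths]
consistent = multi_view_loss(observed, true_uv, azimuths)
inconsistent = multi_view_loss(observed, (0.30, 0.5), azimuths)
```

In this toy setup a UV prediction that agrees with the observed pixels in every view yields a loss of essentially zero, while a slightly wrong UV is penalized in each view. That is the sense in which additional views supply additional supervisory signal without pixel-level annotations.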
