Generative Model with Coordinate Metric Learning for Object Recognition Based on 3D Models

05/24/2017
by   Yida Wang, et al.
0

Given large amount of real photos for training, Convolutional neural network shows excellent performance on object recognition tasks. However, the process of collecting data is so tedious and the background are also limited which makes it hard to establish a perfect database. In this paper, our generative model trained with synthetic images rendered from 3D models reduces the workload of data collection and limitation of conditions. Our structure is composed of two sub-networks: semantic foreground object reconstruction network based on Bayesian inference and classification network based on multi-triplet cost function for avoiding over-fitting problem on monotone surface and fully utilizing pose information by establishing sphere-like distribution of descriptors in each category which is helpful for recognition on regular photos according to poses, lighting condition, background and category information of rendered images. Firstly, our conjugate structure called generative model with metric learning utilizing additional foreground object channels generated from Bayesian rendering as the joint of two sub-networks. Multi-triplet cost function based on poses for object recognition are used for metric learning which makes it possible training a category classifier purely based on synthetic data. Secondly, we design a coordinate training strategy with the help of adaptive noises acting as corruption on input images to help both sub-networks benefit from each other and avoid inharmonious parameter tuning due to different convergence speed of two sub-networks. Our structure achieves the state of the art accuracy of over 50% on ShapeNet database with data migration obstacle from synthetic images to real photos. This pipeline makes it applicable to do recognition on real images only based on 3D models.

READ FULL TEXT

page 1

page 3

page 6

page 9

page 11

page 12

research
11/26/2018

Unsupervised 3D Shape Learning from Image Collections in the Wild

We present a method to learn the 3D surface of objects directly from a c...
research
05/10/2021

BIM Hyperreality: Data Synthesis Using BIM and Hyperrealistic Rendering for Deep Learning

Deep learning is expected to offer new opportunities and a new paradigm ...
research
11/14/2017

TripletGAN: Training Generative Model with Triplet Loss

As an effective way of metric learning, triplet loss has been widely use...
research
12/17/2015

Deep Active Object Recognition by Joint Label and Action Prediction

An active object recognition system has the advantage of being able to a...
research
08/27/2019

Large Scale Landmark Recognition via Deep Metric Learning

This paper presents a novel approach for landmark recognition in images ...
research
10/04/2022

On Background Bias in Deep Metric Learning

Deep Metric Learning trains a neural network to map input images to a lo...
research
05/11/2021

ORCEA: Object Recognition by Continuous Evidence Assimilation

ORCEA is a novel object recognition method applicable for objects descri...

Please sign up or login with your details

Forgot password? Click here to reset