Combining Texture and Shape Cues for Object Recognition With Minimal Supervision

by Xingchao Peng, et al.

We present a novel approach to object classification and detection that requires minimal supervision and combines visual texture cues with shape information, both learned from freely available unlabeled web search results. The explosion of visual data on the web can potentially make visual examples of almost any object easily accessible via web search. Previous unsupervised methods have utilized either large-scale sources of texture cues from the web or shape information from data such as crowdsourced CAD models. We propose a two-stream deep learning framework that combines these cues: one stream learns visual texture cues from image search data, and the other learns rich shape information from 3D CAD models. To perform classification or detection on a novel image, the predictions of the two streams are combined using a late fusion scheme. We present experiments and visualizations for both tasks on the standard PASCAL VOC 2007 benchmark to demonstrate that texture and shape provide complementary information in our model. Our method outperforms previous web-image-based models, 3D-CAD-model-based approaches, and weakly supervised models.
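The late fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact fusion rule, so the version below assumes a weighted average of per-class softmax probabilities from the two streams; the function names (`softmax`, `late_fusion`, `predict`) and the `texture_weight` parameter are hypothetical and not taken from the paper.

```python
import math


def softmax(scores):
    """Convert raw class scores into probabilities (numerically stable)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def late_fusion(texture_scores, shape_scores, texture_weight=0.5):
    """Fuse per-class scores from two streams by weighted averaging.

    Assumption: the fusion rule is a convex combination of the two
    streams' softmax outputs. The paper may use a different rule.
    """
    if len(texture_scores) != len(shape_scores):
        raise ValueError("both streams must score the same set of classes")
    if not 0.0 <= texture_weight <= 1.0:
        raise ValueError("texture_weight must lie in [0, 1]")
    p_texture = softmax(texture_scores)
    p_shape = softmax(shape_scores)
    return [
        texture_weight * pt + (1.0 - texture_weight) * ps
        for pt, ps in zip(p_texture, p_shape)
    ]


def predict(texture_scores, shape_scores, texture_weight=0.5):
    """Return the index of the class with the highest fused probability."""
    fused = late_fusion(texture_scores, shape_scores, texture_weight)
    return max(range(len(fused)), key=fused.__getitem__)


# Hypothetical scores for three classes from each stream.
texture_scores = [2.0, 1.0, 0.0]
shape_scores = [0.0, 3.0, 0.0]
fused = late_fusion(texture_scores, shape_scores)
```

In this toy example the texture stream alone prefers class 0, while the fused prediction is class 1 because the shape stream is strongly confident in it. That is the kind of complementary behavior a late fusion scheme is meant to exploit.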




Learning Deep Object Detectors from 3D Models

Crowdsourced 3D CAD models are becoming easily accessible online, and ca...

Learning Shape and Texture Characteristics of CT Tree-in-Bud Opacities for CAD Systems

Although radiologists can employ CAD systems to characterize malignancie...

Planes vs. Chairs: Category-guided 3D shape learning without any 3D cues

We present a novel 3D shape reconstruction method which learns to predic...

Real-time texturing for 6D object instance detection from RGB Images

For object detection, the availability of color cues strongly influenc...

3D-FUTURE: 3D Furniture shape with TextURE

The 3D CAD shapes in current 3D benchmarks are mostly collected from onl...

Visual Vocabulary Learning and Its Application to 3D and Mobile Visual Search

In this technical report, we review related works and recent trends in v...

Shape Reconstruction and Recognition with Isolated Non-directional Cues

The paper investigates a hypothesis that our visual system groups visual...
