SketchParse : Towards Rich Descriptions for Poorly Drawn Sketches using Multi-Task Hierarchical Deep Networks

The ability to semantically interpret hand-drawn line sketches, although very challenging, can pave the way for novel applications in multimedia. We propose SketchParse, the first deep-network architecture for fully automatic parsing of freehand object sketches. SketchParse is configured as a two-level fully convolutional network. The first level contains shared layers common to all object categories. The second level contains a number of expert sub-networks. Each expert specializes in parsing sketches from object categories which contain structurally similar parts. Effectively, the two-level configuration enables our architecture to scale up efficiently as additional categories are added. We introduce a router layer which (i) relays sketch features from the shared layers to the correct expert and (ii) eliminates the need to manually specify the object category during inference. To bypass laborious part-level annotation, we sketchify photos from semantic object-part image datasets and use them for training. Our architecture also incorporates object pose prediction as a novel auxiliary task which boosts overall performance while providing supplementary information regarding the sketch. We demonstrate SketchParse's abilities (i) on two challenging large-scale sketch datasets, (ii) in parsing unseen, semantically related object categories, and (iii) in improving fine-grained sketch-based image retrieval. As a novel application, we also outline how SketchParse's output can be used to generate caption-style descriptions for hand-drawn sketches.
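
The two-level control flow described in the abstract (shared layers, then a router, then exactly one expert) can be illustrated with a toy sketch. This is not the authors' implementation: there is no convolutional network here, and every name, feature, weight, and part vocabulary below (`shared_layers`, `route`, `EXPERTS`, `ROUTER_WEIGHTS`, `ROUTER_BIAS`) is a hypothetical stand-in chosen only to show how a router can pick an expert without the caller naming the object category.

```python
from typing import Callable, List, Sequence, Tuple

Features = List[float]
Sketch = Sequence[Sequence[int]]  # binary raster: 1 = ink, 0 = background


def shared_layers(sketch: Sketch) -> Features:
    """Toy stand-in for the shared first level (assumption: two hand-made features)."""
    height = len(sketch)
    width = len(sketch[0])
    ink = sum(sum(row) for row in sketch)
    # Feature 0: fraction of inked pixels; feature 1: width / height.
    return [ink / (height * width), width / height]


def make_expert(part_names: List[str]) -> Callable[[Features], List[str]]:
    """Toy expert: only reports the part vocabulary it would label pixels with."""
    def expert(features: Features) -> List[str]:
        return list(part_names)
    return expert


# Hypothetical experts, each covering categories with structurally similar parts.
EXPERTS = [
    make_expert(["head", "body", "leg", "tail"]),
    make_expert(["body", "wheel", "window"]),
]

# Hypothetical router parameters: one linear score per expert.
ROUTER_WEIGHTS = [[0.0, -1.0], [0.0, 1.0]]
ROUTER_BIAS = [1.0, -1.0]


def route(features: Features) -> int:
    """Return the index of the highest-scoring expert (first index wins ties)."""
    scores = [
        sum(w * f for w, f in zip(row, features)) + b
        for row, b in zip(ROUTER_WEIGHTS, ROUTER_BIAS)
    ]
    return max(range(len(scores)), key=scores.__getitem__)


def parse(sketch: Sketch) -> Tuple[int, List[str]]:
    """Shared features -> router -> a single expert; no category label is passed in."""
    features = shared_layers(sketch)
    expert_index = route(features)
    return expert_index, EXPERTS[expert_index](features)


if __name__ == "__main__":
    wide = [[1, 0, 0, 0], [0, 1, 0, 0]]   # 4 wide, 2 tall
    tall = [[1], [0], [1], [0]]           # 1 wide, 4 tall
    print(parse(wide))
    print(parse(tall))
```

In this toy, adding a category group means appending one entry to `EXPERTS` and one row to the router parameters while `shared_layers` stays unchanged, which mirrors the scaling argument made in the abstract.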

research
08/15/2019

SFSegNet: Parse Freehand Sketches using Deep Fully Convolutional Networks

Parsing sketches via semantic segmentation is attractive but challenging...
research
08/07/2018

Universal Perceptual Grouping

In this work we aim to develop a universal sketch grouper. That is, a gr...
research
08/14/2020

Sketch-Guided Object Localization in Natural Images

We introduce the novel problem of localizing all the instances of an obj...
research
03/23/2023

CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not

In this paper, we leverage CLIP for zero-shot sketch based image retriev...
research
06/20/2020

Semantically Tied Paired Cycle Consistency for Any-Shot Sketch-based Image Retrieval

Low-shot sketch-based image retrieval is an emerging task in computer vi...
research
03/04/2017

Looking at Outfit to Parse Clothing

This paper extends fully-convolutional neural networks (FCN) for the clo...
research
02/26/2020

Learning to Shadow Hand-drawn Sketches

We present a fully automatic method to generate detailed and accurate ar...