On Learning Semantic Representations for Million-Scale Free-Hand Sketches

07/07/2020
by   Peng Xu, et al.

In this paper, we study learning semantic representations for million-scale free-hand sketches. This is highly challenging because sketches are diverse, sparse, abstract, and noisy, traits that are unique to the sketch domain. We propose a dual-branch CNN-RNN network architecture to represent sketches, which simultaneously encodes both the static and temporal patterns of sketch strokes. Based on this architecture, we further explore learning sketch-oriented semantic representations in two challenging yet practical settings, i.e., hashing retrieval and zero-shot recognition on million-scale sketches. Specifically, we use our dual-branch architecture as a universal representation framework to design two sketch-specific deep models: (i) We propose a deep hashing model for sketch retrieval, where a novel hashing loss is specifically designed to accommodate both the abstract and messy traits of sketches. (ii) We propose a deep embedding model for sketch zero-shot recognition, for which we collect a large-scale edge-map dataset and extract a set of semantic vectors from the edge-maps as the semantic knowledge for sketch zero-shot domain alignment. Both deep models are evaluated in comprehensive experiments on million-scale sketches and outperform state-of-the-art competitors.
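
To make the hashing-retrieval setting more concrete, the following is a minimal sketch of how binary codes are commonly used at query time: real-valued features are binarized by sign, and gallery items are ranked by Hamming distance to the query. It is a generic illustration only, not the model or the hashing loss proposed in the paper. The feature values, item names, 8-bit code length, and the helper functions `binarize`, `hamming`, and `retrieve` are all hypothetical.

```python
from typing import List, Sequence, Tuple


def binarize(features: Sequence[float]) -> int:
    """Pack the signs of a real-valued feature vector into an integer code.

    Assumption: positive values map to bit 1, everything else to bit 0.
    """
    code = 0
    for value in features:
        code = (code << 1) | (1 if value > 0 else 0)
    return code


def hamming(a: int, b: int) -> int:
    """Count the bit positions in which two codes differ."""
    return bin(a ^ b).count("1")


def retrieve(
    query: int, gallery: List[Tuple[str, int]], top_k: int = 3
) -> List[Tuple[str, int]]:
    """Return the top_k gallery items closest to the query in Hamming distance."""
    scored = [(name, hamming(query, code)) for name, code in gallery]
    scored.sort(key=lambda pair: pair[1])  # stable sort keeps gallery order on ties
    return scored[:top_k]


# Hypothetical 8-dimensional features standing in for a network's output.
gallery_features = {
    "cat_01": [0.9, -0.2, 0.4, -0.7, 0.1, 0.3, -0.5, 0.8],
    "cat_02": [0.7, -0.1, 0.5, -0.9, -0.2, 0.2, -0.4, 0.6],
    "bus_01": [-0.8, 0.6, -0.3, 0.5, 0.9, -0.1, 0.7, -0.2],
}
gallery = [(name, binarize(f)) for name, f in gallery_features.items()]

query_code = binarize([0.8, -0.3, 0.2, -0.6, 0.2, 0.4, -0.1, 0.5])
results = retrieve(query_code, gallery, top_k=2)
print(results)  # [('cat_01', 0), ('cat_02', 1)]
```

Because the comparison reduces to an XOR and a bit count, ranking stays cheap even for very large galleries, which is the usual motivation for hashing at million scale.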

research
04/04/2018

SketchMate: Deep Hashing for Million-Scale Human Sketch Retrieval

We propose a deep hashing framework for sketch retrieval that, for the f...
research
05/11/2019

Deep Zero-Shot Learning for Scene Sketch

We introduce a novel problem of scene sketch zero-shot learning (SSZSL),...
research
02/03/2020

Deep Self-Supervised Representation Learning for Free-Hand Sketch

In this paper, we tackle for the first time, the problem of self-supervi...
research
03/06/2018

Zero-Shot Sketch-Image Hashing

Recent studies show that large-scale sketch-based image retrieval (SBIR)...
research
02/11/2022

WAD-CMSN: Wasserstein Distance based Cross-Modal Semantic Network for Zero-Shot Sketch-Based Image Retrieval

Zero-shot sketch-based image retrieval (ZSSBIR), as a popular studied br...
research
08/12/2020

A Zero-Shot Sketch-based Inter-Modal Object Retrieval Scheme for Remote Sensing Images

Conventional existing retrieval methods in remote sensing (RS) are often...
research
06/03/2022

Style-Content Disentanglement in Language-Image Pretraining Representations for Zero-Shot Sketch-to-Image Synthesis

In this work, we propose and validate a framework to leverage language-i...