Zero-Shot Sketch Based Image Retrieval using Graph Transformer

01/25/2022
by   Sumrit Gupta, et al.
1

The performance of a zero-shot sketch-based image retrieval (ZS-SBIR) task is primarily affected by two challenges. The substantial domain gap between image and sketch features needs to be bridged, while at the same time the side information has to be chosen tactfully. Existing literature has shown that varying the semantic side information greatly affects the performance of ZS-SBIR. To this end, we propose a novel graph transformer based zero-shot sketch-based image retrieval (GTZSR) framework for solving ZS-SBIR tasks which uses a novel graph transformer to preserve the topology of the classes in the semantic space and propagates the context-graph of the classes within the embedding features of the visual space. To bridge the domain gap between the visual features, we propose minimizing the Wasserstein distance between images and sketches in a learned domain-shared space. We also propose a novel compatibility loss that further aligns the two visual domains by bridging the domain gap of one class with respect to the domain gap of all other classes in the training set. Experimental results obtained on the extended Sketchy, TU-Berlin, and QuickDraw datasets exhibit sharp improvements over the existing state-of-the-art methods in both ZS-SBIR and generalized ZS-SBIR.

READ FULL TEXT
research
01/17/2023

Distribution Aligned Feature Clustering for Zero-Shot Sketch-Based Image Retrieval

Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) is a challenging cross-...
research
01/17/2022

BDA-SketRet: Bi-Level Domain Adaptation for Zero-Shot SBIR

The efficacy of zero-shot sketch-based image retrieval (ZS-SBIR) models ...
research
03/22/2020

Progressive Domain-Independent Feature Decomposition Network for Zero-Shot Sketch-Based Image Retrieval

Zero-shot sketch-based image retrieval (ZS-SBIR) is a specific cross-mod...
research
04/20/2021

CrossATNet - A Novel Cross-Attention Based Framework for Sketch-Based Image Retrieval

We propose a novel framework for cross-modal zero-shot learning (ZSL) in...
research
02/24/2020

Sketchformer: Transformer-based Representation for Sketched Structure

Sketchformer is a novel transformer-based representation for encoding fr...
research
01/18/2020

Stacked Adversarial Network for Zero-Shot Sketch based Image Retrieval

Conventional approaches to Sketch-Based Image Retrieval (SBIR) assume th...
research
04/06/2019

Doodle to Search: Practical Zero-Shot Sketch-based Image Retrieval

In this paper, we investigate the problem of zero-shot sketch-based imag...

Please sign up or login with your details

Forgot password? Click here to reset