Sketch-based Video Object Localization

04/02/2023
by   Sangmin Woo, et al.
0

We introduce Sketch-based Video Object Localization (SVOL), a new task aimed at localizing spatio-temporal object boxes in video queried by the input sketch. We first outline the challenges in the SVOL task and build the Sketch-Video Attention Network (SVANet) with the following design principles: (i) to consider temporal information of video and bridge the domain gap between sketch and video; (ii) to accurately identify and localize multiple objects simultaneously; (iii) to handle various styles of sketches; (iv) to be classification-free. In particular, SVANet is equipped with a Cross-modal Transformer that models the interaction between learnable object tokens, query sketch, and video through attention operations, and learns upon a per-frame set macthing strategy that enables frame-wise prediction while utilizing global video context. We evaluate SVANet on a newly curated SVOL dataset. By design, SVANet successfully learns the mapping between the query sketch and video objects, achieving state-of-the-art results on the SVOL benchmark. We further confirm the effectiveness of SVANet via extensive ablation studies and visualizations. Lastly, we demonstrate its zero-shot capability on unseen datasets and novel categories, suggesting its high scalability in real-world applications.

READ FULL TEXT

page 1

page 2

page 9

page 11

page 14

page 16

page 17

research
08/14/2020

Sketch-Guided Object Localization in Natural Images

We introduce the novel problem of localizing all the instances of an obj...
research
03/15/2023

Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch

In this work, we investigate the problem of sketch-based object localiza...
research
05/11/2019

Deep Zero-Shot Learning for Scene Sketch

We introduce a novel problem of scene sketch zero-shot learning (SSZSL),...
research
10/10/2021

Sketch Me A Video

Video creation has been an attractive yet challenging task for artists t...
research
09/24/2021

Localizing Infinity-shaped fishes: Sketch-guided object localization in the wild

This work investigates the problem of sketch-guided object localization ...
research
12/01/2022

Multimodal Query-guided Object Localization

Consider a scenario in one-shot query-guided object localization where n...
research
08/11/2023

Uncertainty-Aware Cross-Modal Transfer Network for Sketch-Based 3D Shape Retrieval

In recent years, sketch-based 3D shape retrieval has attracted growing a...

Please sign up or login with your details

Forgot password? Click here to reset