VizWiz-FewShot: Locating Objects in Images Taken by People With Visual Impairments

07/24/2022
by   Yu-Yun Tseng, et al.
0

We introduce a few-shot localization dataset originating from photographers who authentically were trying to learn about the visual content in the images they took. It includes nearly 10,000 segmentations of 100 categories in over 4,500 images that were taken by people with visual impairments. Compared to existing few-shot object detection and instance segmentation datasets, our dataset is the first to locate holes in objects (e.g., found in 12.3% of our segmentations), it shows objects that occupy a much larger range of sizes relative to the images, and text is over five times more common in our objects (e.g., found in 22.4% of our segmentations). Analysis of three modern few-shot localization algorithms demonstrates that they generalize poorly to our new dataset. The algorithms commonly struggle to locate objects with holes, very small and very large objects, and objects lacking text. To encourage a larger community to work on these unsolved challenges, we publicly share our annotated few-shot dataset at https://vizwiz.org .

READ FULL TEXT

page 2

page 7

page 19

page 21

page 23

page 25

page 27

page 28

research
01/12/2023

Salient Object Detection for Images Taken by People With Vision Impairments

Salient object detection is the task of producing a binary mask for an i...
research
11/24/2022

One-Shot General Object Localization

This paper presents a general one-shot object localization algorithm cal...
research
04/15/2023

Few-shot Camouflaged Animal Detection and Segmentation

Camouflaged object detection and segmentation is a new and challenging r...
research
12/22/2022

SupeRGB-D: Zero-shot Instance Segmentation in Cluttered Indoor Environments

Object instance segmentation is a key challenge for indoor robots naviga...
research
01/26/2023

Cut and Learn for Unsupervised Object Detection and Instance Segmentation

We propose Cut-and-LEaRn (CutLER), a simple approach for training unsupe...
research
06/28/2021

One-Shot Affordance Detection

Affordance detection refers to identifying the potential action possibil...
research
08/26/2021

Few-shot Visual Relationship Co-localization

In this paper, given a small bag of images, each containing a common but...

Please sign up or login with your details

Forgot password? Click here to reset