The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale

11/02/2018
by   Alina Kuznetsova, et al.
4

We present Open Images V4, a dataset of 9.2M images with unified annotations for image classification, object detection and visual relationship detection. The images have a Creative Commons Attribution license that allows to share and adapt the material, and they have been collected from Flickr without a predefined list of class names or tags, leading to natural class statistics and avoiding an initial design bias. Open Images V4 offers large scale across several dimensions: 30.1M image-level labels for 19.8k concepts, 15.4M bounding boxes for 600 object classes, and 375k visual relationship annotations involving 57 classes. For object detection in particular, we provide 15x more bounding boxes than the next largest datasets (15.4M boxes on 1.9M images). The images often show complex scenes with several objects (8 annotated objects per image on average). We annotated visual relationships between them, which support visual relationship detection, an emerging task that requires structured reasoning. We provide in-depth comprehensive statistics about the dataset, we validate the quality of the annotations, and we study how the performance of many modern models evolves with increasing amounts of training data. We hope that the scale, quality, and variety of Open Images V4 will foster further research and innovation even beyond the areas of image classification, object detection, and visual relationship detection.

READ FULL TEXT

page 2

page 3

page 4

page 6

page 7

page 12

page 13

page 15

research
10/03/2019

360-Indoor: Towards Learning Real-World Objects in 360° Indoor Equirectangular Images

While there are several widely used object detection datasets, current c...
research
12/14/2020

The Open Brands Dataset: Unified brand detection and recognition at scale

Intellectual property protection(IPP) have received more and more attent...
research
07/21/2021

A Public Ground-Truth Dataset for Handwritten Circuit Diagram Images

The development of digitization methods for line drawings (especially in...
research
05/28/2019

An Analysis of Object Embeddings for Image Retrieval

We present an analysis of embeddings extracted from different pre-traine...
research
04/07/2023

V3Det: Vast Vocabulary Visual Detection Dataset

Recent advances in detecting arbitrary objects in the real world are tra...
research
08/06/2020

IIIT-AR-13K: A New Dataset for Graphical Object Detection in Documents

We introduce a new dataset for graphical object detection in business do...
research
07/09/2020

VisImages: A Large-scale, High-quality Image Corpus in Visualization Publications

Images in visualization publications contain rich information, such as n...

Please sign up or login with your details

Forgot password? Click here to reset