ImageSubject: A Large-scale Dataset for Subject Detection

01/09/2022
by   Xin Miao, et al.

Images and videos usually contain main subjects: the objects that the photographer wants to highlight. Human viewers identify them easily, but algorithms often confuse them with other objects. Detecting the main subjects is therefore an important technique for helping machines understand the content of images and videos. We present a new dataset with the goal of training models to understand the layout of the objects and the context of the image, and then to find the main subjects among them. This is achieved in three aspects. By gathering images from movie shots created by directors with professional shooting skills, we collect a dataset with strong diversity: it contains 107,700 images from 21,540 movie shots. We label the images with bounding boxes for two classes: subject and non-subject foreground object. We present a detailed analysis of the dataset and compare the task with saliency detection and object detection. ImageSubject is the first dataset that tries to localize the subject that the photographer wants to highlight in an image. Moreover, we find that the transformer-based detection model offers the best result among popular model architectures. Finally, we discuss potential applications and conclude with the importance of the dataset.
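To make the labeling scheme concrete, here is a minimal Python sketch of what a two-class bounding-box annotation could look like. The abstract does not describe the annotation file format, so the names `Label`, `Box`, and `main_subjects`, the corner-coordinate layout, and the example values are all hypothetical. The only figures taken from the abstract are the image and shot counts, which work out to five images per shot on average; how those frames are sampled is not stated.

```python
from dataclasses import dataclass
from enum import Enum


class Label(Enum):
    # The two classes named in the abstract.
    SUBJECT = "subject"
    NON_SUBJECT_FOREGROUND = "non-subject foreground object"


@dataclass(frozen=True)
class Box:
    # Hypothetical layout: pixel coordinates of the top-left and
    # bottom-right corners. The real dataset format may differ.
    x_min: float
    y_min: float
    x_max: float
    y_max: float
    label: Label


def main_subjects(boxes):
    """Return only the boxes labeled as main subjects."""
    return [b for b in boxes if b.label is Label.SUBJECT]


# Dataset size as reported in the abstract.
NUM_IMAGES = 107_700
NUM_SHOTS = 21_540
images_per_shot = NUM_IMAGES / NUM_SHOTS  # average frames per movie shot

# Made-up annotations for a single image, for illustration only.
example = [
    Box(120, 80, 360, 420, Label.SUBJECT),
    Box(400, 200, 520, 430, Label.NON_SUBJECT_FOREGROUND),
]
subjects = main_subjects(example)
```

Keeping the non-subject foreground objects as their own class, rather than dropping them, lets a detector learn to tell a highlighted subject apart from other prominent objects in the same frame.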


