On Utilizing Relationships for Transferable Few-Shot Fine-Grained Object Detection

12/01/2022
by Ambar Pal, et al.

State-of-the-art object detectors are fast and accurate, but they require a large amount of well-annotated training data to perform well. However, obtaining a large number of training annotations specific to a particular task, i.e., fine-grained annotations, is costly in practice. In contrast, obtaining common-sense relationships from text, e.g., "a table-lamp is a lamp that sits on top of a table", is much easier. Additionally, common-sense relationships like "on-top-of" are easy to annotate in a task-agnostic fashion. In this paper, we propose a probabilistic model that uses such relational knowledge to transform an off-the-shelf detector of coarse object categories (e.g., "table", "lamp") into a detector of fine-grained categories (e.g., "table-lamp"). We demonstrate that our method, RelDetect, achieves performance competitive with fine-tuning-based state-of-the-art object detector baselines when an extremely small number of fine-grained annotations is available (0.2% of the entire dataset). We also demonstrate that RelDetect can exploit the inherent transferability of relationship information to obtain better performance (+5 mAP points) than these baselines on an unseen dataset (zero-shot transfer). In summary, we demonstrate the power of using relationships for object detection on datasets where fine-grained object categories can be linked to coarse-grained categories via suitable relationships.
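To make the coarse-to-fine idea concrete, the following is a toy sketch only. It is not the RelDetect model, whose probabilistic formulation is not given in the abstract. It assumes a hypothetical hand-written `on_top_of_score` heuristic over bounding boxes and a simple product rule for combining scores; the `Detection` class, the function names, and the box convention (y grows downward) are all illustrative assumptions.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    label: str
    score: float  # detector confidence in [0, 1]
    box: tuple    # (x_min, y_min, x_max, y_max); y grows downward (assumption)


def on_top_of_score(upper, lower):
    """Hypothetical relation heuristic in [0, 1], not taken from the paper.

    Multiplies the fraction of `upper`'s width that overlaps `lower`
    horizontally by a term that decays as the bottom edge of `upper`
    moves away from the top edge of `lower`.
    """
    ux0, uy0, ux1, uy1 = upper
    lx0, ly0, lx1, ly1 = lower
    width, height = ux1 - ux0, uy1 - uy0
    if width <= 0 or height <= 0:
        return 0.0
    overlap = max(0.0, min(ux1, lx1) - max(ux0, lx0))
    horizontal = overlap / width
    vertical = max(0.0, 1.0 - abs(uy1 - ly0) / height)
    return horizontal * vertical


def fine_grained_detections(detections, part_label, anchor_label, fine_label):
    """Rescore each `part_label` box as `fine_label` using the best-supporting
    `anchor_label` box. The product rule here is an illustrative assumption."""
    anchors = [d for d in detections if d.label == anchor_label]
    results = []
    for part in detections:
        if part.label != part_label:
            continue
        support = max(
            (a.score * on_top_of_score(part.box, a.box) for a in anchors),
            default=0.0,
        )
        results.append(Detection(fine_label, part.score * support, part.box))
    return results


coarse = [
    Detection("table", 0.9, (0, 100, 200, 150)),
    Detection("lamp", 0.8, (50, 40, 100, 100)),   # resting on the table
    Detection("lamp", 0.8, (300, 40, 350, 100)),  # far from the table
]
table_lamps = fine_grained_detections(coarse, "lamp", "table", "table-lamp")
```

In this sketch, a lamp whose box rests on a confidently detected table keeps most of its score as a "table-lamp", while a lamp with no supporting table is scored zero. No fine-grained annotations are used, which is the property the abstract attributes to relational knowledge.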

research
02/28/2017

Cascade one-vs-rest detection network for fine-grained recognition without part annotations

Fine-grained recognition is a challenging task due to the small intra-ca...
research
10/05/2022

Relational Proxies: Emergent Relationships as Fine-Grained Discriminators

Fine-grained categories that largely share the same set of parts cannot ...
research
04/03/2018

Transferring Common-Sense Knowledge for Object Detection

We propose the idea of transferring common-sense knowledge from source c...
research
03/16/2023

Commonsense Knowledge Assisted Deep Learning for Resource-constrained and Fine-grained Object Detection

In this paper, we consider fine-grained image object detection in resour...
research
02/16/2020

Learning to Group: A Bottom-Up Framework for 3D Part Discovery in Unseen Categories

We address the problem of discovering 3D parts for objects in unseen cat...
research
09/12/2020

Exploring the Hierarchy in Relation Labels for Scene Graph Generation

By assigning each relationship a single label, current approaches formul...
research
12/05/2017

R-FCN-3000 at 30fps: Decoupling Detection and Classification

We present R-FCN-3000, a large-scale real-time object detector in which ...
