Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation

08/13/2020
by Jialian Wu, et al.

Despite previous successes in object analysis, detecting and segmenting a large number of object categories with a long-tailed data distribution remains challenging and under-investigated. For a large-vocabulary classifier, the chance of obtaining noisy logits is much higher, which can easily lead to incorrect recognition. In this paper, we exploit prior knowledge of the relations among object categories to cluster fine-grained classes into coarser parent classes, and construct a classification tree that is responsible for parsing an object instance into a fine-grained category via its parent class. In the classification tree, because there are far fewer parent class nodes, their logits are less noisy and can be used to suppress the wrong or noisy logits in the fine-grained class nodes. Because the parent classes can be constructed in more than one way, we further build multiple trees to form a classification forest in which each tree contributes its vote to the fine-grained classification. To alleviate the imbalanced learning caused by the long-tail phenomenon, we propose a simple yet effective resampling method, NMS Resampling, to re-balance the data distribution. Our method, termed Forest R-CNN, can serve as a plug-and-play module applicable to most object recognition models for recognizing more than 1000 categories. Extensive experiments are performed on the large-vocabulary dataset LVIS. Compared with the Mask R-CNN baseline, Forest R-CNN significantly boosts performance, with 11.5% and 3.9% AP improvements on the rare categories and overall categories, respectively. Moreover, we achieve state-of-the-art results on the LVIS dataset. Code is available at https://github.com/JialianW/Forest_RCNN.
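The idea of letting less noisy parent-class logits suppress noisy fine-grained logits, with several trees voting, can be illustrated with a small toy sketch. This is not the authors' implementation: the function name `forest_scores`, the reweighting rule (fine-class probability multiplied by its parent's probability, then renormalized), and the averaging of votes are all assumptions chosen for illustration, and the abstract does not specify the actual formulation.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array."""
    z = x - np.max(x)
    e = np.exp(z)
    return e / e.sum()


def forest_scores(fine_logits, parent_logits_per_tree, parent_of_per_tree):
    """Hypothetical helper: average per-tree votes for the fine-grained classes.

    Assumed rule (not taken from the paper): in each tree, a fine-class
    probability is multiplied by the probability of its parent class and
    renormalized, so a confident parent prediction damps fine-grained
    classes that belong to unlikely parents.
    """
    fine_prob = softmax(np.asarray(fine_logits, dtype=float))
    votes = []
    for parent_logits, parent_of in zip(parent_logits_per_tree, parent_of_per_tree):
        parent_prob = softmax(np.asarray(parent_logits, dtype=float))
        # Look up each fine class's parent probability and reweight.
        weighted = fine_prob * parent_prob[np.asarray(parent_of)]
        votes.append(weighted / weighted.sum())
    # Each tree contributes one vote; here the votes are simply averaged.
    return np.mean(votes, axis=0)


# Toy setup: 4 fine-grained classes; class 3 has a spuriously high logit.
fine_logits = [2.0, 1.0, 0.0, 2.2]
# Two trees with different groupings: {0,1},{2,3} and {0,2},{1,3}.
parent_of = [[0, 0, 1, 1], [0, 1, 0, 1]]
# Both trees' parent classifiers favor the parent that contains class 0.
parent_logits = [[3.0, 0.0], [2.0, 0.0]]

scores = forest_scores(fine_logits, parent_logits, parent_of)
print("plain argmax:", int(np.argmax(fine_logits)))
print("forest argmax:", int(np.argmax(scores)))
```

In this toy case the plain fine-grained classifier picks class 3, while the parent-guided vote picks class 0, because both trees place class 3 under an unlikely parent. Using two different groupings mirrors the abstract's point that the parent classes are not unique.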


research
03/11/2020

Equalization Loss for Long-Tailed Object Recognition

Object recognition techniques using convolutional neural networks (CNN) ...
research
07/23/2020

The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation

Most existing object instance detection and segmentation models only wor...
research
06/18/2020

Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax

Solving long-tail large vocabulary object detection with deep learning b...
research
06/20/2021

Solution for Large-scale Long-tailed Recognition with Noisy Labels

This is a technical report for CVPR 2021 AliProducts Challenge. AliProdu...
research
03/18/2021

Danish Fungi 2020 – Not Just Another Image Recognition Dataset

We introduce a novel fine-grained dataset and benchmark, the Danish Fung...
research
03/22/2022

Fine-Grained Scene Graph Generation with Data Transfer

Scene graph generation (SGG) aims to extract (subject, predicate, object...
research
11/09/2016

Generative Shape Models: Joint Text Recognition and Segmentation with Very Little Training Data

We demonstrate that a generative model for object shapes can achieve sta...
