Geometry Constrained Weakly Supervised Object Localization

07/19/2020
by   Weizeng Lu, et al.
11

We propose a geometry constrained network, termed GC-Net, for weakly supervised object localization (WSOL). GC-Net consists of three modules: a detector, a generator and a classifier. The detector predicts the object location defined by a set of coefficients describing a geometric shape (i.e. ellipse or rectangle), which is geometrically constrained by the mask produced by the generator. The classifier takes the resulting masked images as input and performs two complementary classification tasks for the object and background. To make the mask more compact and more complete, we propose a novel multi-task loss function that takes into account area of the geometric shape, the categorical cross-entropy and the negative entropy. In contrast to previous approaches, GC-Net is trained end-to-end and predict object location without any post-processing (e.g. thresholding) that may require additional tuning. Extensive experiments on the CUB-200-2011 and ILSVRC2012 datasets show that GC-Net outperforms state-of-the-art methods by a large margin. Our source code is available at https://github.com/lwzeng/GC-Net.

READ FULL TEXT

page 2

page 7

page 9

page 11

page 14

research
12/29/2021

Background-aware Classification Activation Map for Weakly Supervised Object Localization

Weakly supervised object localization (WSOL) relaxes the requirement of ...
research
07/31/2023

DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization

Weakly-supervised temporal action localization (WTAL) is a practical yet...
research
09/21/2017

AffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection

We propose AffordanceNet, a new deep learning approach to simultaneously...
research
08/22/2019

3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization

Temporal action localization is a challenging computer vision problem wi...
research
09/07/2023

Box-based Refinement for Weakly Supervised and Unsupervised Localization Tasks

It has been established that training a box-based detector network can e...
research
06/15/2019

Mask Based Unsupervised Content Transfer

We consider the problem of translating, in an unsupervised manner, betwe...
research
09/06/2023

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

3D dense captioning requires a model to translate its understanding of a...

Please sign up or login with your details

Forgot password? Click here to reset