Generating Visual Explanations

03/28/2016
by Lisa Anne Hendricks, et al.

Clearly explaining a rationale for a classification decision to an end-user can be as important as the decision itself. Existing approaches for deep visual recognition are generally opaque and do not output any justification text; contemporary vision-language models can describe image content but fail to take into account class-discriminative image aspects which justify visual predictions. We propose a new model that focuses on the discriminating properties of the visible object, jointly predicts a class label, and explains why the predicted label is appropriate for the image. We propose a novel loss function based on sampling and reinforcement learning that learns to generate sentences that realize a global sentence property, such as class specificity. Our results on a fine-grained bird species classification dataset show that our model is able to generate explanations which are not only consistent with an image but also more discriminative than descriptions produced by existing captioning methods.
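The sampling-based loss mentioned in the abstract can be illustrated with a REINFORCE-style (score-function) gradient estimator. The following is only a minimal sketch under stated assumptions, not the authors' implementation: the toy vocabulary, the per-position independent softmax policy, and the `reward` function (a stand-in for a class-specificity score computed on a whole sampled sentence) are all hypothetical.

```python
import math
import random

VOCAB = ["bird", "red", "beak", "wing"]  # hypothetical toy vocabulary
SENT_LEN = 3                             # hypothetical fixed sentence length


def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_sentence(logits, rng):
    """Sample one word index per position from independent softmax distributions."""
    return [rng.choices(range(len(VOCAB)), weights=softmax(row))[0] for row in logits]


def reward(sentence):
    """Hypothetical global sentence reward: 1.0 if the class-specific word 'red' appears."""
    return 1.0 if VOCAB.index("red") in sentence else 0.0


def reinforce_gradient(logits, rng, num_samples=64):
    """Monte Carlo estimate of the gradient of expected reward w.r.t. the logits."""
    grad = [[0.0] * len(VOCAB) for _ in logits]
    samples = [sample_sentence(logits, rng) for _ in range(num_samples)]
    rewards = [reward(s) for s in samples]
    baseline = sum(rewards) / num_samples  # mean-reward baseline to reduce variance
    for sent, r in zip(samples, rewards):
        for t, w in enumerate(sent):
            probs = softmax(logits[t])
            for v in range(len(VOCAB)):
                # d log p(w) / d logit_v = 1[v == w] - p_v
                indicator = 1.0 if v == w else 0.0
                grad[t][v] += (r - baseline) * (indicator - probs[v]) / num_samples
    return grad


def prob_contains_red(logits):
    """Exact probability that a sampled sentence contains 'red' at some position."""
    red = VOCAB.index("red")
    miss = 1.0
    for row in logits:
        miss *= 1.0 - softmax(row)[red]
    return 1.0 - miss


def train(steps=200, lr=0.5, seed=0):
    rng = random.Random(seed)
    logits = [[0.0] * len(VOCAB) for _ in range(SENT_LEN)]
    for _ in range(steps):
        grad = reinforce_gradient(logits, rng)
        for t in range(SENT_LEN):
            for v in range(len(VOCAB)):
                logits[t][v] += lr * grad[t][v]  # gradient ascent on expected reward
    return logits
```

Calling `prob_contains_red(train())` shows how the policy shifts toward sentences that satisfy the global property. The reward is only available after a full sentence is sampled, so it cannot be backpropagated through the discrete sampling step directly. That is why a sampling-based estimator is used in this sketch.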


