An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction

05/01/2020
by Bhargavi Paranjape, et al.

Decisions of complex language understanding models can be rationalized by limiting their inputs to a relevant subsequence of the original text. A rationale should be as concise as possible without significantly degrading task performance, but this balance can be difficult to achieve in practice. In this paper, we show that it is possible to better manage this trade-off by optimizing a bound on the Information Bottleneck (IB) objective. Our fully unsupervised approach jointly learns an explainer that predicts sparse binary masks over sentences, and an end-task predictor that considers only the extracted rationale. Using IB, we derive a learning objective that allows direct control of mask sparsity levels through a tunable sparse prior. Experiments on ERASER benchmark tasks demonstrate significant gains over norm-minimization techniques for both task performance and agreement with human rationales. Furthermore, we find that in the semi-supervised setting, a modest amount of gold rationales (25% of training examples) closes the gap with a model that uses the full input. Code: https://github.com/bhargaviparanjape/explainable_qa
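
The objective described in the abstract can be illustrated with a minimal sketch. It is not taken from the paper's code. It assumes the bound takes the common variational IB form of a task loss plus a weighted KL term between each sentence's mask probability and a Bernoulli prior. The names `bernoulli_kl`, `ib_style_loss`, `prior_pi`, and `beta` are hypothetical and chosen only for this example.

```python
import math


def bernoulli_kl(p, pi, eps=1e-8):
    """KL( Bernoulli(p) || Bernoulli(pi) ) for one sentence's mask probability."""
    # Clamp p away from 0 and 1 so the logarithms stay finite.
    p = min(max(p, eps), 1.0 - eps)
    return p * math.log(p / pi) + (1.0 - p) * math.log((1.0 - p) / (1.0 - pi))


def ib_style_loss(task_nll, mask_probs, prior_pi=0.2, beta=1.0):
    """Task loss plus a sparsity term that pulls mask probabilities toward prior_pi.

    task_nll   : negative log-likelihood of the label given the masked input
                 (assumed to be computed elsewhere by the end-task predictor).
    mask_probs : per-sentence probabilities of keeping each sentence
                 (assumed to come from the explainer).
    prior_pi   : assumed prior probability that a sentence is kept; a smaller
                 value asks for a shorter rationale.
    beta       : assumed weight trading off task loss against sparsity.
    """
    kl = sum(bernoulli_kl(p, prior_pi) for p in mask_probs) / len(mask_probs)
    return task_nll + beta * kl


if __name__ == "__main__":
    # Masks that keep far more sentences than the prior expects are penalized more.
    print(ib_style_loss(0.5, [0.9, 0.8, 0.9, 0.7], prior_pi=0.2))
    print(ib_style_loss(0.5, [0.9, 0.1, 0.1, 0.1], prior_pi=0.2))
```

In this sketch the sparsity level is set through `prior_pi` rather than through a norm penalty on the mask, which is one way to read the abstract's "tunable sparse prior".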


