Sensitivity as a Complexity Measure for Sequence Classification Tasks

04/21/2021
by   Michael Hahn, et al.
6

We introduce a theoretical framework for understanding and predicting the complexity of sequence classification tasks, using a novel extension of the theory of Boolean function sensitivity. The sensitivity of a function, given a distribution over input sequences, quantifies the number of disjoint subsets of the input sequence that can each be individually changed to change the output. We argue that standard sequence classification methods are biased towards learning low-sensitivity functions, so that tasks requiring high sensitivity are more difficult. To that end, we show analytically that simple lexical classifiers can only express functions of bounded sensitivity, and we show empirically that low-sensitivity functions are easier to learn for LSTMs. We then estimate sensitivity on 15 NLP tasks, finding that sensitivity is higher on challenging tasks collected in GLUE than on simple text classification tasks, and that sensitivity predicts the performance both of simple lexical classifiers and of vanilla BiLSTMs without pretrained contextualized embeddings. Within a task, sensitivity predicts which inputs are hard for such simple models. Our results suggest that the success of massively pretrained contextual representations stems in part because they provide representations from which information can be extracted by low-sensitivity decoders.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/26/2023

On the distribution of sensitivities of symmetric Boolean functions

A Boolean function f(x⃗) is sensitive to bit x_i if there is at least on...
research
11/22/2022

Simplicity Bias in Transformers and their Ability to Learn Sparse Boolean Functions

Despite the widespread success of Transformers on NLP tasks, recent work...
research
07/23/2021

On Boolean Functions with Low Polynomial Degree and Higher Order Sensitivity

Boolean functions are important primitives in different domains of crypt...
research
09/15/2021

Comparing Text Representations: A Theory-Driven Approach

Much of the progress in contemporary NLP has come from learning represen...
research
09/01/2019

Topics to Avoid: Demoting Latent Confounds in Text Classification

Despite impressive performance on many text classification tasks, deep n...
research
08/25/2023

Compressor-Based Classification for Atrial Fibrillation Detection

Atrial fibrillation (AF) is one of the most common arrhythmias with chal...
research
06/07/2022

Locality-sensitive bucketing functions for the edit distance

Many bioinformatics applications involve bucketing a set of sequences wh...

Please sign up or login with your details

Forgot password? Click here to reset