Zero-shot Sequence Labeling: Transferring Knowledge from Sentences to Tokens

05/06/2018
by   Marek Rei, et al.
0

Can attention- or gradient-based visualization techniques be used to infer token-level labels for binary sequence tagging problems, using networks trained only on sentence-level labels? We construct a neural network architecture based on soft attention, train it as a binary sentence classifier and evaluate against token-level annotation on four different datasets. Inferring token labels from a network provides a method for quantitatively evaluating what the model is learning, along with generating useful feedback in assistance systems. Our results indicate that attention-based methods are able to predict token-level labels more accurately, compared to gradient-based methods, sometimes even rivaling the supervised oracle network.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/04/2019

Toward Grammatical Error Detection from Sentence Labels: Zero-shot Sequence Labeling with CNNs and Contextualized Embeddings

Zero-shot grammatical error detection is the task of tagging token-level...
research
03/26/2021

Turning transformer attention weights into zero-shot sequence labelers

We demonstrate how transformer-based models can be redesigned in order t...
research
11/14/2018

Jointly Learning to Label Sentences and Tokens

Learning to construct text representations in end-to-end systems can be ...
research
04/24/2019

Generating Token-Level Explanations for Natural Language Inference

The task of Natural Language Inference (NLI) is widely modeled as superv...
research
07/16/2023

Unifying Token and Span Level Supervisions for Few-Shot Sequence Labeling

Few-shot sequence labeling aims to identify novel classes based on only ...
research
07/16/2017

End-to-End Information Extraction without Token-Level Supervision

Most state-of-the-art information extraction approaches rely on token-le...
research
12/02/2019

An Attention-Based Speaker Naming Method for Online Adaptation in Non-Fixed Scenarios

A speaker naming task, which finds and identifies the active speaker in ...

Please sign up or login with your details

Forgot password? Click here to reset