Unifying Token and Span Level Supervisions for Few-Shot Sequence Labeling

07/16/2023
by Zifeng Cheng, et al.

Few-shot sequence labeling aims to identify novel classes from only a few labeled samples. Existing methods address data scarcity mainly by designing token-level or span-level labeling models based on metric learning. However, these methods are trained at a single granularity (i.e., either token level or span level) and therefore inherit the weaknesses of that granularity. In this paper, we first unify token-level and span-level supervision and propose a Consistent Dual Adaptive Prototypical (CDAP) network for few-shot sequence labeling. CDAP contains a token-level network and a span-level network, which are trained jointly at their respective granularities. To align the outputs of the two networks, we further propose a consistent loss that enables them to learn from each other. During the inference phase, we propose a consistent greedy inference algorithm that first adjusts the predicted probability and then greedily selects non-overlapping spans with maximum probability. Extensive experiments show that our model achieves new state-of-the-art results on three benchmark datasets.
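
The greedy selection step of the inference algorithm can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how the predicted probability is adjusted, so that step is omitted and the probabilities are taken as given. The `Span` tuple layout (inclusive token indices), the function name `greedy_select`, the `threshold` parameter, and the example candidates are all hypothetical and exist only for illustration.

```python
from typing import List, Tuple

# Hypothetical span format: (start, end, label, probability),
# where start and end are inclusive token indices.
Span = Tuple[int, int, str, float]


def greedy_select(spans: List[Span], threshold: float = 0.0) -> List[Span]:
    """Greedily keep the most probable spans that do not overlap.

    Assumes the probabilities have already been adjusted; the adjustment
    step described in the abstract is not modeled here.
    """
    chosen: List[Span] = []
    # Visit candidates from most to least probable.
    for span in sorted(spans, key=lambda s: s[3], reverse=True):
        start, end, _, prob = span
        if prob < threshold:
            break  # every remaining candidate is less probable still
        # Keep the span only if it is disjoint from every span kept so far.
        if all(end < s or start > e for s, e, _, _ in chosen):
            chosen.append(span)
    # Return the kept spans in reading order.
    return sorted(chosen, key=lambda s: s[0])


candidates = [
    (0, 1, "PER", 0.90),
    (1, 2, "ORG", 0.80),  # overlaps the PER span at token 1
    (3, 3, "LOC", 0.70),
    (2, 3, "ORG", 0.60),  # overlaps the LOC span at token 3
]
selected = greedy_select(candidates)
```

In this example the two `ORG` candidates are discarded because each shares a token with a more probable span that was already selected. Sorting once and checking against the kept spans is quadratic in the worst case, which is usually acceptable for sentence-length inputs.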

research
10/04/2018

A Span Selection Model for Semantic Role Labeling

We present a simple and accurate span-based model for semantic role labe...
research
10/23/2022

Span-based joint entity and relation extraction augmented with sequence tagging mechanism

Span-based joint extraction simultaneously conducts named entity recogni...
research
03/21/2022

Effective Token Graph Modeling using a Novel Labeling Strategy for Structured Sentiment Analysis

The state-of-the-art model for structured sentiment analysis casts the t...
research
05/21/2021

Boosting Span-based Joint Entity and Relation Extraction via Squence Tagging Mechanism

Span-based joint extraction simultaneously conducts named entity recogni...
research
09/27/2021

An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling

Few-Shot Sequence Labeling (FSSL) is a canonical solution for the taggin...
research
05/28/2021

Accelerating BERT Inference for Sequence Labeling via Early-Exit

Both performance and efficiency are crucial factors for sequence labelin...
research
05/06/2018

Zero-shot Sequence Labeling: Transferring Knowledge from Sentences to Tokens

Can attention- or gradient-based visualization techniques be used to inf...