Few-Shot Sequence Labeling with Label Dependency Transfer

by Yutai Hou, et al.

Few-shot sequence labeling faces a unique challenge compared with other few-shot classification problems: it must model the dependencies between labels. Different domains often have different label sets, which makes it difficult to directly reuse label dependencies learned in one domain in another. In this paper, we introduce a dependency transfer mechanism that addresses this label-discrepancy problem. The mechanism learns abstract label transition patterns from the source domains and generalizes them to the target domain to improve the prediction of label sequences. We also develop a sequence matching network by adapting the matching network to the sequence labeling setting. Moreover, we propose a CRF-based few-shot sequence labeling framework that integrates both the dependency transfer mechanism and the sequence matching network. Experiments on slot tagging (ST) and named entity recognition (NER) datasets show that our model significantly outperforms the strongest few-shot learning baseline by 7.96 and 11.70 F1 points, respectively, in the 1-shot setting.
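One way to picture how abstract transition patterns could carry over to an unseen label set is sketched below. The abstract does not spell out what the abstract patterns are, so this sketch assumes BIO-style labels and groups transitions by label prefix and by whether the two labels share a slot name. The function names, the grouping scheme, and the score values are all hypothetical and chosen only for illustration; they are not taken from the paper.

```python
from itertools import product

# Hypothetical abstract transition scores, as if learned on source domains.
# Key: (previous prefix, slot relation, current prefix). All values are made up.
ABSTRACT_SCORES = {
    ("O", "none", "O"): 0.5,
    ("O", "none", "B"): 0.3,
    ("O", "none", "I"): -5.0,   # an I- tag right after O is implausible
    ("B", "none", "O"): 0.2,
    ("I", "none", "O"): 0.2,
    ("B", "same", "B"): -1.0,
    ("B", "diff", "B"): 0.0,
    ("B", "same", "I"): 1.0,    # B-x followed by I-x continues a span
    ("B", "diff", "I"): -5.0,   # B-x followed by I-y is implausible
    ("I", "same", "B"): -1.0,
    ("I", "diff", "B"): 0.0,
    ("I", "same", "I"): 0.8,
    ("I", "diff", "I"): -5.0,
}


def split_label(label):
    """Split a BIO label such as 'B-city' into (prefix, slot); 'O' has no slot."""
    if label == "O":
        return "O", None
    prefix, slot = label.split("-", 1)
    return prefix, slot


def abstract_key(prev_label, curr_label):
    """Map a concrete label pair to its domain-independent abstract pattern."""
    prev_prefix, prev_slot = split_label(prev_label)
    curr_prefix, curr_slot = split_label(curr_label)
    if prev_prefix == "O" or curr_prefix == "O":
        relation = "none"
    else:
        relation = "same" if prev_slot == curr_slot else "diff"
    return prev_prefix, relation, curr_prefix


def expand_transitions(abstract_scores, labels):
    """Build a transition table for a target label set from abstract scores."""
    return {
        (prev, curr): abstract_scores[abstract_key(prev, curr)]
        for prev, curr in product(labels, repeat=2)
    }


# A target domain whose slot names never appeared in the source domains.
target_labels = ["O", "B-city", "I-city", "B-time", "I-time"]
transitions = expand_transitions(ABSTRACT_SCORES, target_labels)
```

Because the scores are indexed by abstract pattern rather than by concrete label, the same small table fills in the full transition table for any BIO label set. In a CRF-style decoder these transition scores would be combined with per-token label scores, which in the framework described above come from the sequence matching network.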





Few-shot Slot Tagging with Collapsed Dependency Transfer and Label-enhanced Task-adaptive Projection Network

In this paper, we explore the slot tagging with only a few labeled suppo...

Hierarchically-Refined Label Attention Network for Sequence Labeling

CRF has been used as a powerful model for statistical sequence labeling....

Adaptive Self-training for Few-shot Neural Sequence Labeling

Neural sequence labeling is an important technique employed for many Nat...

Label-Agnostic Sequence Labeling by Copying Nearest Neighbors

Retrieve-and-edit based approaches to structured prediction, where struc...

Dependency Structure Misspecification in Multi-Source Weak Supervision Models

Data programming (DP) has proven to be an attractive alternative to cost...

Few-Shot Event Detection with Prototypical Amortized Conditional Random Field

Event Detection, a fundamental task of Information Extraction, tends to ...

Augmented Natural Language for Generative Sequence Labeling

We propose a generative framework for joint sequence labeling and senten...