iobes: A Library for Span-Level Processing

10/09/2020
by   Brian Lester, et al.
0

Many tasks in natural language processing, such as named entity recognition and slot-filling, involve identifying and labeling specific spans of text. In order to leverage common models, these tasks are often recast as sequence labeling tasks. Each token is given a label and these labels are prefixed with special tokens such as B- or I-. After a model assigns labels to each token, these prefixes are used to group the tokens into spans. Properly parsing these annotations is critical for producing fair and comparable metrics; however, despite its importance, there is not an easy-to-use, standardized, programmatically integratable library to help work with span labeling. To remedy this, we introduce our open-source library, iobes. iobes is used for parsing, converting, and processing spans represented as token level decisions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/22/2020

An Analysis of Simple Data Augmentation for Named Entity Recognition

Simple yet effective data augmentation techniques have been proposed for...
research
06/02/2020

Embeddings of Label Components for Sequence Labeling: A Case Study of Fine-grained Named Entity Recognition

In general, the labels used in sequence labeling consist of different ty...
research
10/05/2020

SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup

Active learning is an important technique for low-resource sequence labe...
research
12/21/2022

How Does Beam Search improve Span-Level Confidence Estimation in Generative Sequence Labeling?

Text-to-text generation models have increasingly become the go-to soluti...
research
10/08/2022

Detecting Label Errors in Token Classification Data

Mislabeled examples are a common issue in real-world data, particularly ...
research
05/08/2023

Token-level Fitting Issues of Seq2seq Models

Sequence-to-sequence (seq2seq) models have been widely used for natural ...
research
04/11/2023

flap: A Deterministic Parser with Fused Lexing

Lexers and parsers are typically defined separately and connected by a t...

Please sign up or login with your details

Forgot password? Click here to reset