Partial sequence labeling with structured Gaussian Processes

09/20/2022
by   Xiaolei Lu, et al.
0

Existing partial sequence labeling models mainly focus on max-margin framework which fails to provide an uncertainty estimation of the prediction. Further, the unique ground truth disambiguation strategy employed by these models may include wrong label information for parameter learning. In this paper, we propose structured Gaussian Processes for partial sequence labeling (SGPPSL), which encodes uncertainty in the prediction and does not need extra effort for model selection and hyperparameter learning. The model employs factor-as-piece approximation that divides the linear-chain graph structure into the set of pieces, which preserves the basic Markov Random Field structure and effectively avoids handling large number of candidate output sequences generated by partially annotated data. Then confidence measure is introduced in the model to address different contributions of candidate labels, which enables the ground-truth label information to be utilized in parameter learning. Based on the derived lower bound of the variational lower bound of the proposed model, variational parameters and confidence measures are estimated in the framework of alternating optimization. Moreover, weighted Viterbi algorithm is proposed to incorporate confidence measure to sequence prediction, which considers label ambiguity arose from multiple annotations in the training data and thus helps improve the performance. SGPPSL is evaluated on several sequence labeling tasks and the experimental results show the effectiveness of the proposed model.

READ FULL TEXT
research
09/20/2022

Weak Disambiguation for Partial Structured Output Learning

Existing disambiguation strategies for partial structured output learnin...
research
09/20/2022

Modeling sequential annotations for sequence labeling with crowds

Crowd sequential annotations can be an efficient and cost-effective way ...
research
01/04/2023

Learning Ambiguity from Crowd Sequential Annotations

Most crowdsourcing learning methods treat disagreement between annotator...
research
12/25/2014

Gaussian Process Pseudo-Likelihood Models for Sequence Labeling

Several machine learning problems arising in natural language processing...
research
06/03/2019

HERA: Partial Label Learning by Combining Heterogeneous Loss with Sparse and Low-Rank Regularization

Partial Label Learning (PLL) aims to learn from the data where each trai...
research
09/07/2020

Iterative Correction of Sensor Degradation and a Bayesian Multi-Sensor Data Fusion Method

We present a novel method for inferring ground-truth signal from multipl...
research
06/27/2012

Structured Learning from Partial Annotations

Structured learning is appropriate when predicting structured outputs su...

Please sign up or login with your details

Forgot password? Click here to reset