Labeled Optimal Partitioning

06/24/2020
by Toby Dylan Hocking, et al.
In data sequences measured over space or time, an important problem is accurate detection of abrupt changes. In partially labeled data, it is important to correctly predict presence/absence of changes in positive/negative labeled regions, in both the train and test sets. One existing dynamic programming algorithm is designed for prediction in unlabeled test regions (and ignores the labels in the train set); another is for accurate fitting of train labels (but does not predict changepoints in unlabeled test regions). We resolve these issues by proposing a new optimal changepoint detection model that is guaranteed to fit the labels in the train data, and can also provide predictions of unlabeled changepoints in test data. We propose a new dynamic programming algorithm, Labeled Optimal Partitioning (LOPART), and we provide a formal proof that it solves the resulting non-convex optimization problem. We provide theoretical and empirical analysis of the time complexity of our algorithm, in terms of the number of labels and the size of the data sequence to segment. Finally, we provide empirical evidence that our algorithm is more accurate than the existing baselines, in terms of train and test label error.
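The train and test label error mentioned above can be illustrated with a small sketch. It is not the authors' code. The label format `(start, end, kind)`, the function name `count_label_errors`, and the convention that a changepoint `c` falls inside a region when `start <= c < end` are all assumptions made for illustration. A positive label is treated here as "at least one change expected"; the full text may use a stricter definition.

```python
from typing import List, Tuple

# Hypothetical label format: (start, end, kind), where kind is
# "positive" (a change is expected) or "negative" (no change expected).
Label = Tuple[int, int, str]


def count_label_errors(
    changepoints: List[int], labels: List[Label]
) -> Tuple[int, int]:
    """Return (false_positives, false_negatives) over the labeled regions."""
    false_pos = 0
    false_neg = 0
    for start, end, kind in labels:
        # Assumed convention: changepoint c is inside the region
        # when start <= c < end.
        n_changes = sum(1 for c in changepoints if start <= c < end)
        if kind == "negative" and n_changes > 0:
            # A change was predicted where the label says there is none.
            false_pos += 1
        elif kind == "positive" and n_changes == 0:
            # No change was predicted where the label says there is one.
            false_neg += 1
    return false_pos, false_neg


if __name__ == "__main__":
    labels = [(0, 10, "negative"), (20, 30, "positive"), (40, 50, "positive")]
    predicted = [5, 25]
    print(count_label_errors(predicted, labels))  # prints (1, 1)
```

In this example the predicted change at 5 violates the negative label, and the positive label on 40 to 50 contains no predicted change, giving one false positive and one false negative. Applying the same function to held-out labels gives a test label error under the same assumptions.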

research
10/05/2022

Functional Labeled Optimal Partitioning

Peak detection is a problem in sequential data analysis that involves di...
research
10/26/2021

DP-SSL: Towards Robust Semi-supervised Learning with A Few Labeled Samples

The scarcity of labeled data is a critical obstacle to deep learning. Se...
research
12/30/2019

End-to-end Learning, with or without Labels

We present an approach for end-to-end learning that allows one to jointl...
research
05/03/2007

Multiresolution Approximation of Polygonal Curves in Linear Complexity

We propose a new algorithm to the problem of polygonal curve approximati...
research
03/05/2020

Linear time dynamic programming for the exact path of optimal models selected from a finite set

Many learning algorithms are formulated in terms of finding model parame...
research
03/18/2018

A Robust AUC Maximization Framework with Simultaneous Outlier Detection and Feature Selection for Positive-Unlabeled Classification

The positive-unlabeled (PU) classification is a common scenario in real-...
research
03/25/2021

Prediction in the presence of response-dependent missing labels

In a variety of settings, limitations of sensing technologies or other s...
