Probabilistic Imputation for Time-series Classification with Missing Data

08/13/2023
by   SeungHyun Kim, et al.
0

Multivariate time series data for real-world applications typically contain a significant amount of missing values. The dominant approach for classification with such missing values is to impute them heuristically with specific values (zero, mean, values of adjacent time-steps) or learnable parameters. However, these simple strategies do not take the data generative process into account, and more importantly, do not effectively capture the uncertainty in prediction due to the multiple possibilities for the missing values. In this paper, we propose a novel probabilistic framework for classification with multivariate time series data with missing values. Our model consists of two parts; a deep generative model for missing value imputation and a classifier. Extending the existing deep generative models to better capture structures of time-series data, our deep generative model part is trained to impute the missing values in multiple plausible ways, effectively modeling the uncertainty of the imputation. The classifier part takes the time series data along with the imputed missing values and classifies signals, and is trained to capture the predictive uncertainty due to the multiple possibilities of imputations. Importantly, we show that naïvely combining the generative model and the classifier could result in trivial solutions where the generative model does not produce meaningful imputations. To resolve this, we present a novel regularization technique that can promote the model to produce useful imputation values that help classification. Through extensive experiments on real-world time series data with missing values, we demonstrate the effectiveness of our method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/27/2018

BRITS: Bidirectional Recurrent Imputation for Time Series

Time series are widely used as signals in many classification/regression...
research
07/19/2023

Sig-Splines: universal approximation and convex calibration of time series generative models

We propose a novel generative model for multivariate discrete-time time ...
research
04/30/2019

Multi-resolution Networks For Flexible Irregular Time Series Modeling (Multi-FIT)

Missing values, irregularly collected samples, and multi-resolution sign...
research
08/11/2018

A Consistent Method for Learning OOMs from Asymptotically Stationary Time Series Data Containing Missing Values

In the traditional framework of spectral learning of stochastic time ser...
research
08/05/2018

Missing Value Imputation Based on Deep Generative Models

Missing values widely exist in many real-world datasets, which hinders t...
research
05/06/2020

Deep Recurrent Disease Progression Model for Conversion-Time Prediction of Alzheimer's Disease

Alzheimer's disease (AD) is known as one of the major causes of dementia...
research
06/21/2020

VAEM: a Deep Generative Model for Heterogeneous Mixed Type Data

Deep generative models often perform poorly in real-world applications d...

Please sign up or login with your details

Forgot password? Click here to reset