Gaussian process modelling of multiple short time series

10/09/2012
by   Hande Topa, et al.
0

We present techniques for effective Gaussian process (GP) modelling of multiple short time series. These problems are common when applying GP models independently to each gene in a gene expression time series data set. Such sets typically contain very few time points. Naive application of common GP modelling techniques can lead to severe over-fitting or under-fitting in a significant fraction of the fitted models, depending on the details of the data set. We propose avoiding over-fitting by constraining the GP length-scale to values that focus most of the energy spectrum to frequencies below the Nyquist frequency corresponding to the sampling frequency in the data set. Under-fitting can be avoided by more informative priors on observation noise. Combining these methods allows applying GP methods reliably automatically to large numbers of independent instances of short time series. This is illustrated with experiments with both synthetic data and real gene expression data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/11/2020

Energy consumption forecasting using a stacked nonparametric Bayesian approach

In this paper, the process of forecasting household energy consumption i...
research
01/27/2020

Bayesian nonparametric shared multi-sequence time series segmentation

In this paper, we introduce a method for segmenting time series data usi...
research
02/24/2021

Similarity measure for sparse time course data based on Gaussian processes

We propose a similarity measure for sparsely sampled time course data in...
research
01/07/2020

Scalable Hybrid HMM with Gaussian Process Emission for Sequential Time-series Data Clustering

Hidden Markov Model (HMM) combined with Gaussian Process (GP) emission c...
research
01/08/2014

Fast nonparametric clustering of structured time-series

In this publication, we combine two Bayesian non-parametric models: the ...
research
11/03/2021

Local Structure and effective Dimensionality of Time Series Data Sets

The goal of this paper is to develop novel tools for understanding the l...
research
12/21/2020

Statistical Modelling and Analysis of the Computer-Simulated Datasets

Over the last two decades, the science has come a long way from relying ...

Please sign up or login with your details

Forgot password? Click here to reset