Classification and clustering for samples of event time data using non-homogeneous Poisson process models

03/06/2017
by   Duncan Barrack, et al.
0

Data of the form of event times arise in various applications. A simple model for such data is a non-homogeneous Poisson process (NHPP) which is specified by a rate function that depends on time. We consider the problem of having access to multiple independent samples of event time data, observed on a common interval, from which we wish to classify or cluster the samples according to their rate functions. Each rate function is unknown but assumed to belong to a small set of rate functions defining distinct classes. We model the rate functions using a spline basis expansion, the coefficients of which need to be estimated from data. The classification approach consists of using training data for which the class membership is known and to calculate maximum likelihood estimates of the coefficients for each group, then assigning test samples to a class by a maximum likelihood criterion. For clustering, by analogy to the Gaussian mixture model approach for Euclidean data, we consider a mixture of NHPP models and use the expectation maximisation algorithm to estimate the coefficients of the rate functions for the component models and probability of membership for each sample to each model. The classification and clustering approaches perform well on both synthetic and real-world data sets considered. Code associated with this paper is available at https://github.com/duncan-barrack/NHPP .

READ FULL TEXT

page 10

page 11

page 12

page 13

page 14

page 18

research
10/03/2020

EGMM: an Evidential Version of the Gaussian Mixture Model for Clustering

The Gaussian mixture model (GMM) provides a convenient yet principled fr...
research
03/02/2018

Estimation of Poisson arrival processes under linear models

In this paper we consider the problem of estimating the parameters of a ...
research
11/16/2019

Maximum Approximate Likelihood Estimation in Accelerated Failure Time Model for Interval-Censored Data

The approximate Bernstein polynomial model, a mixture of beta distributi...
research
12/25/2013

Model-based clustering and segmentation of time series with changes in regime

Mixture model-based clustering, usually applied to multidimensional data...
research
04/16/2021

Interval-censored Hawkes processes

This work builds a novel point process and tools to use the Hawkes proce...
research
11/11/2014

Supervised Classification of Flow Cytometric Samples via the Joint Clustering and Matching (JCM) Procedure

We consider the use of the Joint Clustering and Matching (JCM) procedure...
research
03/04/2022

False clustering rate control in mixture models

The clustering task consists in delivering labels to the members of a sa...

Please sign up or login with your details

Forgot password? Click here to reset