Streaming Inference for Infinite Non-Stationary Clustering

05/02/2022
by   Rylan Schaeffer, et al.
0

Learning from a continuous stream of non-stationary data in an unsupervised manner is arguably one of the most common and most challenging settings facing intelligent agents. Here, we attack learning under all three conditions (unsupervised, streaming, non-stationary) in the context of clustering, also known as mixture modeling. We introduce a novel clustering algorithm that endows mixture models with the ability to create new clusters online, as demanded by the data, in a probabilistic, time-varying, and principled manner. To achieve this, we first define a novel stochastic process called the Dynamical Chinese Restaurant Process (Dynamical CRP), which is a non-exchangeable distribution over partitions of a set; next, we show that the Dynamical CRP provides a non-stationary prior over cluster assignments and yields an efficient streaming variational inference algorithm. We conclude with experiments showing that the Dynamical CRP can be applied on diverse synthetic and real data with Gaussian and non-Gaussian likelihoods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/13/2022

Dirichlet process mixture models for non-stationary data streams

In recent years, we have seen a handful of work on inference algorithms ...
research
10/10/2018

Harmonizable mixture kernels with variational Fourier features

The expressive power of Gaussian processes depends heavily on the choice...
research
04/24/2018

Learning Manifolds from Non-stationary Streaming Data

Streaming adaptations of manifold learning based dimensionality reductio...
research
02/07/2019

Online Clustering by Penalized Weighted GMM

With the dawn of the Big Data era, data sets are growing rapidly. Data i...
research
10/28/2016

Adaptive regularization for Lasso models in the context of non-stationary data streams

Large scale, streaming datasets are ubiquitous in modern machine learnin...
research
09/08/2015

Modelling time evolving interactions in networks through a non stationary extension of stochastic block models

In this paper, we focus on the stochastic block model (SBM),a probabilis...
research
09/25/2020

Towards the interpretation of time-varying regularization parameters in streaming penalized regression models

High-dimensional, streaming datasets are ubiquitous in modern applicatio...

Please sign up or login with your details

Forgot password? Click here to reset