Margin-free classification and new class detection using finite Dirichlet mixtures

03/25/2021
by Prince John, et al.

We present a margin-free finite mixture model that simultaneously classifies objects into known classes and identifies possible new object types from a set of continuous attributes. The application is motivated by the need to classify variable stars, a particular kind of star, and possibly to detect new types of them. We first transform the physical attributes of the stars onto the simplex to achieve scale invariance while maintaining their dependence structure. This allows us to compare data collected by different sky surveys, which can have different scales. The model combines a mixture of Dirichlet mixtures, which represents the known classes, with the semi-supervised classification strategy of Vatanen et al. (2012) for outlier detection. In line with previous work on semiparametric model-based clustering, the single Dirichlet distributions can be seen as providing the baseline pattern of the data. These are then combined to model the complex distributions of the attributes for the different classes. The model is estimated using a hierarchical two-step procedure that combines a suitably adapted version of the Expectation-Maximization (EM) algorithm with Bayes' rule. We validate our model on a reliable sample of periodic variable stars available in the literature (Dubath et al., 2011), achieving an overall classification accuracy of 71.95% and a sensitivity of 86.11%.
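To make the ingredients named in the abstract concrete, the following is a minimal sketch rather than the authors' implementation. It assumes the simplex transformation is a plain closure (dividing each attribute vector by its sum), which the abstract does not specify, and it assumes the mixture parameters have already been estimated, so the EM step and the outlier-detection strategy of Vatanen et al. (2012) are not shown. All function names and parameter values are hypothetical.

```python
import math


def closure(x):
    """Rescale positive attributes to sum to 1 (assumed simplex transform)."""
    total = sum(x)
    return [v / total for v in x]


def dirichlet_logpdf(p, alpha):
    """Log-density of Dirichlet(alpha) at a point p in the open simplex."""
    log_norm = math.lgamma(sum(alpha)) - sum(math.lgamma(a) for a in alpha)
    return log_norm + sum((a - 1.0) * math.log(v) for a, v in zip(alpha, p))


def mixture_logpdf(p, weights, alphas):
    """Log-density of a finite Dirichlet mixture, computed with log-sum-exp."""
    terms = [math.log(w) + dirichlet_logpdf(p, a)
             for w, a in zip(weights, alphas)]
    m = max(terms)
    return m + math.log(sum(math.exp(t - m) for t in terms))


def class_posteriors(p, priors, class_mixtures):
    """Bayes' rule: P(class | p) is proportional to prior * class density.

    class_mixtures holds one (weights, alphas) pair per known class.
    """
    logs = [math.log(prior) + mixture_logpdf(p, w, a)
            for prior, (w, a) in zip(priors, class_mixtures)]
    m = max(logs)
    unnorm = [math.exp(v - m) for v in logs]
    z = sum(unnorm)
    return [u / z for u in unnorm]


# Hypothetical, hand-picked parameters for two known classes.
class_a = ([0.6, 0.4], [[8.0, 2.0, 2.0], [6.0, 3.0, 3.0]])
class_b = ([1.0], [[2.0, 2.0, 8.0]])

point = closure([7.0, 2.0, 1.0])
posteriors = class_posteriors(point, [0.5, 0.5], [class_a, class_b])
```

Because the closure divides by the sum, multiplying every attribute by a common factor leaves the transformed point unchanged, which is the sense of scale invariance illustrated here. The posterior vector can then be used to assign the object to the most probable known class.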

research
05/11/2020

Infinite mixtures of multivariate normal-inverse Gaussian distributions for clustering of skewed data

Mixtures of multivariate normal inverse Gaussian (MNIG) distributions ca...
research
02/19/2019

On the Convergence of EM for truncated mixtures of two Gaussians

Motivated by a recent result of Daskalakis et al. DGTZ18, we analyze the...
research
05/20/2020

The semi-hierarchical Dirichlet Process and its application to clustering homogeneous distributions

Assessing homogeneity of distributions is an old problem that has receiv...
research
04/13/2020

Estimation of Classification Rules from Partially Classified Data

We consider the situation where the observed sample contains some observ...
research
12/25/2013

Mixture model-based functional discriminant analysis for curve classification

Statistical approaches for Functional Data Analysis concern the paradigm...
research
12/04/2022

Variational Inference for Semiparametric Bayesian Novelty Detection in Large Datasets

After being trained on a fully-labeled training set, where the observati...
research
12/02/2015

Object-based World Modeling in Semi-Static Environments with Dependent Dirichlet-Process Mixtures

To accomplish tasks in human-centric indoor environments, robots need to...
