A Dirichlet Process Mixture Model for Directional-Linear Data

12/21/2022
by   Tong Zou, et al.
0

Directional data require specialized probability models because of the non-Euclidean and periodic nature of their domain. When a directional variable is observed jointly with linear variables, modeling their dependence adds an additional layer of complexity. This paper introduces a novel Bayesian nonparametric approach for directional-linear data based on the Dirichlet process. We first extend the projected normal distribution to model the joint distribution of linear variables and a directional variable with arbitrary dimension as a projection of a higher-dimensional augmented multivariate normal distribution (MVN). We call the new distribution the semi-projected normal distribution (SPN); it possesses properties similar to the MVN. The SPN is then used as the mixture distribution in a Dirichlet process model to obtain a more flexible class of models for directional-linear data. We propose a normal conditional inverse-Wishart distribution as part of the prior distribution to address an identifiability issue inherited from the projected normal and preserve conjugacy with the SPN distribution. A Gibbs sampling algorithm is provided for posterior inference. Experiments on synthetic data and the Berkeley image database show superior performance of the Dirichlet process SPN mixture model (DPSPN) in clustering compared to other directional-linear models. We also build a hierarchical Dirichlet process model with the SPN to develop a likelihood ratio approach to bloodstain pattern analysis using the DPSPN model for density estimation to estimate the likelihood of a given pattern from a set of training data.

READ FULL TEXT

page 10

page 12

research
10/22/2020

A Normal-Gamma Dirichlet Process Mixture Model

We propose a Dirichlet process mixture (DPM) for prediction and cluster-...
research
11/21/2017

The joint projected normal and skew-normal: a distribution for poly-cylindrical data

The contribution of this work is the introduction of a multivariate circ...
research
04/26/2022

Multivariate and regression models for directional data based on projected Pólya trees

Projected distributions have proved to be useful in the study of circula...
research
08/16/2021

Hierarchical Infinite Relational Model

This paper describes the hierarchical infinite relational model (HIRM), ...
research
10/12/2019

An Imputation model by Dirichlet Process Mixture of Elliptical Copulas for Data of Mixed Type

Copula-based methods provide a flexible approach to build missing data i...
research
12/03/2015

CrossCat: A Fully Bayesian Nonparametric Method for Analyzing Heterogeneous, High Dimensional Data

There is a widespread need for statistical methods that can analyze high...
research
08/12/2018

A New Look at F-Tests

Directional inference for vector parameters based on higher order approx...

Please sign up or login with your details

Forgot password? Click here to reset