The supervised hierarchical Dirichlet process

12/17/2014
by   Andrew M. Dai, et al.
0

We propose the supervised hierarchical Dirichlet process (sHDP), a nonparametric generative model for the joint distribution of a group of observations and a response variable directly associated with that whole group. We compare the sHDP with another leading method for regression on grouped data, the supervised latent Dirichlet allocation (sLDA) model. We evaluate our method on two real-world classification problems and two real-world regression problems. Bayesian nonparametric regression models based on the Dirichlet process, such as the Dirichlet process-generalised linear models (DP-GLM) have previously been explored; these models allow flexibility in modelling nonlinear relationships. However, until now, Hierarchical Dirichlet Process (HDP) mixtures have not seen significant use in supervised problems with grouped data since a straightforward application of the HDP on the grouped data results in learnt clusters that are not predictive of the responses. The sHDP solves this problem by allowing for clusters to be learnt jointly from the group structure and from the label assigned to each group.

READ FULL TEXT
research
09/28/2009

Dirichlet Process Mixtures of Generalized Linear Models

We propose Dirichlet Process mixtures of Generalized Linear Models (DP-G...
research
10/24/2016

Bayesian Nonparametric Modeling of Heterogeneous Groups of Censored Data

Datasets containing large samples of time-to-event data arising from sev...
research
08/20/2018

Bayesian Regression for a Dirichlet Distributed Response using Stan

For an observed response that is composed by a set - or vector - of posi...
research
07/12/2020

The Dependent Dirichlet Process and Related Models

Standard regression approaches assume that some finite number of the res...
research
07/18/2017

Cooperative Hierarchical Dirichlet Processes: Superposition vs. Maximization

The cooperative hierarchical structure is a common and significant data ...
research
09/22/2016

Nonparametric Bayesian Topic Modelling with the Hierarchical Pitman-Yor Processes

The Dirichlet process and its extension, the Pitman-Yor process, are sto...
research
07/30/2021

Bayesian Nonparametric Classification for Incomplete Data With a High Missing Rate: an Application to Semiconductor Manufacturing Data

During the semiconductor manufacturing process, predicting the yield of ...

Please sign up or login with your details

Forgot password? Click here to reset