Effects of Additional Data on Bayesian Clustering

07/13/2016
by   Keisuke Yamazaki, et al.
0

Hierarchical probabilistic models, such as mixture models, are used for cluster analysis. These models have two types of variables: observable and latent. In cluster analysis, the latent variable is estimated, and it is expected that additional information will improve the accuracy of the estimation of the latent variable. Many proposed learning methods are able to use additional data; these include semi-supervised learning and transfer learning. However, from a statistical point of view, a complex probabilistic model that encompasses both the initial and additional data might be less accurate due to having a higher-dimensional parameter. The present paper presents a theoretical analysis of the accuracy of such a model and clarifies which factor has the greatest effect on its accuracy, the advantages of obtaining additional data, and the disadvantages of increasing the complexity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/09/2013

Accuracy of Latent-Variable Estimation in Bayesian Semi-Supervised Learning

Hierarchical probabilistic models, such as Gaussian mixture models, are ...
research
10/05/2015

Bayesian Estimation of Multidimensional Latent Variables and Its Asymptotic Accuracy

Hierarchical learning models, such as mixture models and Bayesian networ...
research
05/15/2012

Asymptotic Accuracy of Bayes Estimation for Latent Variables with Redundancy

Hierarchical parametric models consisting of observable and latent varia...
research
11/10/2015

Anchored Discrete Factor Analysis

We present a semi-supervised learning algorithm for learning discrete fa...
research
11/02/2017

Overcoming data scarcity with transfer learning

Despite increasing focus on data publication and discovery in materials ...
research
06/01/2020

Correcting misclassification errors in crowdsourced ecological data: A Bayesian perspective

Many research domains use data elicited from "citizen scientists" when a...
research
12/17/2012

A Tutorial on Probabilistic Latent Semantic Analysis

In this tutorial, I will discuss the details about how Probabilistic Lat...

Please sign up or login with your details

Forgot password? Click here to reset