Applying Discrete PCA in Data Analysis

07/11/2012
by   Wray L. Buntine, et al.
0

Methods for analysis of principal components in discrete data have existed for some time under various names such as grade of membership modelling, probabilistic latent semantic analysis, and genotype inference with admixture. In this paper we explore a number of extensions to the common theory, and present some application of these methods to some common statistical tasks. We show that these methods can be interpreted as a discrete version of ICA. We develop a hierarchical version yielding components at different levels of detail, and additional techniques for Gibbs sampling. We compare the algorithms on a text prediction task using support vector machines, and to information retrieval.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/28/2019

Distributed estimation of principal support vector machines for sufficient dimension reduction

The principal support vector machines method (Li et al., 2011) is a powe...
research
03/07/2019

Quantum Latent Semantic Analysis

The main goal of this paper is to explore latent topic analysis (LTA), i...
research
11/25/2019

ROIPCA: An Online PCA algorithm based on rank-one updates

Principal components analysis (PCA) is a fundamental algorithm in data a...
research
04/14/2019

Probabilistic Kernel Support Vector Machines

We propose a probabilistic enhancement of standard kernel Support Vecto...
research
08/12/2016

Content-based image retrieval tutorial

This paper functions as a tutorial for individuals interested to enter t...
research
12/30/2009

Computing Principal Components Dynamically

In this paper we present closed-form solutions for efficiently updating ...
research
10/05/2007

Semantic distillation: a method for clustering objects by their contextual specificity

Techniques for data-mining, latent semantic analysis, contextual search ...

Please sign up or login with your details

Forgot password? Click here to reset