DeepAI AI Chat
Log In Sign Up

Robust Sparse Bayesian Infinite Factor Models

by   Jaejoon Lee, et al.

Most of previous works and applications of Bayesian factor model have assumed the normal likelihood regardless of its validity. We propose a Bayesian factor model for heavy-tailed high-dimensional data based on multivariate Student-t likelihood to obtain better covariance estimation. We use multiplicative gamma process shrinkage prior and factor number adaptation scheme proposed in Bhattacharya Dunson [Biometrika (2011) 291-306]. Since a naive Gibbs sampler for the proposed model suffers from slow mixing, we propose a Markov Chain Monte Carlo algorithm where fast mixing of Hamiltonian Monte Carlo is exploited for some parameters in proposed model. Simulation results illustrate the gain in performance of covariance estimation for heavy-tailed high-dimensional data. We also provide a theoretical result that the posterior of the proposed model is weakly consistent under reasonable conditions. We conclude the paper with the application of proposed factor model on breast cancer metastasis prediction given DNA signature data of cancer cell.


Fast Variational Inference for Bayesian Factor Analysis in Single and Multi-Study Settings

Factors models are routinely used to analyze high-dimensional data in bo...

Efficient posterior sampling for high-dimensional imbalanced logistic regression

High-dimensional data are routinely collected in many application areas....

Adaptive Bayesian Variable Clustering via Structural Learning of Breast Cancer Data

Clustering of proteins is of interest in cancer cell biology. This artic...

Stratified stochastic variational inference for high-dimensional network factor model

There has been considerable recent interest in Bayesian modeling of high...

Dynamic Factor Analysis with Dependent Gaussian Processes for High-Dimensional Gene Expression Trajectories

The increasing availability of high-dimensional, longitudinal measures o...

Bayesian Distance Weighted Discrimination

Distance weighted discrimination (DWD) is a linear discrimination method...

Bayesian time-aligned factor analysis of paired multivariate time series

Many modern data sets require inference methods that can estimate the sh...

Code Repositories


An Rcpp-based R program for massive covariance estimation using robust sparse Bayesian infinite factor models (Lee & Lee, 2020)

view repo