Bayesian Logistic Regression for Small Areas with Numerous Households

06/01/2018
by   Balgobin Nandram, et al.

We analyze binary data, available for a relatively large number (big data) of families (or households) within small areas, from a population-based survey. Inference is required for the finite population proportion of individuals with a specific characteristic in each area. To accommodate the binary data and important features of all sampled individuals, we use a hierarchical Bayesian logistic regression model in which each family (not area) has its own random effect. This modeling helps to correct for the overshrinkage so common in small area estimation. Because there are numerous families, the computational time needed to fit the joint posterior density using standard Markov chain Monte Carlo (MCMC) methods is prohibitive. Therefore, the joint posterior density of the hyper-parameters is approximated using an integrated nested normal approximation (INNA) via the multiplication rule. This approach provides a sampling-based method that permits fast computation, thereby avoiding very time-consuming MCMC methods. The random effects are then obtained from the exact conditional posterior density using parallel computing. The unknown nonsample features and household sizes are obtained using a nested Bayesian bootstrap, which can also be done using parallel computing. For relatively small data sets (e.g., 5000 families), we compare our method with an MCMC method to show that our approach is reasonable. We discuss an example on health severity using the Nepal Living Standards Survey (NLSS).
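To make two ingredients of the abstract concrete, the following is a minimal sketch, not the authors' method. It simulates binary responses from a logistic model with one random effect per household, then applies a plain (non-nested) Bayesian bootstrap to the sampled responses to obtain draws of an area proportion. The INNA approximation and the nested bootstrap for nonsample features and household sizes are not reproduced here. All function names, the single covariate, the household-size range, and the parameter values are assumptions chosen for illustration.

```python
import math
import random


def inv_logit(z):
    """Logistic function mapping a linear predictor to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def simulate_area(n_households, beta, sigma, rng):
    """Simulate one area: each household gets its own random effect."""
    data = []
    for _ in range(n_households):
        nu = rng.gauss(0.0, sigma)       # household-level random effect
        size = rng.randint(1, 6)         # household size (hypothetical range)
        members = []
        for _ in range(size):
            x = rng.gauss(0.0, 1.0)      # one covariate (hypothetical)
            p = inv_logit(beta[0] + beta[1] * x + nu)
            y = 1 if rng.random() < p else 0
            members.append((x, y))
        data.append(members)
    return data


def bayesian_bootstrap_weights(n, rng):
    """Dirichlet(1, ..., 1) weights from normalized Exp(1) variates."""
    g = [rng.expovariate(1.0) for _ in range(n)]
    total = sum(g)
    return [v / total for v in g]


def bootstrap_proportion(ys, n_draws, rng):
    """Plain Bayesian bootstrap draws of a proportion from binary ys."""
    draws = []
    for _ in range(n_draws):
        w = bayesian_bootstrap_weights(len(ys), rng)
        draws.append(sum(wi * yi for wi, yi in zip(w, ys)))
    return draws


rng = random.Random(0)
# Assumed values: 200 households, intercept -1.0, slope 0.5, sigma 0.8.
area = simulate_area(200, beta=(-1.0, 0.5), sigma=0.8, rng=rng)
ys = [y for household in area for (_, y) in household]
draws = bootstrap_proportion(ys, 500, rng)
post_mean = sum(draws) / len(draws)
print(f"sample proportion: {sum(ys) / len(ys):.3f}")
print(f"Bayesian bootstrap mean: {post_mean:.3f}")
```

Because the household effect enters the linear predictor, members of the same household share a common shift in their response probability, which is the within-family dependence that a family-level random effect is meant to capture. Each household's simulation is independent of the others, which loosely reflects why the per-family steps described in the abstract lend themselves to parallel computing.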


research
12/16/2017

Parallel Markov Chain Monte Carlo for Bayesian Hierarchical Models with Big Data, in Two Stages

Due to the escalating growth of big data sets in recent years, new paral...
research
07/22/2008

Inference with Discriminative Posterior

We study Bayesian discriminative inference given a model family p(c, x, θ)...
research
04/24/2023

Exact Bayesian Geostatistics Using Predictive Stacking

We develop Bayesian predictive stacking for geostatistical models. Our a...
research
01/15/2018

Divide and Recombine for Large and Complex Data: Model Likelihood Functions using MCMC

In Divide & Recombine (D&R), big data are divided into subsets, each ana...
research
08/30/2022

Bayesian Multinomial Logistic Regression for Numerous Categories

While multinomial logistic regression is a useful tool for classificatio...
research
11/08/2021

Bayesian modelling of statistical region- and family-level clustered ordinal outcome data from Turkey

This study is concerned with the analysis of three-level ordinal outcome...
