Subset selection for linear mixed models

07/27/2021
by   Daniel R. Kowal, et al.
0

Linear mixed models (LMMs) are instrumental for regression analysis with structured dependence, such as grouped, clustered, or multilevel data. However, selection among the covariates–while accounting for this structured dependence–remains a challenge. We introduce a Bayesian decision analysis for subset selection with LMMs. Using a Mahalanobis loss function that incorporates the structured dependence, we derive optimal linear actions for any subset of covariates and under any Bayesian LMM. Crucially, these actions inherit shrinkage or regularization and uncertainty quantification from the underlying Bayesian LMM. Rather than selecting a single "best" subset, which is often unstable and limited in its information content, we collect the acceptable family of subsets that nearly match the predictive ability of the "best" subset. The acceptable family is summarized by its smallest member and key variable importance metrics. Customized subset search and out-of-sample approximation algorithms are provided for more scalable computing. These tools are applied to simulated data and a longitudinal physical activity dataset, and in both cases demonstrate excellent prediction, estimation, and selection ability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/20/2021

Bayesian subset selection and variable importance for interpretable prediction and classification

Subset selection is a valuable tool for interpretable learning, scientif...
research
06/23/2020

Fast, Optimal, and Targeted Predictions using Parametrized Decision Analysis

Prediction is critical for decision-making under uncertainty and lends v...
research
10/19/2015

Piecewise-Linear Approximation for Feature Subset Selection in a Sequential Logit Model

This paper concerns a method of selecting a subset of features for a seq...
research
06/13/2012

Observation Subset Selection as Local Compilation of Performance Profiles

Deciding what to sense is a crucial task, made harder by dependencies an...
research
12/10/2020

Optimal selection of a common subset of covariates for different regressions

Given a regression dataset of size n, most of the classical model select...
research
07/10/2022

Energy Trees: Regression and Classification With Structured and Mixed-Type Covariates

The continuous growth of data complexity requires methods and models tha...
research
06/01/2020

METASET: Exploring Shape and Property Spaces for Data-Driven Metamaterials Design

Data-driven design of mechanical metamaterials is an increasingly popula...

Please sign up or login with your details

Forgot password? Click here to reset