Data aggregation can lead to biased inferences in Bayesian linear mixed models

03/04/2022
by   Daniel J. Schad, et al.
0

Bayesian linear mixed-effects models are increasingly being used in the cognitive sciences to perform null hypothesis tests, where a null hypothesis that an effect is zero is compared with an alternative hypothesis that the effect exists and is different from zero. While software tools for Bayes factor null hypothesis tests are easily accessible, how to specify the data and the model correctly is often not clear. In Bayesian approaches, many authors recommend data aggregation at the by-subject level and running Bayes factors on aggregated data. Here, we use simulation-based calibration for model inference to demonstrate that null hypothesis tests can yield biased Bayes factors, when computed from aggregated data. Specifically, when random slope variances differ (i.e., violated sphericity assumption), Bayes factors are too conservative for contrasts where the variance is small and they are too liberal for contrasts where the variance is large. Moreover, Bayes factors for by-subject aggregated data are biased (too liberal) when random item variance is present but ignored in the analysis. We also perform corresponding frequentist analyses (type I and II error probabilities) to illustrate that the same problems exist and are well known from frequentist tools. These problems can be circumvented by running Bayesian linear mixed-effects models on non-aggregated data such as on individual trials and by explicitly modeling the full random effects structure. Reproducible code is available from https://osf.io/mjf47/.

READ FULL TEXT
research
09/30/2022

Bayes factor functions for reporting outcomes of hypothesis tests

Bayes factors represent the ratio of probabilities assigned to data by c...
research
02/14/2021

Bayes Factors for Peri-Null Hypotheses

A perennial objection against Bayes factor point-null hypothesis tests i...
research
03/15/2021

Workflow Techniques for the Robust Use of Bayes Factors

Inferences about hypotheses are ubiquitous in the cognitive sciences. Ba...
research
03/21/2019

Variational Bayesian modelling of mixed-effects

This note is concerned with an accurate and computationally efficient va...
research
01/17/2019

Score-based Tests for Explaining Upper-Level Heterogeneity in Linear Mixed Models

Cross-level interactions among fixed effects in linear mixed models (als...
research
04/22/2022

Bayesian mixed-effect models for independent dynamic social network data

Relational event or time-stamped social network data have become increas...
research
02/23/2018

Bayesian Semiparametric Functional Mixed Models for Serially Correlated Functional Data, with Application to Glaucoma Data

Glaucoma, a leading cause of blindness, is characterized by optic nerve ...

Please sign up or login with your details

Forgot password? Click here to reset