Exact model comparisons in the plausibility framework

11/01/2019
by   Stefan Böhringer, et al.
0

Plausibility is a formalization of exact tests for parametric models and generalizes procedures such as Fisher's exact test. The resulting tests are based on cumulative probabilies of the probability density function and have a goodness-of-fit interpretation with exact control of the α level for finite sample size. Model comparisons are not possible in this approach. We generalize plausibility by incorporating weighing which allows to perform model comparisons. We show that one weighing scheme is asymptotically equivalent to the likelihood ratio test (LRT) and has finite sample guarantees for the test size under the null hypothesis unlike the LRT. We confirm theoretical properties in simulations that mimic the data set of our data application. We apply the method to a retinoblastoma data set and demonstrate a parent-of-origin effect. Weighted plausibility also has applications in high-dimensional data analysis and P-values for penalized regression models can be derived. We demonstrate superior performance as compared to a data-splitting procedure in a simulation study. We apply weighted plausibility to a high-dimensional gene expression, case-control prostate cancer data set. We discuss the flexibility of the approach by relating weighted plausibility to targeted learning, the bootstrap, and sparsity selection.

READ FULL TEXT
research
05/17/2018

High-dimensional doubly robust tests for regression parameters

After variable selection, standard inferential procedures for regression...
research
03/06/2023

Model checking for high-dimensional parametric regressions: the conditionally studentized test

This paper studies model checking for general parametric regression mode...
research
08/16/2018

Wild bootstrap logrank tests with broader power functions for testing superiority

We introduce novel wild bootstrap procedures for testing superiority in ...
research
10/24/2022

E-Valuating Classifier Two-Sample Tests

We propose E-C2ST, a classifier two-sample test for high-dimensional dat...
research
10/07/2019

Where to find needles in a haystack?

In many existing methods in multiple comparison, one starts with either ...
research
05/04/2023

Credibility of high R^2 in regression problems: a permutation approach

The question of whether Y can be predicted based on X often arises and w...
research
01/06/2023

Rank-transformed subsampling: inference for multiple data splitting and exchangeable p-values

Many testing problems are readily amenable to randomised tests such as t...

Please sign up or login with your details

Forgot password? Click here to reset