On the Hauck-Donner Effect in Wald Tests: Detection, Tipping Points, and Parameter Space Characterization

01/23/2020
by   Thomas William Yee, et al.
0

The Wald test remains ubiquitous in statistical practice despite shortcomings such as its inaccuracy in small samples and lack of invariance under reparameterization. This paper develops on another but lesser-known shortcoming called the Hauck–Donner effect (HDE) whereby a Wald test statistic is not monotonely increasing as a function of increasing distance between the parameter estimate and the null value. Resulting in an upward biased p-value and loss of power, the aberration can lead to very damaging consequences such as in variable selection. The HDE afflicts many types of regression models and corresponds to estimates near the boundary of the parameter space. This article presents several new results, and its main contributions are to (i) propose a very general test for detecting the HDE, regardless of its underlying cause; (ii) fundamentally characterize the HDE by pairwise ratios of Wald and Rao score and likelihood ratio test statistics for 1-parameter distributions; (iii) show that the parameter space may be partitioned into an interior encased by 5 HDE severity measures (faint, weak, moderate, strong, extreme); (iv) prove that a necessary condition for the HDE in a 2 by 2 table is a log odds ratio of at least 2; (v) give some practical guidelines about HDE-free hypothesis testing. Overall, practical post-fit tests can now be conducted potentially to any model estimated by iteratively reweighted least squares, such as the generalized linear model (GLM) and Vector GLM (VGLM) classes, the latter which encompasses many popular regression models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2020

p-value peeking and estimating extrema

A pervasive issue in statistical hypothesis testing is that the reported...
research
07/08/2021

Likelihood-Free Frequentist Inference: Bridging Classical Statistics and Machine Learning in Simulation and Uncertainty Quantification

Many areas of science make extensive use of computer simulators that imp...
research
02/15/2021

On the Inability of the Higher Criticism to Detect Rare/Weak Departures

Consider a multiple hypothesis testing setting involving rare/weak featu...
research
11/23/2017

Multiple Improvements of Multiple Imputation Likelihood Ratio Tests

Multiple imputation (MI) inference handles missing data by first properl...
research
08/29/2020

Efficiency Loss of Asymptotically Efficient Tests in an Instrumental Variables Regression

In an instrumental variable model, the score statistic can be stochastic...
research
07/15/2021

Optimal tests of the composite null hypothesis arising in mediation analysis

The indirect effect of an exposure on an outcome through an intermediate...
research
10/14/2022

Conditional Likelihood Ratio Test with Many Weak Instruments

This paper extends validity of the conditional likelihood ratio (CLR) te...

Please sign up or login with your details

Forgot password? Click here to reset