On the uses and abuses of regression models: a call for reform of statistical practice and teaching

09/13/2023
by John B Carlin, et al.

When students and users of statistical methods first learn about regression analysis there is an emphasis on the technical details of models and estimation methods that invariably runs ahead of the purposes for which these models might be used. More broadly, statistics is widely understood to provide a body of techniques for "modelling data", underpinned by what we describe as the "true model myth", according to which the task of the statistician/data analyst is to build a model that closely approximates the true data generating process. By way of our own historical examples and a brief review of mainstream clinical research journals, we describe how this perspective leads to a range of problems in the application of regression methods, including misguided "adjustment" for covariates, misinterpretation of regression coefficients and the widespread fitting of regression models without a clear purpose. We then outline an alternative approach to the teaching and application of regression methods, which begins by focussing on clear definition of the substantive research question within one of three distinct types: descriptive, predictive, or causal. The simple univariable regression model may be introduced as a tool for description, while the development and application of multivariable regression models should proceed differently according to the type of question. Regression methods will no doubt remain central to statistical practice as they provide a powerful tool for representing variation in a response or outcome variable as a function of "input" variables, but their conceptualisation and usage should follow from the purpose at hand.
