maars: Tidy Inference under the 'Models as Approximations' Framework in R

06/21/2021
by   Riccardo Fogliato, et al.
0

Linear regression using ordinary least squares (OLS) is a critical part of every statistician's toolkit. In R, this is elegantly implemented via lm() and its related functions. However, the statistical inference output from this suite of functions is based on the assumption that the model is well specified. This assumption is often unrealistic and at best satisfied approximately. In the statistics and econometrics literature, this has long been recognized and a large body of work provides inference for OLS under more practical assumptions. This can be seen as model-free inference. In this paper, we introduce our package maars ("models as approximations") that aims at bringing research on model-free inference to R via a comprehensive workflow. The maars package differs from other packages that also implement variance estimation, such as sandwich, in three key ways. First, all functions in maars follow a consistent grammar and return output in tidy format, with minimal deviation from the typical lm() workflow. Second, maars contains several tools for inference including empirical, multiplier, residual bootstrap, and subsampling, for easy comparison. Third, maars is developed with pedagogy in mind. For this, most of its functions explicitly return the assumptions under which the output is valid. This key innovation makes maars useful in teaching inference under misspecification and also a powerful tool for applied researchers. We hope our default feature of explicitly presenting assumptions will become a de facto standard for most statistical modeling in R.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/27/2018

Model-free Study of Ordinary Least Squares Linear Regression

Ordinary least squares (OLS) linear regression is one of the most basic ...
research
06/23/2018

Assumption Lean Regression

It is well known that models used in conventional regression analysis ar...
research
06/11/2021

Bootstrapping Clustered Data in R using lmeresampler

Linear mixed-effects models are commonly used to analyze clustered data ...
research
12/14/2021

The Importance of Discussing Assumptions when Teaching Bootstrapping

Bootstrapping and other resampling methods are progressively appearing i...
research
04/14/2016

Distribution-Free Predictive Inference For Regression

We develop a general framework for distribution-free predictive inferenc...
research
12/16/2020

A connection between the pattern classification problem and the General Linear Model for statistical inference

A connection between the General Linear Model (GLM) in combination with ...
research
09/03/2023

Tropical Geometric Tools for Machine Learning: the TML package

In the last decade, developments in tropical geometry have provided a nu...

Please sign up or login with your details

Forgot password? Click here to reset