On Measuring Model Complexity in Heteroscedastic Linear Regression

04/14/2022
by   Bo Luan, et al.
0

Heteroscedasticity is common in real world applications and is often handled by incorporating case weights into a modeling procedure. Intuitively, models fitted with different weight schemes would have a different level of complexity depending on how well the weights match the inverse of error variances. However, existing statistical theories on model complexity, also known as model degrees of freedom, were primarily established under the assumption of equal error variances. In this work, we focus on linear regression procedures and seek to extend the existing measures to a heteroscedastic setting. Our analysis of the weighted least squares method reveals some interesting properties of the extended measures. In particular, we find that they depend on both the weights used for model fitting and those for model evaluation. Moreover, modeling heteroscedastic data with optimal weights generally results in fewer degrees of freedom than with equal weights, and the size of reduction depends on the unevenness of error variance. This provides additional insights into weighted modeling procedures that are useful in risk estimation and model selection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2021

Predictive Model Degrees of Freedom in Linear Regression

Overparametrized interpolating models have drawn increasing attention fr...
research
11/12/2013

When Does More Regularization Imply Fewer Degrees of Freedom? Sufficient Conditions and Counter Examples from Lasso and Ridge Regression

Regularization aims to improve prediction performance of a given statist...
research
06/06/2018

Degrees of Freedom and Model Selection for kmeans Clustering

This paper investigates the problem of model selection for kmeans cluste...
research
10/31/2022

Three Properties of F-Statistics for Multiple Regression and ANOVA

This paper establishes three properties of F-statistics for inference ab...
research
11/22/2019

On the use of information criteria for subset selection in least squares regression

Least squares (LS) based subset selection methods are popular in linear ...
research
08/25/2023

Degrees of Freedom: Search Cost and Self-consistency

Model degrees of freedom () is a fundamental concept in statistics becau...

Please sign up or login with your details

Forgot password? Click here to reset