Automatic Search Interval for Smoothing Parameter in Penalized Splines

05/30/2022
by   Zheyuan Li, et al.
0

The selection of smoothing parameter is central to estimation of penalized splines. The best parameter value is often the one that optimizes a smoothness selection criterion, like the minimizer of generalized cross-validation error (GCV) and the maximizer of restricted likelihood (REML). To avoid ending up with an undesired local extremum rather than the global extremum, grid search should be used for optimization. Unfortunately, the method requires a pre-specified search interval that contains the unknown global extremum and there has not been any theory on how it could be provided. As a result, practitioners have to find it by trial and error. To overcome such difficulty, we develop novel algorithms to automatically find this interval. Our automatic search interval has four advantages. (i) It specifies a smoothing parameter range where the penalized least squares problem is numerically solvable. (ii) It is criterion-independent, so that different criteria like GCV and REML can be explored on the same parameter range. (iii) It is sufficiently wide to contain the global extremum of any criterion, so that for example, the global minimum of GCV and the global maximum of REML can both be identified. (iv) It is computationally cheap compared with grid search so that it carries no extra costs in practice. Our method is ready to use through R package gps (>= version 1.1). It may be embedded in other advanced statistical modeling methods that rely on penalized splines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/04/2009

Tuning parameter selection for penalized likelihood estimation of inverse covariance matrix

In a Gaussian graphical model, the conditional independence between two ...
research
01/15/2021

Matrix-free Penalized Spline Smoothing with Multiple Covariates

The paper motivates high dimensional smoothing with penalized splines an...
research
04/26/2023

A Statistical Interpretation of the Maximum Subarray Problem

Maximum subarray is a classical problem in computer science that given a...
research
05/28/2020

Composition Estimation via Shrinkage

In this note, we explore a simple approach to composition estimation, us...
research
06/26/2022

Prediction Errors for Penalized Regressions based on Generalized Approximate Message Passing

We discuss the prediction accuracy of assumed statistical models in term...
research
04/15/2019

Proportional hazards model with partly interval censoring and its penalized likelihood estimation

This paper considers the problem of semi-parametric proportional hazards...
research
12/30/2016

Adaptive Lambda Least-Squares Temporal Difference Learning

Temporal Difference learning or TD(λ) is a fundamental algorithm in the ...

Please sign up or login with your details

Forgot password? Click here to reset