Sample Size Calculations in Simple Linear Regression: Trials and Tribulations

07/24/2019
by   Tianyuan Guan, et al.
This paper addresses the determination of sample size for a given significance level and power in the context of a simple linear regression model. At a technical level, the simple linear regression model is a five-parameter model. It is natural to base sample size calculations on the least squares estimator of the slope parameter. Nuisance parameters, such as the variance of the predictor X and the conditional variance of the response Y, complicate the calculations. The current approaches in the literature are not illuminating. One approach is based on the conditional distribution of the slope estimator given the data on the predictor X. Another is based on the sample correlation coefficient. We overcome the problems by deriving the exact unconditional distribution of the test statistic built on the slope estimator. The exact unconditional distribution alleviates, to some extent, the difficulties in computing sample sizes. On the other hand, the test based on the sample correlation coefficient of X and Y avoids the problems besetting the test based on the slope parameter. However, we lose the intuitive interpretation that comes with the slope parameter. Surprisingly, the sample size obtained from the correlation test is in close agreement with the one obtained from the slope-based test in a broad array of settings.
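To make the correlation-based route concrete, here is a minimal sketch in Python. It is not the exact unconditional method of the paper: it uses the classical Fisher z approximation for a two-sided test of zero correlation, and the function names, default level, and default power are illustrative assumptions. The conversion from slope to correlation assumes the model Y = a + bX + e with e independent of X, so that the correlation equals b·sd(X) / sqrt(b²·var(X) + var(e)).

```python
import math
from statistics import NormalDist


def slope_to_correlation(slope, sd_x, sd_error):
    """Correlation of X and Y implied by Y = a + slope*X + e.

    Assumes e is independent of X with standard deviation sd_error
    (an assumption of this sketch, not a statement from the paper).
    """
    signal = slope * sd_x
    return signal / math.sqrt(signal ** 2 + sd_error ** 2)


def n_for_correlation_test(rho, alpha=0.05, power=0.80):
    """Approximate n for a two-sided test of H0: correlation = 0.

    Uses the Fisher z approximation: atanh(r) is roughly normal with
    variance 1 / (n - 3). This is a textbook approximation, not the
    exact distribution discussed in the abstract.
    """
    if not 0 < abs(rho) < 1:
        raise ValueError("rho must be nonzero and strictly between -1 and 1")
    std_normal = NormalDist()
    z_alpha = std_normal.inv_cdf(1 - alpha / 2)  # two-sided critical value
    z_power = std_normal.inv_cdf(power)          # quantile for the target power
    effect = math.atanh(abs(rho))                # Fisher z of the alternative
    return math.ceil(((z_alpha + z_power) / effect) ** 2 + 3)


if __name__ == "__main__":
    # Hypothetical inputs chosen only for illustration.
    rho = slope_to_correlation(slope=0.5, sd_x=1.0, sd_error=1.5)
    print(rho, n_for_correlation_test(rho))
```

Under this approximation, a target correlation of 0.3 at level 0.05 and power 0.80 gives n = 85. The sketch also shows why the nuisance parameters matter: the same slope maps to different correlations, and hence different sample sizes, as sd(X) and the error standard deviation change.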

research
04/17/2018

Minimax rate of testing in sparse linear regression

We consider the problem of testing the hypothesis that the parameter of ...
research
08/22/2023

Hybrid sample size calculations for cluster randomised trials using assurance

Sample size determination for cluster randomised trials (CRTs) is challe...
research
08/18/2021

On variance estimation for the one-sample log-rank test

Time-to-event endpoints show an increasing popularity in phase II cancer...
research
08/21/2019

Efficient and flexible simulation-based sample size determination for clinical trials with multiple design parameters

Simulation offers a simple and flexible way to estimate the power of a c...
research
10/22/2020

Sharp Bias-variance Tradeoffs of Hard Parameter Sharing in High-dimensional Linear Regression

Hard parameter sharing for multi-task learning is widely used in empiric...
research
02/19/2020

A non-inferiority test for R-squared with random regressors

Determining the lack of association between an outcome variable and a nu...
research
10/04/2018

Correcting the bias in least squares regression with volume-rescaled sampling

Consider linear regression where the examples are generated by an unknow...
