A distributed regression analysis application based on SAS software. Part I: Linear and logistic regression

08/07/2018
by Qoua L. Her, et al.

Previous work has demonstrated the feasibility and value of conducting distributed regression analysis (DRA), a privacy-protecting analytic method that performs multivariable-adjusted regression analysis with only summary-level information from participating sites. To our knowledge, there are no DRA applications in SAS, the statistical software used by several large national distributed data networks (DDNs), including the Sentinel System and PCORnet. SAS/IML can perform the matrix computations that DRA requires within the SAS system. However, SAS/IML is licensed separately, and not all data partners in these large DDNs have access to it. In this first article of a two-paper series, we describe a DRA application developed for the Base SAS and SAS/STAT modules that performs linear and logistic DRA within horizontally partitioned DDNs, and we report its successful tests.
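To illustrate the idea of regression from summary-level information in a horizontally partitioned network, the following is a minimal Python sketch of the linear case only. It is not the SAS application described in the paper: the function names (`site_summaries`, `solve`, `distributed_ols`) and the toy data are hypothetical, chosen for illustration. Each site shares only its cross-product matrices X'X and X'y, and a coordinator sums them and solves the normal equations, which gives the same coefficients as fitting on the pooled rows.

```python
from typing import List, Tuple

Matrix = List[List[float]]
Vector = List[float]


def site_summaries(X: Matrix, y: Vector) -> Tuple[Matrix, Vector]:
    """Run locally at one site: return X'X and X'y.

    Only these aggregates leave the site, never individual rows.
    """
    p = len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(p)] for i in range(p)]
    xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(p)]
    return xtx, xty


def solve(A: Matrix, b: Vector) -> Vector:
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]  # augmented matrix [A | b]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def distributed_ols(summaries: List[Tuple[Matrix, Vector]]) -> Vector:
    """Run at the coordinator: sum the site summaries and solve (X'X) beta = X'y."""
    p = len(summaries[0][1])
    xtx = [[sum(s[0][i][j] for s in summaries) for j in range(p)] for i in range(p)]
    xty = [sum(s[1][i] for s in summaries) for i in range(p)]
    return solve(xtx, xty)


# Made-up toy data: first column is the intercept, second is one covariate.
site_a = ([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]], [1.0, 3.1, 4.9])
site_b = ([[1.0, 3.0], [1.0, 4.0]], [7.2, 8.8])

# Distributed fit: the coordinator sees only the two sites' summaries.
beta_distributed = distributed_ols([site_summaries(*site_a), site_summaries(*site_b)])

# Reference fit on pooled rows, which a real DDN would not permit.
pooled_X = site_a[0] + site_b[0]
pooled_y = site_a[1] + site_b[1]
beta_pooled = distributed_ols([site_summaries(pooled_X, pooled_y)])
```

In this sketch the distributed and pooled estimates agree up to floating-point rounding, because summing the per-site cross-products reproduces the pooled cross-products exactly. The logistic case has no closed-form solution, so a comparable sketch would need several rounds in which sites return summed score and information matrices for the current coefficient estimate.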

research
08/07/2018

A distributed regression analysis application based on SAS software Part II: Cox proportional hazards regression

Previous work has demonstrated the feasibility and value of conducting d...
research
06/18/2021

Scalable Econometrics on Big Data – The Logistic Regression on Spark

Extra-large datasets are becoming increasingly accessible, and computing...
research
03/31/2023

Almost Linear Constant-Factor Sketching for ℓ_1 and Logistic Regression

We improve upon previous oblivious sketching and turnstile streaming res...
research
10/03/2019

Minimax Bounds for Distributed Logistic Regression

We consider a distributed logistic regression problem where labeled data...
research
04/05/2023

Distributed Logistic Regression for Massive Data with Rare Events

Large-scale rare events data are commonly encountered in practice. To ta...
research
02/19/2021

Do NHL goalies get hot in the playoffs? A multilevel logistic regression analysis

The hot-hand theory posits that an athlete who has performed well in the...
research
10/03/2015

Distributed Parameter Map-Reduce

This paper describes how to convert a machine learning problem into a se...
