Random Bits Regression: a Strong General Predictor for Big Data

01/13/2015
by   Yi Wang, et al.
0

To improve accuracy and speed of regressions and classifications, we present a data-based prediction method, Random Bits Regression (RBR). This method first generates a large number of random binary intermediate/derived features based on the original input matrix, and then performs regularized linear/logistic regression on those intermediate/derived features to predict the outcome. Benchmark analyses on a simulated dataset, UCI machine learning repository datasets and a GWAS dataset showed that RBR outperforms other popular methods in accuracy and robustness. RBR (available on https://sourceforge.net/projects/rbr/) is very fast and requires reasonable memories, therefore, provides a strong, robust and fast predictor in the big data era.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/30/2021

Efficient and robust high-dimensional sparse logistic regression via nonlinear primal-dual hybrid gradient algorithms

Logistic regression is a widely used statistical model to describe the r...
research
04/05/2022

A robust scalar-on-function logistic regression for classification

Scalar-on-function logistic regression, where the response is a binary o...
research
04/27/2016

Local Uncertainty Sampling for Large-Scale Multi-Class Logistic Regression

A major challenge for building statistical models in the big data era is...
research
05/17/2021

Classifying variety of customer's online engagement for churn prediction with mixed-penalty logistic regression

Using big data to analyze consumer behavior can provide effective decisi...
research
06/30/2021

Robust Coreset for Continuous-and-Bounded Learning (with Outliers)

In this big data era, we often confront large-scale data in many machine...
research
11/13/2016

Accelerated Variance Reduced Block Coordinate Descent

Algorithms with fast convergence, small number of data access, and low p...
research
03/12/2020

Post-Estimation Smoothing: A Simple Baseline for Learning with Side Information

Observational data are often accompanied by natural structural indices, ...

Please sign up or login with your details

Forgot password? Click here to reset