Asymptotic Inference for Infinitely Imbalanced Logistic Regression

04/27/2022
by   Dorian Goldman, et al.
0

In this paper we extend the work of Owen (2007) by deriving a second order expansion for the slope parameter in logistic regression, when the size of the majority class is unbounded and the minority class is finite. More precisely, we demonstrate that the second order term converges to a normal distribution and explicitly compute its variance, which surprisingly once again depends only on the mean of the minority class points and not their arrangement under mild regularity assumptions. In the case that the majority class is normally distributed, we illustrate that the variance of the the limiting slope depends exponentially on the z-score of the average of the minority class's points with respect to the majority class's distribution. We confirm our results by Monte Carlo simulations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/07/2019

F-measure Maximizing Logistic Regression

Logistic regression is a widely used method in several fields. When appl...
research
02/26/2019

Logarithmic Regret for parameter-free Online Logistic Regression

We consider online optimization procedures in the context of logistic re...
research
12/28/2018

A Descriptive Study of Variable Discretization and Cost-Sensitive Logistic Regression on Imbalanced Credit Data

Training classification models on imbalanced data sets tends to result i...
research
09/25/2021

Random Walk-steered Majority Undersampling

In this work, we propose Random Walk-steered Majority Undersampling (RWM...
research
07/03/2020

On Second order correctness of Bootstrap in Logistic Regression

In the fields of clinical trials, biomedical surveys, marketing, banking...
research
10/09/2020

Sparse network asymptotics for logistic regression

Consider a bipartite network where N consumers choose to buy or not to b...
research
02/24/2018

Dimensionally Tight Running Time Bounds for Second-Order Hamiltonian Monte Carlo

Hamiltonian Monte Carlo (HMC) is a widely deployed method to sample from...

Please sign up or login with your details

Forgot password? Click here to reset