A practically efficient fixed-pivot selection algorithm and its extensible MATLAB suite

02/11/2023
by   Ivano Azzini, et al.
0

Many statistical problems and applications require repeated computation of order statistics, such as the median, but most statistical and programming environments do not offer in their main distribution linear selection algorithms. We introduce one, formally equivalent to quickselect, which keeps the position of the pivot fixed. This makes the implementation simpler and much practical compared with the best known solutions. It also enables an "oracular" pivot position option that can reduce a lot the convergence time of certain statistical applications. We have extended the algorithm to weighted percentiles such as the weighted median, applicable to data associated with varying precision measurements, image filtering, descriptive statistics like the medcouple and for combining multiple predictors in boosting algorithms. We provide the new functions in MATLAB, C and R. We have packaged them in a broad MATLAB toolbox addressing robust statistical methods, many of which can be now optimised by means of efficient (weighted) selections.

READ FULL TEXT
research
02/20/2014

Statistical Constraints

We introduce statistical constraints, a declarative modelling tool that ...
research
07/07/2021

Oblivious Median Slope Selection

We study the median slope selection problem in the oblivious RAM model. ...
research
03/04/2021

Variance Reduced Median-of-Means Estimator for Byzantine-Robust Distributed Inference

This paper develops an efficient distributed inference algorithm, which ...
research
09/30/2021

On simultaneous best linear unbiased prediction of future order statistics and associated properties

In this article, the joint best linear unbiased predictors (BLUPs) of tw...
research
04/12/2020

A Machine Learning Approach for Flagging Incomplete Bid-rigging Cartels

We propose a new method for flagging bid rigging, which is particularly ...
research
08/06/2017

Declarative Statistics

In this work we introduce declarative statistics, a suite of declarative...
research
01/06/2022

Robust Linear Predictions: Analyses of Uniform Concentration, Fast Rates and Model Misspecification

The problem of linear predictions has been extensively studied for the p...

Please sign up or login with your details

Forgot password? Click here to reset