A robust multivariate linear non-parametric maximum likelihood model for ties

02/23/2021
by   Landon Hurley, et al.
0

Statistical analysis in applied research, across almost every field (e.g., biomedical, economics, computer science, and psychological) makes use of samples upon which the explicit error distribution of the dependent variable is unknown or, at best, difficult to linearly model. Yet, these assumptions are extremely common. Unknown distributions are of course biased when incorrectly specified, compromising the generalisability of our interpretations – the linearly unbiased Euclidean distance is very difficult to correctly identify upon finite samples and therefore results in an estimator which is neither unbiased nor maximally informative when incorrectly applied. The alternative common solution to the problem however, the use of non-parametric statistics, has its own fundamental flaws. In particular, these flaws revolve around the problem of order-statistics and the estimation in the presence of ties, which often removes the introduction of multiple independent variables and the estimation of interactions. We introduce a competitor to the Euclidean norm, the Kemeny norm, which we prove to be a valid Banach space, and construct a multivariate linear expansion of the Kendall-Theil-Sen estimator, which performs without compromising the parameter space extensibility, and establish its linear maximum likelihood properties. Empirical demonstrations upon both simulated and empirical data shall be used to demonstrate these properties, such that the new estimator is nearly equivalent in power for the glm upon Gaussian data, but grossly superior for a vast array of analytic scenarios, including finite ordinal sum-score analysis, thereby aiding in the resolution of replication in the Applied Sciences.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/01/2023

An unbiased non-parametric correlation estimator in the presence of ties

An inner-product Hilbert space formulation of the Kemeny distance is def...
research
09/28/2022

Exact and efficient multivariate two-sample tests through generalized linear rank statistics

So-called linear rank statistics provide a means for distribution-free (...
research
07/20/2023

Unbiased analytic non-parametric correlation estimators in the presence of ties

An inner-product Hilbert space formulation is defined over a domain of a...
research
08/10/2019

Estimation of the Number of Components of Non-Parametric Multivariate Finite Mixture Models

We propose a novel estimator for the number of components (denoted by M)...
research
04/11/2022

Robust and Efficient Parameter Estimation for Discretely Observed Stochastic Processes

In various practical situations, we encounter data from stochastic proce...
research
07/20/2023

Studentising non-parametric correlation estimators

Studentisation upon rank-based linear estimators is generally considered...
research
05/17/2020

A Tutorial on Multivariate k-Statistics and their Computation

This document aims to provide an accessible tutorial on the unbiased est...

Please sign up or login with your details

Forgot password? Click here to reset