Robust density estimation with the 𝕃_1-loss. Applications to the estimation of a density on the line satisfying a shape constraint

05/21/2022
by   Y. Baraud, et al.
0

We solve the problem of estimating the distribution of presumed i.i.d. observations for the total variation loss. Our approach is based on density models and is versatile enough to cope with many different ones, including some density models for which the Maximum Likelihood Estimator (MLE for short) does not exist. We mainly illustrate the properties of our estimator on models of densities on the line that satisfy a shape constraint. We show that it possesses some similar optimality properties, with regard to some global rates of convergence, as the MLE does when it exists. It also enjoys some adaptation properties with respect to some specific target densities in the model for which our estimator is proven to converge at parametric rate. More important is the fact that our estimator is robust, not only with respect to model misspecification, but also to contamination, the presence of outliers among the dataset and the equidistribution assumption. This means that the estimator performs almost as well as if the data were i.i.d. with density p in a situation where these data are only independent and most of their marginals are close enough in total variation to a distribution with density p. We also show that our estimator converges to the average density of the data, when this density belongs to the model, even when none of the marginal densities belongs to it. Our main result on the risk of the estimator takes the form of an exponential deviation inequality which is non-asymptotic and involves explicit numerical constants. We deduce from it several global rates of convergence, including some bounds for the minimax 𝕃_1-risks over the sets of concave and log-concave densities. These bounds derive from some specific results on the approximation of densities which are monotone, convex, concave and log-concave. Such results may be of independent interest.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/30/2018

Adaptation in multivariate log-concave density estimation

We study the adaptation properties of the multivariate log-concave maxim...
research
03/14/2019

High-dimensional nonparametric density estimation via symmetry and shape constraints

We tackle the problem of high-dimensional nonparametric density estimati...
research
03/13/2019

The Log-Concave Maximum Likelihood Estimator is Optimal in High Dimensions

We study the problem of learning a d-dimensional log-concave distributio...
research
12/13/2018

A Polynomial Time Algorithm for Maximum Likelihood Estimation of Multivariate Log-concave Densities

We study the problem of computing the maximum likelihood estimator (MLE)...
research
11/14/2019

Location estimation for symmetric log-concave densities

We revisit the problem of estimating the center of symmetry θ of an unkn...
research
01/23/2013

Relative Loss Bounds for On-line Density Estimation with the Exponential Family of Distributions

We consider on-line density estimation with a parameterized density from...
research
04/04/2018

Shape-Constrained Univariate Density Estimation

While the problem of estimating a probability density function (pdf) fro...

Please sign up or login with your details

Forgot password? Click here to reset