Efficient subsampling for exponential family models

06/29/2023
by   Subhadra Dasgupta, et al.
0

We propose a novel two-stage subsampling algorithm based on optimal design principles. In the first stage, we use a density-based clustering algorithm to identify an approximating design space for the predictors from an initial subsample. Next, we determine an optimal approximate design on this design space. Finally, we use matrix distances such as the Procrustes, Frobenius, and square-root distance to define the remaining subsample, such that its points are "closest" to the support points of the optimal design. Our approach reflects the specific nature of the information matrix as a weighted sum of non-negative definite Fisher information matrices evaluated at the design points and applies to a large class of regression models including models where the Fisher information is of rank larger than 1.

READ FULL TEXT
research
04/29/2019

Information geometry and asymptotic geodesics on the space of normal distributions

The family N of n-variate normal distributions is parameterized by the c...
research
09/01/2023

Information-based Optimal Subdata Selection for Clusterwise Linear Regression

Mixture-of-Experts models are commonly used when there exist distinct cl...
research
04/01/2021

Schrödinger encounters Fisher and Rao: a survey

In this short note we review the dynamical Schrödinger problem on the no...
research
05/27/2022

Information geometry of the Tojo-Yoshino's exponential family on the Poincaré upper plane

We study the dually flat information geometry of the Tojo-Yoshino expone...
research
03/20/2020

Design admissibility and de la Garza phenomenon in multi-factor experiments

The determination of an optimal design for a given regression problem is...
research
02/27/2018

On optimal designs for non-regular models

Classically, Fisher information is the relevant object in defining optim...
research
05/28/2018

Nonlinear Simplex Regression Models

In this paper, we propose a simplex regression model in which both the m...

Please sign up or login with your details

Forgot password? Click here to reset