Sufficient reductions in regression with mixed predictors

10/25/2021
by   Efstathia Bura, et al.
0

Most data sets comprise of measurements on continuous and categorical variables. In regression and classification Statistics literature, modeling high-dimensional mixed predictors has received limited attention. In this paper we study the general regression problem of inferring on a variable of interest based on high dimensional mixed continuous and binary predictors. The aim is to find a lower dimensional function of the mixed predictor vector that contains all the modeling information in the mixed predictors for the response, which can be either continuous or categorical. The approach we propose identifies sufficient reductions by reversing the regression and modeling the mixed predictors conditional on the response. We derive the maximum likelihood estimator of the sufficient reductions, asymptotic tests for dimension, and a regularized estimator, which simultaneously achieves variable (feature) selection and dimension reduction (feature extraction). We study the performance of the proposed method and compare it with other approaches through simulations and real data examples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2020

Sufficient Dimension Reduction for Interactions

Dimension reduction lies at the heart of many statistical methods. In re...
research
07/15/2020

A likelihood-based approach for multivariate categorical response regression in high dimensions

We propose a penalized likelihood method to fit the bivariate categorica...
research
06/21/2022

Conditional probability tensor decompositions for multivariate categorical response regression

In many modern regression applications, the response consists of multipl...
research
12/14/2021

Linear Discriminant Analysis with High-dimensional Mixed Variables

Datasets containing both categorical and continuous variables are freque...
research
09/28/2019

A New Covariance Estimator for Sufficient Dimension Reduction in High-Dimensional and Undersized Sample Problems

The application of standard sufficient dimension reduction methods for r...
research
10/14/2022

Variable Importance Based Interaction Modeling with an Application on Initial Spread of COVID-19 in China

Interaction selection for linear regression models with both continuous ...
research
11/04/2021

Nonparametric Regression and Classification with Functional, Categorical, and Mixed Covariates

We consider nonparametric prediction with multiple covariates, in partic...

Please sign up or login with your details

Forgot password? Click here to reset