Optimal Discriminant Analysis in High-Dimensional Latent Factor Models

10/23/2022
by   Xin Bing, et al.
0

In high-dimensional classification problems, a commonly used approach is to first project the high-dimensional features into a lower dimensional space, and base the classification on the resulting lower dimensional projections. In this paper, we formulate a latent-variable model with a hidden low-dimensional structure to justify this two-step procedure and to guide which projection to choose. We propose a computationally efficient classifier that takes certain principal components (PCs) of the observed features as projections, with the number of retained PCs selected in a data-driven way. A general theory is established for analyzing such two-step classifiers based on any projections. We derive explicit rates of convergence of the excess risk of the proposed PC-based classifier. The obtained rates are further shown to be optimal up to logarithmic factors in the minimax sense. Our theory allows the lower-dimension to grow with the sample size and is also valid even when the feature dimension (greatly) exceeds the sample size. Extensive simulations corroborate our theoretical findings. The proposed method also performs favorably relative to other existing discriminant methods on three real data examples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/04/2023

Minimax optimal high-dimensional classification using deep neural networks

High-dimensional classification is a fundamentally important research pr...
research
02/22/2019

Model-based clustering in very high dimensions via adaptive projections

Mixture models are a standard approach to dealing with heterogeneous dat...
research
02/16/2015

Random Subspace Learning Approach to High-Dimensional Outliers Detection

We introduce and develop a novel approach to outlier detection based on ...
research
10/25/2022

Interpolating Discriminant Functions in High-Dimensional Gaussian Latent Mixtures

This paper considers binary classification of high-dimensional features ...
research
02/01/2022

Bootstrap Confidence Regions for Learned Feature Embeddings

Algorithmic feature learners provide high-dimensional vector representat...
research
11/04/2016

Classification with Ultrahigh-Dimensional Features

Although much progress has been made in classification with high-dimensi...
research
02/28/2021

Optimal Imperfect Classification for Gaussian Functional Data

Existing works on functional data classification focus on the constructi...

Please sign up or login with your details

Forgot password? Click here to reset