A Statistical Approach to Set Classification by Feature Selection with Applications to Classification of Histopathology Images

02/19/2014
by   Sungkyu Jung, et al.
0

Set classification problems arise when classification tasks are based on sets of observations as opposed to individual observations. In set classification, a classification rule is trained with N sets of observations, where each set is labeled with class information, and the prediction of a class label is performed also with a set of observations. Data sets for set classification appear, for example, in diagnostics of disease based on multiple cell nucleus images from a single tissue. Relevant statistical models for set classification are introduced, which motivate a set classification framework based on context-free feature extraction. By understanding a set of observations as an empirical distribution, we employ a data-driven method to choose those features which contain information on location and major variation. In particular, the method of principal component analysis is used to extract the features of major variation. Multidimensional scaling is used to represent features as vector-valued points on which conventional classifiers can be applied. The proposed set classification approaches achieve better classification results than competing methods in a number of simulated data examples. The benefits of our method are demonstrated in an analysis of histopathology images of cell nuclei related to liver cancer.

READ FULL TEXT
research
01/02/2017

Towards multiple kernel principal component analysis for integrative analysis of tumor samples

Personalized treatment of patients based on tissue-specific cancer subty...
research
09/03/2010

Weighted Attribute Fusion Model for Face Recognition

Recognizing a face based on its attributes is an easy task for a human t...
research
11/25/2020

Feature Selection based on Principal Component Analysis for Underwater Source Localization by Deep Learning

In this paper, we propose an interpretable feature selection method base...
research
06/22/2022

Functional Nonlinear Learning

Using representations of functional data can be more convenient and bene...
research
05/11/2016

EEF: Exponentially Embedded Families with Class-Specific Features for Classification

In this letter, we present a novel exponentially embedded families (EEF)...
research
10/01/2021

A systematic evaluation of methods for cell phenotype classification using single-cell RNA sequencing data

Background: Single-cell RNA sequencing (scRNA-seq) yields valuable insig...
research
11/01/2017

Tensor Valued Common and Individual Feature Extraction: Multi-dimensional Perspective

A novel method for common and individual feature analysis from exceeding...

Please sign up or login with your details

Forgot password? Click here to reset