Automated Supervised Feature Selection for Differentiated Patterns of Care

11/05/2021
by   Catherine Wanjiru, et al.
0

An automated feature selection pipeline was developed using several state-of-the-art feature selection techniques to select optimal features for Differentiating Patterns of Care (DPOC). The pipeline included three types of feature selection techniques; Filters, Wrappers and Embedded methods to select the top K features. Five different datasets with binary dependent variables were used and their different top K optimal features selected. The selected features were tested in the existing multi-dimensional subset scanning (MDSS) where the most anomalous subpopulations, most anomalous subsets, propensity scores, and effect of measures were recorded to test their performance. This performance was compared with four similar metrics gained after using all covariates in the dataset in the MDSS pipeline. We found out that despite the different feature selection techniques used, the data distribution is key to note when determining the technique to use.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/06/2022

Sparsity-based Feature Selection for Anomalous Subgroup Discovery

Anomalous pattern detection aims to identify instances where deviation f...
research
08/19/2023

Utilizing Semantic Textual Similarity for Clinical Survey Data Feature Selection

Survey data can contain a high number of features while having a compara...
research
03/08/2022

Model-free feature selection to facilitate automatic discovery of divergent subgroups in tabular data

Data-centric AI encourages the need of cleaning and understanding of dat...
research
02/01/2020

On the Consistency of Optimal Bayesian Feature Selection in the Presence of Correlations

Optimal Bayesian feature selection (OBFS) is a multivariate supervised s...
research
07/10/2022

FIB: A Method for Evaluation of Feature Impact Balance in Multi-Dimensional Data

Errors might not have the same consequences depending on the task at han...
research
06/26/2018

AutoSpearman: Automatically Mitigating Correlated Metrics for Interpreting Defect Models

The interpretation of defect models heavily relies on software metrics t...
research
03/21/2016

Static and Dynamic Feature Selection in Morphosyntactic Analyzers

We study the use of greedy feature selection methods for morphosyntactic...

Please sign up or login with your details

Forgot password? Click here to reset