Sparsity-based Feature Selection for Anomalous Subgroup Discovery

by Girmaw Abebe Tadesse, et al.

Anomalous pattern detection aims to identify instances where deviation from normalcy is evident, and is widely applicable across domains. Multiple anomaly detection techniques have been proposed in the state of the art. However, a principled and scalable feature selection method for efficient discovery is commonly lacking. Existing feature selection techniques typically optimize the performance of outcome prediction rather than capture systemic deviations of outcomes from the expected. In this paper, we propose a sparsity-based automated feature selection (SAFS) framework, which encodes systemic outcome deviations via the sparsity of feature-driven odds ratios. SAFS is a model-agnostic approach that can be used with different discovery techniques. SAFS achieves a more than 3× reduction in computation time while maintaining detection performance when validated on a publicly available critical care dataset. SAFS also outperforms multiple feature selection baselines.
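
The idea of scoring features by the sparsity of their odds ratios can be illustrated with a minimal sketch. The abstract does not specify the details, so everything below is an assumption made for illustration: the one-vs-rest odds ratio per feature value, the additive smoothing constant, the use of the Gini index as the sparsity measure, the ranking direction (higher sparsity first), and the function names `odds_ratios`, `gini_index`, and `rank_features`. It should not be read as the authors' implementation.

```python
from collections import defaultdict


def odds_ratios(values, outcomes, smoothing=0.5):
    """Odds ratio of a positive outcome for each feature value vs. all other values."""
    counts = defaultdict(lambda: [0, 0])  # value -> [positives, negatives]
    for v, y in zip(values, outcomes):
        counts[v][0 if y else 1] += 1
    total_pos = sum(c[0] for c in counts.values())
    total_neg = sum(c[1] for c in counts.values())
    ratios = {}
    for v, (pos, neg) in counts.items():
        # Additive smoothing (an assumption) avoids division by zero.
        a, b = pos + smoothing, neg + smoothing
        c, d = total_pos - pos + smoothing, total_neg - neg + smoothing
        ratios[v] = (a * d) / (b * c)
    return ratios


def gini_index(xs):
    """Gini index of non-negative numbers: 0 when all are equal, larger when sparse."""
    xs = sorted(xs)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum((i + 1) * x for i, x in enumerate(xs))
    return (2 * weighted) / (n * total) - (n + 1) / n


def rank_features(rows, outcomes):
    """Rank features by the Gini sparsity of their odds ratios, highest first."""
    scores = {}
    for name in rows[0]:
        column = [row[name] for row in rows]
        scores[name] = gini_index(odds_ratios(column, outcomes).values())
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy data (made up): the outcome depends on "ward" and is unrelated to "coin".
rows = [{"ward": w, "coin": c} for w in "ABC" for c in "HTHT"]
outcomes = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
ranking = rank_features(rows, outcomes)
print(ranking)
```

In this toy example the odds ratios of `ward` are concentrated in a single value, so its Gini score is high, while `coin` has identical odds ratios for both values and scores zero. Because the score is computed from the data alone, such a ranking could be used as a preprocessing step before any subgroup discovery method, which is consistent with the model-agnostic claim above.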




