Probabilistic Value Selection for Space Efficient Model

07/09/2020
by Gunarto Sindoro Njoo, et al.

This paper proposes Value Selection (VS) as an alternative to current mainstream preprocessing methods. Unlike feature selection, which removes features, and instance selection, which removes instances, value selection removes individual values (with respect to each feature) from the dataset, with two goals: reducing the model size and preserving its accuracy. Two probabilistic methods based on an information-theoretic metric are proposed: PVS and P + VS. Extensive experiments are conducted on benchmark datasets of various sizes, and the results are compared with existing preprocessing methods, including feature selection, feature transformation, and instance selection. The experimental results show that value selection can balance accuracy against model size reduction.
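To make the idea of removing values per feature more concrete, here is a minimal Python sketch. It is not the paper's PVS or P + VS algorithm, whose details the abstract does not give. As an assumed stand-in, it scores each value of a categorical feature by the information gain of the binary test "feature == value" against the class label, keeps the top-scoring values, and maps the rest to a placeholder. The names `value_scores`, `select_values`, `keep`, and `placeholder` are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def value_scores(rows, labels, feature):
    """Score each value of `feature` by the information gain of the
    binary split 'feature == value' (an assumed metric, not the paper's)."""
    base = entropy(labels)
    n = len(labels)
    scores = {}
    for value in {row[feature] for row in rows}:
        inside = [y for row, y in zip(rows, labels) if row[feature] == value]
        outside = [y for row, y in zip(rows, labels) if row[feature] != value]
        conditional = (len(inside) / n) * entropy(inside) \
            + (len(outside) / n) * entropy(outside)
        scores[value] = base - conditional
    return scores


def select_values(rows, labels, feature, keep, placeholder="__other__"):
    """Keep the `keep` highest-scoring values of `feature`; replace all
    other values with `placeholder`. Features and instances are untouched."""
    scores = value_scores(rows, labels, feature)
    # Sort by descending score; break ties by the value's string form.
    ranked = sorted(scores, key=lambda v: (-scores[v], str(v)))
    kept = set(ranked[:keep])
    new_rows = [
        {**row, feature: row[feature] if row[feature] in kept else placeholder}
        for row in rows
    ]
    return new_rows, kept


# Toy data (hypothetical): one categorical feature and a binary label.
rows = [{"color": c} for c in ["red", "red", "blue", "green", "yellow", "blue"]]
labels = [1, 1, 0, 0, 0, 0]
reduced_rows, kept_values = select_values(rows, labels, "color", keep=2)
```

In this sketch the number of rows and columns stays the same, and only the set of distinct values per feature shrinks, which is the distinction the abstract draws between value selection and feature or instance selection. How the actual methods choose which values to drop, and how dropped values are represented, may differ.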

research
12/06/2017

S-Shaped vs. V-Shaped Transfer Functions for Antlion Optimization Algorithm in Feature Selection Problems

Feature selection is an important preprocessing step for classification ...
research
07/06/2022

DIWIFT: Discovering Instance-wise Influential Features for Tabular Data

Tabular data is one of the most common data storage formats in business ...
research
06/11/2022

Feature Selection using e-values

In the context of supervised parametric models, we introduce the concept...
research
06/25/2023

Fast Classification with Sequential Feature Selection in Test Phase

This paper introduces a novel approach to active feature acquisition for...
research
11/16/2021

Outlier Detection as Instance Selection Method for Feature Selection in Time Series Classification

In order to allow machine learning algorithms to extract knowledge from ...
research
01/21/2020

Wrapper Feature Selection Algorithm for the Optimization of an Indicator System of Patent Value Assessment

Effective patent value assessment provides decision support for patent t...
research
03/08/2022

Model-free feature selection to facilitate automatic discovery of divergent subgroups in tabular data

Data-centric AI encourages the need of cleaning and understanding of dat...
