An Ensemble Approach toward Automated Variable Selection for Network Anomaly Detection

by   Makiya Nakashima, et al.

While variable selection is essential to optimize the learning complexity by prioritizing features, automating the selection process is preferred since it requires laborious efforts with intensive analysis otherwise. However, it is not an easy task to enable the automation due to several reasons. First, selection techniques often need a condition to terminate the reduction process, for example, by using a threshold or the number of features to stop, and searching an adequate stopping condition is highly challenging. Second, it is uncertain that the reduced variable set would work well; our preliminary experimental result shows that well-known selection techniques produce different sets of variables as a result of reduction (even with the same termination condition), and it is hard to estimate which of them would work the best in future testing. In this paper, we demonstrate the potential power of our approach to the automation of selection process that incorporates well-known selection methods identifying important variables. Our experimental results with two public network traffic data (UNSW-NB15 and IDS2017) show that our proposed method identifies a small number of core variables, with which it is possible to approximate the performance to the one with the entire variables.


page 1

page 2

page 3

page 4


Conditional Variable Selection for Intelligent Test

Intelligent test requires efficient and effective analysis of high-dimen...

selectBoost: a general algorithm to enhance the performance of variable selection methods in correlated datasets

Motivation: With the growth of big data, variable selection has become o...

Generalized Variable Selection Algorithms for Gaussian Process Models by LASSO-like Penalty

With the rapid development of modern technology, massive amounts of data...

Ready When You Are: Efficient Condition Variables via Delegated Condition Evaluation

Multi-thread applications commonly utilize condition variables for commu...

DiscoVars: A New Data Analysis Perspective – Application in Variable Selection for Clustering

We present a new data analysis perspective to determine variable importa...

A comparison of different types of Niching Genetic Algorithms for variable selection in solar radiation estimation

Variable selection problems generally present more than a single solutio...

Controlled-Variable Selection based on Chaos Theory for the Tennessee Eastman Plant

This work explores a link between chaotic signals and the selection of c...

Please sign up or login with your details

Forgot password? Click here to reset