Interactive Reinforcement Learning for Feature Selection with Decision Tree in the Loop

10/02/2020
by   Wei Fan, et al.
49

We study the problem of balancing effectiveness and efficiency in automated feature selection. After exploring many feature selection methods, we observe a computational dilemma: 1) traditional feature selection is mostly efficient, but difficult to identify the best subset; 2) the emerging reinforced feature selection automatically navigates to the best subset, but is usually inefficient. Can we bridge the gap between effectiveness and efficiency under automation? Motivated by this dilemma, we aim to develop a novel feature space navigation method. In our preliminary work, we leveraged interactive reinforcement learning to accelerate feature selection by external trainer-agent interaction. In this journal version, we propose a novel interactive and closed-loop architecture to simultaneously model interactive reinforcement learning (IRL) and decision tree feedback (DTF). Specifically, IRL is to create an interactive feature selection loop and DTF is to feed structured feature knowledge back to the loop. First, the tree-structured feature hierarchy from decision tree is leveraged to improve state representation. In particular, we represent the selected feature subset as an undirected graph of feature-feature correlations and a directed tree of decision features. We propose a new embedding method capable of empowering graph convolutional network to jointly learn state representation from both the graph and the tree. Second, the tree-structured feature hierarchy is exploited to develop a new reward scheme. In particular, we personalize reward assignment of agents based on decision tree feature importance. In addition, observing agents' actions can be feedback, we devise another reward scheme, to weigh and assign reward based on the feature selected frequency ratio in historical action records. Finally, we present extensive experiments on real-world datasets to show the improved performance.

READ FULL TEXT

page 9

page 10

page 11

research
08/27/2020

AutoFS: Automated Feature Selection via Diversity-aware Interactive Reinforcement Learning

In this paper, we study the problem of balancing effectiveness and effic...
research
05/12/2022

Feature and Instance Joint Selection: A Reinforcement Learning Perspective

Feature selection and instance selection are two important techniques of...
research
12/31/2020

Automatic Historical Feature Generation through Tree-based Method in Ads Prediction

Historical features are important in ads click-through rate (CTR) predic...
research
09/19/2020

Simplifying Reinforced Feature Selection via Restructured Choice Strategy of Single Agent

Feature selection aims to select a subset of features to optimize the pe...
research
09/29/2021

Efficient Reinforced Feature Selection via Early Stopping Traverse Strategy

In this paper, we propose a single-agent Monte Carlo based reinforced fe...
research
08/11/2020

Network Intrusion Detection Using Wrapper-based Decision Tree for Feature Selection

One of the key challenges of machine learning (ML) based intrusion detec...
research
06/12/2020

Generalizing Gain Penalization for Feature Selection in Tree-based Models

We develop a new approach for feature selection via gain penalization in...

Please sign up or login with your details

Forgot password? Click here to reset