Diverse Online Feature Selection

06/12/2018
by   Chapman Siu, et al.
0

Online feature selection has been an active research area in recent years. We propose a novel diverse online feature selection method based on Determinantal Point Processes (DPP). Our model aims to provide diverse features which can be composed in either a supervised or unsupervised framework. The framework aims to promote diversity based on the kernel produced on a feature level, through at most three stages: feature sampling, local criteria and global criteria for feature selection. In the feature sampling, we sample incoming stream of features using conditional DPP. The local criteria is used to assess and select streamed features (i.e. only when they arrive), we use unsupervised scale invariant methods to remove redundant features and optionally supervised methods to introduce label information to assess relevant features. Lastly, the global criteria uses regularization methods to select a global optimal subset of features. This three stage procedure continues until there are no more features arriving or some predefined stopping condition is met. We demonstrate based on experiments conducted on that this approach yields better compactness, is comparable and in some instances outperforms other state-of-the-art online feature selection methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/18/2014

Online Group Feature Selection

Online feature selection with dynamic features has become an active rese...
research
08/21/2016

Online Feature Selection with Group Structure Analysis

Online selection of dynamic features has attracted intensive interest in...
research
07/05/2020

Block Model Guided Unsupervised Feature Selection

Feature selection is a core area of data mining with a recent innovation...
research
07/02/2021

Few-shot Learning for Unsupervised Feature Selection

We propose a few-shot learning method for unsupervised feature selection...
research
04/07/2020

Automatically Assessing Quality of Online Health Articles

The information ecosystem today is overwhelmed by an unprecedented quant...
research
01/25/2021

A Two-stage Framework for Compound Figure Separation

Scientific literature contains large volumes of complex, unstructured fi...
research
03/06/2023

Video traffic identification with novel feature extraction and selection method

In recent years, the rapid rise of video applications has led to an expl...

Please sign up or login with your details

Forgot password? Click here to reset