Dynamic Feature Scaling for Online Learning of Binary Classifiers

07/28/2014
by   Danushka Bollegala, et al.
0

Scaling feature values is an important step in numerous machine learning tasks. Different features can have different value ranges and some form of a feature scaling is often required in order to learn an accurate classifier. However, feature scaling is conducted as a preprocessing task prior to learning. This is problematic in an online setting because of two reasons. First, it might not be possible to accurately determine the value range of a feature at the initial stages of learning when we have observed only a few number of training instances. Second, the distribution of data can change over the time, which render obsolete any feature scaling that we perform in a pre-processing step. We propose a simple but an effective method to dynamically scale features at train time, thereby quickly adapting to any changes in the data stream. We compare the proposed dynamic feature scaling method against more complex methods for estimating scaling parameters using several benchmark datasets for binary classification. Our proposed feature scaling method consistently outperforms more complex methods on all of the benchmark datasets and improves classification accuracy of a state-of-the-art online binary classifier algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/02/2017

Adaptive Scaling

Preprocessing data is an important step before any data analysis. In thi...
research
09/14/2016

Very Simple Classifier: a Concept Binary Classifier toInvestigate Features Based on Subsampling and Localility

We propose Very Simple Classifier (VSC) a novel method designed to incor...
research
08/25/2020

Evaluating Nonlinear Decision Trees for Binary Classification Tasks with Other Existing Methods

Classification of datasets into two or more distinct classes is an impor...
research
03/13/2019

Online Budgeted Learning for Classifier Induction

In real-world machine learning applications, there is a cost associated ...
research
05/26/2019

Deep Online Learning with Stochastic Constraints

Deep learning models are considered to be state-of-the-art in many offli...
research
04/28/2023

Online Platt Scaling with Calibeating

We present an online post-hoc calibration method, called Online Platt Sc...
research
03/19/2021

Empirical Analysis of Machine Learning Configurations for Prediction of Multiple Organ Failure in Trauma Patients

Multiple organ failure (MOF) is a life-threatening condition. Due to its...

Please sign up or login with your details

Forgot password? Click here to reset