Distance Functions and Normalization Under Stream Scenarios

06/30/2023
by   Eduardo V. L. Barboza, et al.
0

Data normalization is an essential task when modeling a classification system. When dealing with data streams, data normalization becomes especially challenging since we may not know in advance the properties of the features, such as their minimum/maximum values, and these properties may change over time. We compare the accuracies generated by eight well-known distance functions in data streams without normalization, normalized considering the statistics of the first batch of data received, and considering the previous batch received. We argue that experimental protocols for streams that consider the full stream as normalized are unrealistic and can lead to biased and poor results. Our results indicate that using the original data stream without applying normalization, and the Canberra distance, can be a good combination when no information about the data stream is known beforehand.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2019

Adaptive Normalization in Streaming Data

In todays digital era, data are everywhere from Internet of Things to he...
research
08/15/2019

Double-Coupling Learning for Multi-Task Data Stream Classification

Data stream classification methods demonstrate promising performance on ...
research
04/21/2023

Integrating Per-Stream Stat Tracking into Accel-Sim

Accel-Sim is a widely used computer architecture simulator that models t...
research
05/03/2023

Stream Efficient Learning

Data in many real-world applications are often accumulated over time, li...
research
12/15/2021

Simultaneous Monitoring of a Large Number of Heterogeneous Categorical Data Streams

This article proposes a powerful scheme to monitor a large number of cat...
research
02/27/2019

Regularity Normalization: Constraining Implicit Space with Minimum Description Length

Inspired by the adaptation phenomenon of biological neuronal firing rate...
research
12/27/2021

Self-normalized Classification of Parkinson's Disease DaTscan Images

Classifying SPECT images requires a preprocessing step which normalizes ...

Please sign up or login with your details

Forgot password? Click here to reset