Similarity Based Stratified Splitting: an approach to train better classifiers

10/13/2020
by   Felipe Farias, et al.
0

We propose a Similarity-Based Stratified Splitting (SBSS) technique, which uses both the output and input space information to split the data. The splits are generated using similarity functions among samples to place similar samples in different splits. This approach allows for a better representation of the data in the training phase. This strategy leads to a more realistic performance estimation when used in real-world applications. We evaluate our proposal in twenty-two benchmark datasets with classifiers such as Multi-Layer Perceptron, Support Vector Machine, Random Forest and K-Nearest Neighbors, and five similarity functions Cityblock, Chebyshev, Cosine, Correlation, and Euclidean. According to the Wilcoxon Sign-Rank test, our approach consistently outperformed ordinary stratified 10-fold cross-validation in 75% of the assessed scenarios.

READ FULL TEXT

page 4

page 8

page 9

research
09/17/2023

Using Artificial Neural Networks to Determine Ontologies Most Relevant to Scientific Texts

This paper provides an insight into the possibility of how to find ontol...
research
01/05/2018

Tree based classification of tabla strokes

The paper attempts to validate the effectiveness of tree classifiers to ...
research
08/16/2022

Ex-Ante Assessment of Discrimination in Dataset

Data owners face increasing liability for how the use of their data coul...
research
12/23/2019

Customers Churn Prediction in Financial Institution Using Artificial Neural Network

In this study, a predictive model using Multi-layer Perceptron of Artifi...
research
05/22/2019

Augmenting Physiological Time Series Data: A Case Study for Sleep Apnea Detection

Supervised machine learning applications in the health domain often face...
research
06/22/2016

Personalized Prognostic Models for Oncology: A Machine Learning Approach

We have applied a little-known data transformation to subsets of the Sur...
research
03/30/2021

Using Artificial Intelligence to Shed Light on the Star of Biscuits: The Jaffa Cake

Before Brexit, one of the greatest causes of arguments amongst British f...

Please sign up or login with your details

Forgot password? Click here to reset