A Topological Data Analysis Based Classifier

11/09/2021
by Rolando Kindelan, et al.

Topological Data Analysis (TDA) is an emerging field that aims to discover topological information hidden in a dataset. TDA tools have commonly been used to create filters and topological descriptors that improve Machine Learning (ML) methods. This paper proposes an algorithm that applies TDA directly to multi-class classification problems, without any further ML stage, and shows advantages on imbalanced datasets. The proposed algorithm builds a filtered simplicial complex on the dataset. Persistent Homology (PH) then guides the selection of a sub-complex in which each unlabeled point receives the label with the majority of votes from its labeled neighboring points. We evaluate the method on 8 datasets that differ in dimensionality, degree of class overlap, and number of samples per class. On average, the proposed TDA-based classifier (TDABC) performed better than KNN and weighted KNN. On balanced datasets it is competitive with the Local SVM and Random Forest baseline classifiers, and it outperforms all baseline methods when classifying entangled and minority classes.
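
The voting step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a fixed scale `epsilon` in place of the PH-guided sub-complex selection, uses only the edges (1-skeleton) of a Vietoris-Rips complex, and counts every labeled neighbor as one unweighted vote. The names `rips_edges` and `vote_labels` are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import dist


def rips_edges(points, epsilon):
    """Edges of a Vietoris-Rips complex at scale epsilon (assumed fixed here)."""
    return {
        (i, j)
        for i, j in combinations(range(len(points)), 2)
        if dist(points[i], points[j]) <= epsilon
    }


def vote_labels(points, labels, epsilon):
    """Give each unlabeled point (label None) the majority label of its labeled neighbors."""
    neighbors = {i: set() for i in range(len(points))}
    for i, j in rips_edges(points, epsilon):
        neighbors[i].add(j)
        neighbors[j].add(i)

    predicted = list(labels)
    for i, label in enumerate(labels):
        if label is not None:
            continue
        # Only labeled neighbors vote; unlabeled neighbors are ignored.
        votes = Counter(labels[j] for j in neighbors[i] if labels[j] is not None)
        if votes:
            predicted[i] = votes.most_common(1)[0][0]
    return predicted


# Toy data: two clusters, with one unlabeled point near each.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (1.0, 1.0), (1.1, 1.0),
          (0.05, 0.05), (1.05, 1.05)]
labels = ["a", "a", "a", "b", "b", None, None]
predicted = vote_labels(points, labels, epsilon=0.3)
print(predicted)  # ['a', 'a', 'a', 'b', 'b', 'a', 'b']
```

A point with no labeled neighbor at the chosen scale keeps the label `None` in this sketch. In the paper's setting, the scale would instead come from the persistent homology of the filtered complex.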


