Data Dimension Reduction makes ML Algorithms efficient

11/17/2022
by Wisal Khan, et al.

Data dimension reduction (DDR) maps data from high dimensions to low dimensions. Various DDR techniques are used for image dimension reduction, such as Random Projections, Principal Component Analysis (PCA), the Variance approach, LSA-Transform, the Combined and Direct approaches, and the New Random Approach. Auto-encoders (AE) are used to learn an end-to-end mapping. In this paper, we demonstrate that DDR as a pre-processing step not only speeds up the algorithms but also improves accuracy in both supervised and unsupervised learning. We first use PCA-based DDR for supervised learning, then explore AE-based DDR for unsupervised learning. For PCA-based DDR, we compare the accuracy and running time of supervised learning algorithms before and after applying PCA. Similarly, for AE-based DDR, we compare the accuracy and running time of an unsupervised learning algorithm before and after AE representation learning. The supervised learning algorithms are support-vector machines (SVM), Decision Tree with the Gini index, Decision Tree with entropy, and the Stochastic Gradient Descent classifier (SGDC); the unsupervised learning algorithm is K-means clustering. All are used for classification. We use two datasets, MNIST and FashionMNIST. Our experiments show a massive improvement in accuracy and a reduction in running time after pre-processing in both supervised and unsupervised learning.
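The before-and-after comparison described for PCA-based DDR can be sketched as follows. This is a minimal illustration, not the authors' experimental code: it assumes scikit-learn, substitutes the small bundled `load_digits` dataset for MNIST so that it runs without a download, and picks 16 principal components arbitrarily. The helper names `make_models`, `fit_and_score`, and `compare` are hypothetical, and every hyperparameter is a library default rather than a setting taken from the paper.

```python
import time

from sklearn.datasets import load_digits
from sklearn.decomposition import PCA
from sklearn.linear_model import SGDClassifier
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier


def make_models():
    """Fresh instances of the four supervised classifiers named in the abstract."""
    return {
        "SVM": SVC(),
        "Decision Tree (gini)": DecisionTreeClassifier(criterion="gini", random_state=0),
        "Decision Tree (entropy)": DecisionTreeClassifier(criterion="entropy", random_state=0),
        "SGDC": SGDClassifier(random_state=0),
    }


def fit_and_score(model, X_train, y_train, X_test, y_test):
    """Return (test accuracy, training time in seconds) for one classifier."""
    start = time.perf_counter()
    model.fit(X_train, y_train)
    elapsed = time.perf_counter() - start
    return model.score(X_test, y_test), elapsed


def compare(n_components=16):
    """Train each classifier on raw and on PCA-reduced features."""
    # Assumption: 8x8 digit images (64 features) stand in for MNIST.
    X, y = load_digits(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=0, stratify=y
    )

    # Standardise using statistics from the training split only.
    scaler = StandardScaler().fit(X_train)
    X_train, X_test = scaler.transform(X_train), scaler.transform(X_test)

    # Fit PCA on the training split, then project both splits.
    pca = PCA(n_components=n_components, random_state=0).fit(X_train)
    X_train_red, X_test_red = pca.transform(X_train), pca.transform(X_test)

    results = {}
    raw_models, pca_models = make_models(), make_models()
    for name in raw_models:
        results[name] = {
            "raw": fit_and_score(raw_models[name], X_train, y_train, X_test, y_test),
            "pca": fit_and_score(pca_models[name], X_train_red, y_train, X_test_red, y_test),
        }
    return results


if __name__ == "__main__":
    for name, res in compare().items():
        (acc_raw, t_raw), (acc_pca, t_pca) = res["raw"], res["pca"]
        print(f"{name}: raw acc={acc_raw:.3f} time={t_raw:.4f}s | "
              f"pca acc={acc_pca:.3f} time={t_pca:.4f}s")
```

Fitting the scaler and PCA on the training split only keeps information from the test split out of the projection. On a dataset this small the timings are noisy, so any speed or accuracy difference it prints should not be read as reproducing the paper's results.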
