Comparison of Classification Methods for Very High-Dimensional Data in Sparse Random Projection Representation

12/18/2019
by   Anton Akusok, et al.
0

The big data trend has inspired feature-driven learning tasks, which cannot be handled by conventional machine learning models. Unstructured data produces very large binary matrices with millions of columns when converted to vector form. However, such data is often sparse, and hence can be manageable through the use of sparse random projections. This work studies efficient non-iterative and iterative methods suitable for such data, evaluating the results on two representative machine learning tasks with millions of samples and features. An efficient Jaccard kernel is introduced as an alternative to the sparse random projection. Findings indicate that non-iterative methods can find larger, more accurate models than iterative methods in different application scenarios.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/04/2021

Function Approximation via Sparse Random Features

Random feature methods have been successful in various machine learning ...
research
03/09/2022

SparseChem: Fast and accurate machine learning model for small molecules

SparseChem provides fast and accurate machine learning models for bioche...
research
06/29/2020

Binary Random Projections with Controllable Sparsity Patterns

Random projection is often used to project higher-dimensional vectors on...
research
07/27/2019

Modeling Winner-Take-All Competition in Sparse Binary Projections

Inspired by the advances in biological science, the study of sparse bina...
research
04/20/2016

Random Projection Estimation of Discrete-Choice Models with Large Choice Sets

We introduce sparse random projection, an important dimension-reduction ...
research
11/13/2017

3D Shape Classification Using Collaborative Representation based Projections

A novel 3D shape classification scheme, based on collaborative represent...
research
06/12/2016

Comparison of Several Sparse Recovery Methods for Low Rank Matrices with Random Samples

In this paper, we will investigate the efficacy of IMAT (Iterative Metho...

Please sign up or login with your details

Forgot password? Click here to reset