Data Segmentation via t-SNE, DBSCAN, and Random Forest

10/26/2020
by   Timothy DeLise, et al.
0

This research proposes a data segmentation technique which is easy to interpret and generalizes well. The technique combines t-SNE, DBSCAN, and Random Forest classifier algorithms to form an end-to-end pipeline that separates data into natural clusters and produces a characteristic profile of each cluster based on the most important features. Out-of-sample cluster labels can be inferred, and the technique generalizes well on real data sets. We describe the algorithm and provide case studies using the Iris and MNIST data sets, as well as real social media site data from Instagram. The main contributions of this work are the explicit identification of clusters from a t-SNE embedding, the cluster profiles, and the treatment of how these clusters generalize to out-of-sample data.

READ FULL TEXT

page 4

page 5

research
03/14/2019

On the Use of Random Forest for Two-Sample Testing

We follow the line of using classifiers for two-sample testing and propo...
research
05/17/2021

Cross-Cluster Weighted Forests

Adapting machine learning algorithms to better handle the presence of na...
research
04/05/2020

Unsupervised and Supervised Learning with the Random Forest Algorithm for Traffic Scenario Clustering and Classification

The goal of this paper is to provide a method, which is able to find cat...
research
03/24/2020

Tree Index: A New Cluster Evaluation Technique

We introduce a cluster evaluation technique called Tree Index. Our Tree ...
research
11/09/2015

Spatially Coherent Random Forests

Spatially Coherent Random Forest (SCRF) extends Random Forest to create ...
research
07/29/2016

Authorship Verification - An Approach based on Random Forest

Authorship attribution, being an important problem in many areas in-clud...
research
04/05/2020

An Unsupervised Random Forest Clustering Technique for Automatic Traffic Scenario Categorization

A modification of the Random Forest algorithm for the categorization of ...

Please sign up or login with your details

Forgot password? Click here to reset