FastForest: Increasing Random Forest Processing Speed While Maintaining Accuracy

04/06/2020
by Darren Yates, et al.

Random Forest remains one of data mining's most enduring ensemble algorithms, with well-documented accuracy and processing speed and a regular presence in new research. However, with data mining now reaching hardware-constrained devices such as smartphones and Internet of Things (IoT) devices, there is a continued need for research into algorithm efficiency that delivers greater processing speed without sacrificing accuracy. Our proposed FastForest algorithm delivers an average 24% increase in processing speed compared with Random Forest whilst matching (and frequently exceeding) its classification accuracy in tests involving 45 datasets. FastForest achieves this result through a combination of three optimising components: Subsample Aggregating ('Subbagging'), Logarithmic Split-Point Sampling and Dynamic Restricted Subspacing. Moreover, detailed testing of Subbagging sizes identified an optimal size scalar that balances processing performance and accuracy.
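As a rough illustration of the Subbagging idea named in the abstract, the sketch below draws, for each tree, a subsample of row indices without replacement that is smaller than the full training set (standard bagging would instead draw a full-size sample with replacement). This is a minimal sketch, not the FastForest implementation: the function name `subbag_indices`, its parameters, and the `fraction=0.5` default are assumptions made for illustration, and the default is a placeholder rather than the optimal scalar reported in the paper.

```python
import random


def subbag_indices(n_samples, n_trees, fraction=0.5, seed=0):
    """Draw one subsample of row indices per tree, without replacement.

    `fraction` is a hypothetical size scalar: each tree sees
    int(n_samples * fraction) distinct rows (at least one).
    """
    if not 0.0 < fraction <= 1.0:
        raise ValueError("fraction must be in (0, 1]")
    rng = random.Random(seed)
    size = max(1, int(n_samples * fraction))
    # random.Random.sample draws without replacement, so rows within
    # a single subsample never repeat.
    return [rng.sample(range(n_samples), size) for _ in range(n_trees)]


# Usage: three trees over a ten-row dataset, each trained on five rows.
subsets = subbag_indices(n_samples=10, n_trees=3, fraction=0.5, seed=42)
for i, rows in enumerate(subsets):
    print(f"tree {i}: trains on rows {sorted(rows)}")
```

Because each tree is built from fewer rows, the per-tree training cost falls; how small the subsample can be before accuracy suffers is the trade-off the abstract's Subbagging size testing addresses.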


research
01/10/2022

Application of Machine Learning-Based Pattern Recognition in IoT Devices: Review

The Internet of things (IoT) is a rapidly advancing area of technology t...
research
04/19/2018

A Dynamic Boosted Ensemble Learning Based on Random Forest

We propose Dynamic Boosted Random Forest (DBRF), a novel ensemble algori...
research
09/10/2022

IoT-Shield: A Novel DDoS Detection Approach for IoT-Based Devices

The widespread deployment of sensors and linked items contributes to the...
research
10/26/2020

Data Mining Ice Cubes

IceCube is a 1 km³ scale neutrino telescope located at the geographic So...
research
05/14/2019

Resource-aware Elastic Swap Random Forest for Evolving Data Streams

Continual learning based on data stream mining deals with ubiquitous sou...
research
06/10/2019

DataLearner: A Data Mining and Knowledge Discovery Tool for Android Smartphones and Tablets

Smartphones have become the ultimate 'personal' computer, yet despite th...
research
10/13/2020

Automation of Hemocompatibility Analysis Using Image Segmentation and a Random Forest

The hemocompatibility of blood-contacting medical devices remains one of...
