Green Accelerated Hoeffding Tree

05/06/2022
by Eva García-Martín, et al.

State-of-the-art machine learning solutions mainly focus on creating highly accurate models without constraints on hardware resources. Stream mining algorithms, by contrast, are designed to run on resource-constrained devices, so low power consumption and energy and memory efficiency are essential. The Hoeffding tree algorithm is able to create energy-efficient models, but its trees are less accurate than their ensemble counterparts. Ensembles of Hoeffding trees, on the other hand, create a highly accurate forest of trees but consume five times more energy on average. The Extremely Fast Decision Tree (EFDT) is an extension that tried to obtain results similar to those of ensembles of Hoeffding trees. This paper presents the Green Accelerated Hoeffding Tree (GAHT) algorithm, an extension of the EFDT algorithm with a lower energy and memory footprint and the same (or, for some datasets, higher) accuracy. GAHT grows the tree by setting individual splitting criteria for each node, based on the distribution of the number of instances over each particular leaf. The results show that GAHT achieves accuracy competitive with EFDT and ensembles of Hoeffding trees while reducing energy consumption by up to 70%.
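
For readers unfamiliar with the split decision that Hoeffding trees rely on, the following is a minimal sketch of the standard Hoeffding bound test, not of GAHT itself. The abstract does not specify GAHT's per-node criteria, so nothing here should be read as the authors' method. The function names, the `tie_threshold` parameter, and the example values are assumptions chosen for illustration.

```python
import math


def hoeffding_bound(value_range: float, delta: float, n: int) -> float:
    """Standard Hoeffding bound: epsilon = sqrt(R^2 * ln(1/delta) / (2n)).

    value_range (R) is the range of the split metric, delta is the allowed
    error probability, and n is the number of instances seen at the leaf.
    """
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def should_split(best_gain: float, second_gain: float, value_range: float,
                 delta: float, n: int, tie_threshold: float = 0.05) -> bool:
    """Illustrative split test (hypothetical helper, not from the paper).

    Split when the best attribute beats the runner-up by more than epsilon,
    or when epsilon is so small that the two are effectively tied.
    """
    eps = hoeffding_bound(value_range, delta, n)
    return (best_gain - second_gain) > eps or eps < tie_threshold


# The bound tightens as a leaf observes more instances.
eps_small_n = hoeffding_bound(1.0, 1e-7, 200)
eps_large_n = hoeffding_bound(1.0, 1e-7, 2000)
```

Because the bound depends on the number of instances `n` observed at a leaf, a leaf that receives few instances needs a larger gap between candidate attributes before it can split. This is one way to see why the distribution of instances over leaves matters for when and where a tree grows.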

