PACSET (Packed Serialized Trees): Reducing Inference Latency for Tree Ensemble Deployment

11/10/2020
by   Meghana Madhyastha, et al.
6

We present methods to serialize and deserialize tree ensembles that optimize inference latency when models are not already loaded into memory. This arises whenever models are larger than memory, but also systematically when models are deployed on low-resource devices, such as in the Internet of Things, or run as Web micro-services where resources are allocated on demand. Our packed serialized trees (PACSET) encode reference locality in the layout of a tree ensemble using principles from external memory algorithms. The layout interleaves correlated nodes across multiple trees, uses leaf cardinality to collocate the nodes on the most popular paths and is optimized for the I/O blocksize. The result is that each I/O yields a higher fraction of useful data, leading to a 2-6 times reduction in classification latency for interactive workloads.

READ FULL TEXT

page 3

page 5

page 6

page 9

research
06/19/2018

Forest Packing: Fast, Parallel Decision Forests

Machine learning has an emerging critical role in high-performance compu...
research
10/27/2020

Realization of Random Forest for Real-Time Evaluation through Tree Framing

The optimization of learning has always been of particular concern for b...
research
05/06/2022

Green Accelerated Hoeffding Tree

State-of-the-art machine learning solutions mainly focus on creating hig...
research
06/16/2023

Dynamic Decision Tree Ensembles for Energy-Efficient Inference on IoT Edge Nodes

With the increasing popularity of Internet of Things (IoT) devices, ther...
research
12/07/2021

Shrub Ensembles for Online Classification

Online learning algorithms have become a ubiquitous tool in the machine ...
research
06/14/2020

ResOT: Resource-Efficient Oblique Trees for Neural Signal Classification

Classifiers that can be implemented on chip with minimal computational a...
research
05/17/2023

A Scalable Method for Readable Tree Layouts

Large tree structures are ubiquitous and real-world relational datasets ...

Please sign up or login with your details

Forgot password? Click here to reset