Local Multi-Label Explanations for Random Forest

07/05/2022
by   Nikolaos Mylonas, et al.
0

Multi-label classification is a challenging task, particularly in domains where the number of labels to be predicted is large. Deep neural networks are often effective at multi-label classification of images and textual data. When dealing with tabular data, however, conventional machine learning algorithms, such as tree ensembles, appear to outperform competition. Random forest, being a popular ensemble algorithm, has found use in a wide range of real-world problems. Such problems include fraud detection in the financial domain, crime hotspot detection in the legal sector, and in the biomedical field, disease probability prediction when patient records are accessible. Since they have an impact on people's lives, these domains usually require decision-making systems to be explainable. Random Forest falls short on this property, especially when a large number of tree predictors are used. This issue was addressed in a recent research named LionForests, regarding single label classification and regression. In this work, we adapt this technique to multi-label classification problems, by employing three different strategies regarding the labels that the explanation covers. Finally, we provide a set of qualitative and quantitative experiments to assess the efficacy of this approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2022

Explaining random forest prediction through diverse rulesets

Tree-ensemble algorithms, such as random forest, are effective machine l...
research
09/08/2018

Multi-label Classification of User Reactions in Online News

The increase in the number of Internet users and the strong interaction ...
research
11/15/2019

Multi-Label Learning with Deep Forest

In multi-label learning, each instance is associated with multiple label...
research
04/13/2021

Conclusive Local Interpretation Rules for Random Forests

In critical situations involving discrimination, gender inequality, econ...
research
11/03/2020

Deep tree-ensembles for multi-output prediction

Recently, deep neural networks have expanded the state-of-art in various...
research
07/19/2022

Semi-supervised Predictive Clustering Trees for (Hierarchical) Multi-label Classification

Semi-supervised learning (SSL) is a common approach to learning predicti...
research
05/02/2022

Using Machine Learning to Evaluate Real Estate Prices Using Location Big Data

With everyone trying to enter the real estate market nowadays, knowing t...

Please sign up or login with your details

Forgot password? Click here to reset