MUC-driven Feature Importance Measurement and Adversarial Analysis for Random Forest

02/25/2022
by   Shucen Ma, et al.
0

The broad adoption of Machine Learning (ML) in security-critical fields demands the explainability of the approach. However, the research on understanding ML models, such as Random Forest (RF), is still in its infant stage. In this work, we leverage formal methods and logical reasoning to develop a novel model-specific method for explaining the prediction of RF. Our approach is centered around Minimal Unsatisfiable Cores (MUC) and provides a comprehensive solution for feature importance, covering local and global aspects, and adversarial sample analysis. Experimental results on several datasets illustrate the high quality of our feature importance measurement. We also demonstrate that our adversarial analysis outperforms the state-of-the-art method. Moreover, our method can produce a user-centered report, which helps provide recommendations in real-life applications.

READ FULL TEXT

page 12

page 14

research
11/14/2018

Probabilistic Random Forest: A machine learning algorithm for noisy datasets

Machine learning (ML) algorithms become increasingly important in the an...
research
05/11/2023

How to out-perform default random forest regression: choosing hyperparameters for applications in large-sample hydrology

Predictions are a central part of water resources research. Historically...
research
05/30/2023

Sensitivity Analysis of RF+clust for Leave-one-problem-out Performance Prediction

Leave-one-problem-out (LOPO) performance prediction requires machine lea...
research
10/08/2020

Exploring Sensitivity of ICF Outputs to Design Parameters in Experiments Using Machine Learning

Building a sustainable burn platform in inertial confinement fusion (ICF...
research
02/10/2021

Feature Analyses and Modelling of Lithium-ion Batteries Manufacturing based on Random Forest Classification

Lithium-ion battery manufacturing is a highly complicated process with s...
research
12/10/2020

A machine learning approach to galaxy properties: Joint redshift - stellar mass probability distributions with Random Forest

We demonstrate that highly accurate joint redshift - stellar mass PDFs c...
research
10/14/2014

Enhanced Random Forest with Image/Patch-Level Learning for Image Understanding

Image understanding is an important research domain in the computer visi...

Please sign up or login with your details

Forgot password? Click here to reset