Extracting Rules from Neural Networks with Partial Interpretations

04/01/2022
by   Cosimo Persia, et al.
0

We investigate the problem of extracting rules, expressed in Horn logic, from neural network models. Our work is based on the exact learning model, in which a learner interacts with a teacher (the neural network model) via queries in order to learn an abstract target concept, which in our case is a set of Horn rules. We consider partial interpretations to formulate the queries. These can be understood as a representation of the world where part of the knowledge regarding the truthiness of propositions is unknown. We employ Angluin s algorithm for learning Horn rules via queries and evaluate our strategy empirically.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/20/2023

Learning Horn Envelopes via Queries from Large Language Models

We investigate an approach for extracting knowledge from trained neural ...
research
12/24/2020

Weighted defeasible knowledge bases and a multipreference semantics for a deep neural network model

In this paper we investigate the relationships between a multipreferenti...
research
12/14/2021

Answering Queries with Negation over Existential Rules

Ontology-based query answering with existential rules is well understood...
research
10/02/2011

Eclectic Extraction of Propositional Rules from Neural Networks

Artificial Neural Network is among the most popular algorithm for superv...
research
03/10/2020

Cryptanalytic Extraction of Neural Network Models

We argue that the machine learning problem of model extraction is actual...
research
03/27/2021

Explaining the Road Not Taken

It is unclear if existing interpretations of deep neural network models ...
research
08/26/2020

How Useful Are the Machine-Generated Interpretations to General Users? A Human Evaluation on Guessing the Incorrectly Predicted Labels

Explaining to users why automated systems make certain mistakes is impor...

Please sign up or login with your details

Forgot password? Click here to reset