SnapToGrid: From Statistical to Interpretable Models for Biomedical Information Extraction

We propose an approach for biomedical information extraction that combines the advantages of machine learning models, e.g., learning directly from data, with the benefits of rule-based approaches, e.g., interpretability. Our approach first trains a feature-based statistical model, then converts it to a rule-based variant by turning its features into rules and "snapping to grid" the feature weights, i.e., mapping them to discrete votes. In doing so, our proposal takes advantage of the large body of work in machine learning, yet it produces an interpretable model that experts can edit directly. We evaluate our approach on the BioNLP 2009 event extraction task. Our results show that converting the statistical model to rules incurs a small performance penalty, but the gain in interpretability compensates for it: with minimal effort, human experts improve the rule-based model until its performance is similar to that of the statistical model that served as the starting point.
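The "snapping to grid" step can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not a detail taken from the paper: the feature names, the grid step, the vote cap, the decision threshold, and the choice of plain rounding as the snapping rule.

```python
from typing import Dict, Iterable


def snap_to_grid(weights: Dict[str, float],
                 step: float = 1.0,
                 max_votes: int = 2) -> Dict[str, int]:
    """Map real-valued feature weights to integer votes.

    Assumed scheme: divide by a grid step, round to the nearest integer,
    clip to [-max_votes, max_votes], and drop features that snap to zero.
    """
    votes: Dict[str, int] = {}
    for feature, weight in weights.items():
        vote = round(weight / step)
        vote = max(-max_votes, min(max_votes, vote))
        if vote != 0:
            votes[feature] = vote
    return votes


def predict(votes: Dict[str, int],
            active_features: Iterable[str],
            threshold: int = 1) -> bool:
    """Fire when the votes of the matching rules reach the threshold."""
    total = sum(votes.get(feature, 0) for feature in active_features)
    return total >= threshold


# Hypothetical weights, as if read from a trained linear model.
learned_weights = {
    "trigger=phosphorylation": 1.8,
    "dep_path=nsubj": 0.9,
    "trigger=cell": -1.2,
    "token_is_capitalized": 0.2,
}

votes = snap_to_grid(learned_weights)

# An expert can read the vote table and edit it directly,
# for example by strengthening a negative rule.
edited_votes = dict(votes)
edited_votes["trigger=cell"] = -2
```

Under these assumptions the resulting model is a small table of rules with integer votes, which is what makes it readable and editable by hand: changing a prediction means changing one integer rather than retraining.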


