Assessing the trade-off between prediction accuracy and interpretability for topic modeling on energetic materials corpora

06/01/2022
by Monica Puerto, et al.

As the amount and variety of energetics research increases, machine-aware topic identification is necessary to streamline future research pipelines. An automatic topic identification process consists of two steps: creating document representations and performing classification. Applying these steps to energetics research, however, poses new challenges. First, energetics datasets contain many scientific terms that are necessary to understand the context of a document but may require more complex document representations. Second, the predictions from classification must be understandable to, and trusted by, the chemists within the pipeline. In this work, we study the trade-off between prediction accuracy and interpretability by implementing three document embedding methods that vary in computational complexity. Alongside our accuracy results, we use local interpretable model-agnostic explanations (LIME) to provide a localized understanding of each prediction and to validate classifier decisions with our team of energetics experts. This study was carried out on a novel labeled energetics dataset created and validated by our team of energetics experts.
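
The pipeline outlined in the abstract (represent each document, classify it, then explain the individual prediction) can be illustrated with a minimal, standard-library-only sketch. Everything in it is an assumption made for illustration: the toy corpus and its two labels are invented and are not drawn from the authors' dataset, bag-of-words counts stand in for the three embedding methods (which the abstract does not name), a multinomial naive Bayes model stands in for the classifier, and the explanation step is a simple word-omission check rather than LIME itself, which instead fits a local surrogate model on perturbed samples.

```python
import math
from collections import Counter

# Toy, invented training snippets (NOT from the paper's dataset).
TRAIN = [
    ("detonation velocity of the explosive was measured", "performance"),
    ("detonation pressure and velocity increase with density", "performance"),
    ("synthesis of the compound by nitration of the precursor", "synthesis"),
    ("the precursor was purified after nitration and synthesis", "synthesis"),
]

def tokenize(text):
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()

def train_nb(samples):
    """Fit a multinomial naive Bayes model on bag-of-words counts."""
    word_counts = {}        # label -> Counter of word frequencies
    doc_counts = Counter()  # label -> number of training documents
    vocab = set()
    for text, label in samples:
        tokens = tokenize(text)
        word_counts.setdefault(label, Counter()).update(tokens)
        doc_counts[label] += 1
        vocab.update(tokens)
    return word_counts, doc_counts, vocab

def predict_proba(model, tokens):
    """Return {label: probability} using Laplace-smoothed likelihoods."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    log_scores = {}
    for label in doc_counts:
        total_words = sum(word_counts[label].values())
        score = math.log(doc_counts[label] / total_docs)
        for tok in tokens:
            if tok in vocab:  # words never seen in training are ignored
                score += math.log(
                    (word_counts[label][tok] + 1) / (total_words + len(vocab))
                )
        log_scores[label] = score
    # Normalize the log scores into probabilities.
    top = max(log_scores.values())
    exp_scores = {lab: math.exp(s - top) for lab, s in log_scores.items()}
    norm = sum(exp_scores.values())
    return {lab: v / norm for lab, v in exp_scores.items()}

def explain(model, text):
    """Word-omission explanation: how much does the predicted class
    probability drop when every occurrence of one word is removed?"""
    tokens = tokenize(text)
    probs = predict_proba(model, tokens)
    label = max(probs, key=probs.get)
    weights = {}
    for word in set(tokens):
        reduced = [t for t in tokens if t != word]
        weights[word] = probs[label] - predict_proba(model, reduced)[label]
    ranked = sorted(weights.items(), key=lambda kv: kv[1], reverse=True)
    return label, ranked

model = train_nb(TRAIN)
label, ranked = explain(model, "nitration of the precursor gave the compound")
print(label)
for word, weight in ranked:
    print(f"{word:12s} {weight:+.4f}")
```

A positive weight means that removing the word lowers the probability of the predicted topic, so the word supported the prediction. On a corpus this small, common words such as "the" can rank highly, which is exactly the kind of behavior a domain expert reviewing per-prediction explanations would want to catch.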

