Concept-based explainability for an EEG transformer model

07/24/2023
by Anders Gjølbye Madsen, et al.

Deep learning models are complex due to their size, their structure, and the inherent randomness of their training procedures. Additional complexity arises from the selection of datasets and inductive biases. To address these challenges for explainability, Kim et al. (2018) introduced Concept Activation Vectors (CAVs), which aim to explain a deep model's internal states in terms of human-aligned concepts. These concepts correspond to directions in latent space, identified using linear discriminants. Although the method was first applied to image classification, it was later adapted to other domains, including natural language processing. In this work, we attempt to apply the method to electroencephalogram (EEG) data in order to explain BENDR (Kostas et al., 2021), a large-scale transformer model. A crucial part of this endeavor is defining the explanatory concepts and selecting relevant datasets to ground those concepts in the latent space. We focus on two mechanisms for EEG concept formation: the use of externally labeled EEG datasets, and the application of anatomically defined concepts. The former is a straightforward generalization of methods used in image classification, while the latter is novel and specific to EEG. We present evidence that both approaches to concept formation yield valuable insights into the representations learned by deep EEG models.
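
The abstract describes concepts as directions in latent space identified using linear discriminants. As a rough illustration only, the sketch below fits a Fisher linear discriminant between two sets of activation vectors and normalizes the result into a concept direction. Everything in it is an assumption rather than the authors' pipeline: the activations are synthetic Gaussian data standing in for latent vectors (no BENDR model or EEG data is involved), and the function name `fisher_cav`, the `ridge` parameter, and the dimensions are hypothetical.

```python
import numpy as np


def fisher_cav(concept_acts, random_acts, ridge=1e-3):
    """Return a unit-norm direction separating concept from random activations.

    Hypothetical helper: uses a Fisher linear discriminant as one possible
    choice of linear discriminant; not taken from the paper.
    """
    mu_c = concept_acts.mean(axis=0)
    mu_r = random_acts.mean(axis=0)
    # Pooled within-class scatter, regularized so the linear solve is stable.
    centered = np.vstack([concept_acts - mu_c, random_acts - mu_r])
    scatter = centered.T @ centered / len(centered)
    scatter += ridge * np.eye(scatter.shape[0])
    direction = np.linalg.solve(scatter, mu_c - mu_r)
    return direction / np.linalg.norm(direction)


# Synthetic stand-in for latent activations (assumption: 16-dim vectors).
rng = np.random.default_rng(0)
dim = 16
true_direction = np.zeros(dim)
true_direction[0] = 1.0

random_acts = rng.normal(size=(200, dim))
# "Concept" examples are shifted along one known axis of the latent space.
concept_acts = rng.normal(size=(200, dim)) + 3.0 * true_direction

cav = fisher_cav(concept_acts, random_acts)

# Cosine similarity between the recovered direction and the planted one.
alignment = float(cav @ true_direction)

# Projecting activations onto the direction gives a per-example concept score.
concept_scores = concept_acts @ cav
random_scores = random_acts @ cav
```

In this toy setting the recovered direction should align closely with the planted axis, and concept examples should project higher than random ones. With a real model, the two activation sets would instead come from a chosen layer evaluated on concept examples and on reference examples.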


