Post-hoc Concept Bottleneck Models

05/31/2022
by   Mert Yüksekgönül, et al.
0

Concept Bottleneck Models (CBMs) map the inputs onto a set of interpretable concepts (“the bottleneck”) and use the concepts to make predictions. A concept bottleneck enhances interpretability since it can be investigated to understand what concepts the model "sees" in an input and which of these concepts are deemed important. However, CBMs are restrictive in practice as they require concept labels in the training data to learn the bottleneck and do not leverage strong pretrained models. Moreover, CBMs often do not match the accuracy of an unrestricted neural network, reducing the incentive to deploy them in practice. In this work, we address the limitations of CBMs by introducing Post-hoc Concept Bottleneck models (PCBMs). We show that we can turn any neural network into a PCBM without sacrificing model performance while still retaining interpretability benefits. When concept annotation is not available on the training data, we show that PCBM can transfer concepts from other datasets or from natural language descriptions of concepts. PCBM also enables users to quickly debug and update the model to reduce spurious correlations and improve generalization to new (potentially different) data. Through a model-editing user study, we show that editing PCBMs via concept-level feedback can provide significant performance gains without using any data from the target domain or model retraining.

READ FULL TEXT
research
05/10/2021

Do Concept Bottleneck Models Learn as Intended?

Concept bottleneck models map from raw inputs to concepts, and then from...
research
09/19/2022

Concept Embedding Models

Deploying AI-powered systems requires trustworthy models supporting effe...
research
08/25/2023

Learning to Intervene on Concept Bottlenecks

While traditional deep learning models often lack interpretability, conc...
research
08/24/2023

Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions

Variational Information Pursuit (V-IP) is a framework for making interpr...
research
04/12/2023

Label-Free Concept Bottleneck Models

Concept bottleneck models (CBM) are a popular way of creating more inter...
research
05/20/2023

Collaborative Development of NLP models

Despite substantial advancements, Natural Language Processing (NLP) mode...
research
11/07/2022

Towards learning to explain with concept bottleneck models: mitigating information leakage

Concept bottleneck models perform classification by first predicting whi...

Please sign up or login with your details

Forgot password? Click here to reset