Can LLMs facilitate interpretation of pre-trained language models?

05/22/2023
by   Basel Mousi, et al.
0

Work done to uncover the knowledge encoded within pre-trained language models, rely on annotated corpora or human-in-the-loop methods. However, these approaches are limited in terms of scalability and the scope of interpretation. We propose using a large language model, ChatGPT, as an annotator to enable fine-grained interpretation analysis of pre-trained language models. We discover latent concepts within pre-trained language models by applying hierarchical clustering over contextualized representations and then annotate these concepts using GPT annotations. Our findings demonstrate that ChatGPT produces accurate and semantically richer annotations compared to human-annotated concepts. Additionally, we showcase how GPT-based annotations empower interpretation analysis methodologies of which we demonstrate two: probing framework and neuron interpretation. To facilitate further exploration and experimentation in this field, we have made available a substantial ConceptNet dataset comprising 39,000 annotated latent concepts.

READ FULL TEXT

page 3

page 5

page 13

page 16

research
06/27/2022

Analyzing Encoded Concepts in Transformer Language Models

We propose a novel framework ConceptX, to analyze how latent concepts ar...
research
08/20/2023

Scaled-up Discovery of Latent Concepts in Deep NLP Models

Pre-trained language models (pLMs) learn intricate patterns and contextu...
research
11/12/2022

ConceptX: A Framework for Latent Concept Analysis

The opacity of deep neural networks remains a challenge in deploying sol...
research
10/05/2022

COMPS: Conceptual Minimal Pair Sentences for testing Property Knowledge and Inheritance in Pre-trained Language Models

A characteristic feature of human semantic memory is its ability to not ...
research
08/31/2021

It's not Rocket Science : Interpreting Figurative Language in Narratives

Figurative language is ubiquitous in English. Yet, the vast majority of ...
research
05/05/2022

Assistive Recipe Editing through Critiquing

There has recently been growing interest in the automatic generation of ...
research
04/12/2022

Mining Logical Event Schemas From Pre-Trained Language Models

We present NESL (the Neuro-Episodic Schema Learner), an event schema lea...

Please sign up or login with your details

Forgot password? Click here to reset