Can the Inference Logic of Large Language Models be Disentangled into Symbolic Concepts?

04/03/2023
by   Wen Shen, et al.

In this paper, we explain the inference logic of large language models (LLMs) as a set of symbolic concepts. Many recent studies have found that traditional DNNs usually encode sparse symbolic concepts. However, because an LLM has many more parameters than traditional DNNs, whether an LLM also encodes sparse symbolic concepts remains an open problem. We therefore propose to disentangle the inference score of LLMs for dialogue tasks into a small number of symbolic concepts. We verify that these sparse concepts can accurately estimate the inference scores of the LLM on all arbitrarily masked states of the input sentence. We also evaluate the transferability of the concepts encoded by an LLM and verify that symbolic concepts usually exhibit high transferability across similar input sentences. More crucially, these symbolic concepts can be used to explain the exact reasons for the LLM's prediction errors.
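
The abstract does not spell out how an inference score is split into concepts. As a minimal sketch, assume each concept is a Harsanyi-dividend interaction over a subset of input positions, the definition commonly used in related work on symbolic concepts; whether this paper uses exactly that form is an assumption here. The function `toy_score` is a hypothetical stand-in for an LLM's scalar score on a masked input, not a real model, and all names below are illustrative.

```python
from itertools import combinations


def subsets(items):
    """Yield every subset of `items` as a frozenset."""
    items = list(items)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)


def toy_score(kept):
    """Hypothetical stand-in for an LLM's scalar inference score when
    only the input positions in `kept` are left unmasked."""
    score = 0.5                      # score with everything masked
    if 0 in kept:
        score += 1.0                 # effect of position 0 alone
    if {1, 2} <= kept:
        score += 2.0                 # effect needing positions 1 AND 2
    if {0, 2, 3} <= kept:
        score -= 0.75                # effect needing positions 0, 2 AND 3
    return score


def harsanyi_interactions(score_fn, n):
    """Assumed concept definition:
    I(S) = sum over T subset of S of (-1)^(|S| - |T|) * v(T)."""
    return {
        s: sum((-1) ** (len(s) - len(t)) * score_fn(t) for t in subsets(s))
        for s in subsets(range(n))
    }


def reconstruct(interactions, kept):
    """Estimate the score on a masked input by summing the interactions
    whose positions are all unmasked."""
    return sum(value for s, value in interactions.items() if s <= kept)


N = 4  # number of input positions in the toy sentence
interactions = harsanyi_interactions(toy_score, N)

# Keep only the non-negligible interactions: the sparse "concepts".
salient = {s: v for s, v in interactions.items() if abs(v) > 1e-9}

# Check the sparse set against the score on every one of the 2^N masks.
max_error = max(
    abs(reconstruct(salient, kept) - toy_score(kept))
    for kept in subsets(range(N))
)

print(len(interactions), "candidate subsets,", len(salient), "salient")
print("max reconstruction error:", max_error)
```

The exhaustive loop over all 2^N masks is only feasible for very short inputs; it is used here purely to make the "few concepts reproduce every masked score" claim concrete on a toy function.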
