Mitigating Prior Errors in Causal Structure Learning: Towards LLM driven Prior Knowledge

06/12/2023
by   Lyuzhou Chen, et al.
0

Causal structure learning, a prominent technique for encoding cause and effect relationships among variables, through Bayesian Networks (BNs). Merely recovering causal structures from real-world observed data lacks precision, while the development of Large Language Models (LLM) is opening a new frontier of causality. LLM presents strong capability in discovering causal relationships between variables with the "text" inputs defining the investigated variables, leading to a potential new hierarchy and new ladder of causality. We aim an critical issue in the emerging topic of LLM based causal structure learning, to tackle erroneous prior causal statements from LLM, which is seldom considered in the current context of expert dominating prior resources. As a pioneer attempt, we propose a BN learning strategy resilient to prior errors without need of human intervention. Focusing on the edge-level prior, we classify the possible prior errors into three types: order-consistent, order-reversed, and irrelevant, and provide their theoretical impact on the Structural Hamming Distance (SHD) under the presumption of sufficient data. Intriguingly, we discover and prove that only the order-reversed error contributes to an increase in a unique acyclic closed structure, defined as a "quasi-circle". Leveraging this insight, a post-hoc strategy is employed to identify the order-reversed prior error by its impact on the increment of "quasi-circles". Through empirical evaluation on both real and synthetic datasets, we demonstrate our strategy's robustness against prior errors. Specifically, we highlight its substantial ability to resist order-reversed errors while maintaining the majority of correct prior knowledge.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2023

From Query Tools to Causal Architects: Harnessing Large Language Models for Advanced Causal Discovery from Data

Large Language Models (LLMs) exhibit exceptional abilities for causal an...
research
05/12/2021

Bayesian Model Averaging for Data Driven Decision Making when Causality is Partially Known

Probabilistic machine learning models are often insufficient to help wit...
research
02/14/2012

Discovering causal structures in binary exclusive-or skew acyclic models

Discovering causal relations among observed variables in a given data se...
research
09/28/2020

CASTLE: Regularization via Auxiliary Causal Graph Discovery

Regularization improves generalization of supervised models to out-of-sa...
research
09/18/2023

Causal Discovery and Prediction: Methods and Algorithms

We are not only observers but also actors of reality. Our capability to ...
research
03/21/2022

Learning latent causal relationships in multiple time series

Identifying the causal structure of systems with multiple dynamic elemen...
research
11/26/2021

Enforcing and Discovering Structure in Machine Learning

The world is structured in countless ways. It may be prudent to enforce ...

Please sign up or login with your details

Forgot password? Click here to reset