Causal Theories and Structural Data Representations for Improving Out-of-Distribution Classification

09/18/2023
by Donald Martin Jr., et al.

We consider how human-centered causal theories and tools from the dynamical systems literature can guide the representation of data when training neural networks for complex classification tasks. Specifically, we use simulated data to show that training a neural network with a data representation that makes explicit the invariant structural causal features of an epidemic system's data-generating process improves out-of-distribution (OOD) generalization on a classification task, compared with a more naive approach to data representation. We take these results to demonstrate that using human-generated causal knowledge to reduce the epistemic uncertainty of ML developers can lead to better-specified ML pipelines. This, in turn, points to the utility of a dynamical systems approach in the broader effort to improve the robustness and safety of machine learning systems through better ML system development practices.
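As a rough illustration of the contrast the abstract draws, the sketch below simulates a toy epidemic and builds two representations of the same trajectory. Everything here is an assumption made for illustration: the SIR compartmental model, the forward-Euler discretization, the parameter values, and the function names are hypothetical and are not taken from the paper, whose actual simulation and representation are not described in this abstract. The point is only that a feature derived from the causal structure (here, new infections divided by the mass-action term S·I/N) stays the same when the population size shifts, while raw infected counts do not.

```python
def simulate_sir(beta, gamma, population, initial_infected=1.0, steps=50, dt=1.0):
    """Forward-Euler discretization of a standard SIR model (illustrative only)."""
    s, i, r = population - initial_infected, initial_infected, 0.0
    trajectory = [(s, i, r)]
    for _ in range(steps):
        new_infections = beta * s * i / population * dt
        new_recoveries = gamma * i * dt
        s -= new_infections
        i += new_infections - new_recoveries
        r += new_recoveries
        trajectory.append((s, i, r))
    return trajectory


def naive_representation(trajectory):
    """Raw infected counts per step; these scale with population size."""
    return [i for _, i, _ in trajectory]


def structural_representation(trajectory, population, dt=1.0):
    """Per-step estimate of the transmission rate.

    Divides the observed drop in susceptibles by the mass-action term
    S*I/N, which is the assumed causal mechanism for new infections.
    """
    features = []
    for (s0, i0, _), (s1, _, _) in zip(trajectory, trajectory[1:]):
        new_infections = s0 - s1
        features.append(new_infections * population / (s0 * i0 * dt))
    return features


if __name__ == "__main__":
    # Same transmission dynamics, very different population sizes.
    small = simulate_sir(beta=0.3, gamma=0.1, population=1_000)
    large = simulate_sir(beta=0.3, gamma=0.1, population=1_000_000)

    print("peak infected (naive):", max(naive_representation(small)),
          max(naive_representation(large)))
    print("first structural feature:",
          structural_representation(small, 1_000)[0],
          structural_representation(large, 1_000_000)[0])
```

Under these assumptions, a classifier trained on the structural feature would see the same input distribution for both population sizes, whereas one trained on raw counts would face a shift in scale.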


