Does Deep Learning Learn to Abstract? A Systematic Probing Framework

02/23/2023
by Shengnan An, et al.

Abstraction, the ability to induce abstract concepts from concrete instances and flexibly apply them beyond the learning context, is a desirable capability for deep learning models. However, there is no clear understanding of whether deep learning models have this capability or what its characteristics are. In this paper, we introduce a systematic probing framework to explore the abstraction capability of deep learning models from a transferability perspective. A set of controlled experiments based on this framework provides strong evidence that the two probed pre-trained language models (PLMs), T5 and GPT2, have the abstraction capability. Our in-depth analysis sheds further light on this capability: (1) the training phase as a whole exhibits a two-stage "memorize-then-abstract" process; (2) the learned abstract concepts are concentrated in a few middle-layer attention heads rather than evenly distributed throughout the model; (3) the probed abstraction capabilities are robust to concept mutations, and more robust to low-level/source-side mutations than to high-level/target-side ones; (4) generic pre-training is critical to the emergence of the abstraction capability, and PLMs exhibit better abstraction with larger model sizes and data scales.

