KSAT: Knowledge-infused Self Attention Transformer – Integrating Multiple Domain-Specific Contexts

10/09/2022
by Kaushik Roy, et al.

Domain-specific language understanding requires integrating multiple pieces of relevant contextual information. For example, both suicide-related and depression-related behavior (multiple contexts) appear in the text “I have a gun and feel pretty bad about my life, and it wouldn't be the worst thing if I didn't wake up tomorrow”. In self-attention architectures, domain specificity is typically handled by fine-tuning on excerpts from relevant domain-specific resources (datasets and external knowledge, e.g., medical textbook chapters on mental health diagnosis related to suicide and depression). We propose a modified self-attention architecture, the Knowledge-infused Self Attention Transformer (KSAT), that integrates multiple domain-specific contexts through the use of external knowledge sources. To accomplish this, KSAT introduces knowledge-guided biases in dedicated self-attention layers, one for each knowledge source. In addition, KSAT provides mechanisms for controlling the trade-off between learning from data and learning from knowledge. Our quantitative and qualitative evaluations show that (1) the KSAT architecture provides novel, human-understandable ways to precisely measure and visualize the contributions of the infused domain contexts, and (2) KSAT performs competitively with other knowledge-infused baselines and significantly outperforms baselines that use fine-tuning for domain-specific tasks.
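
The abstract only sketches the mechanism, so the snippet below shows one plausible way a knowledge-guided bias and a learnable data-vs-knowledge gate could enter a self-attention layer. This is a minimal illustrative sketch in PyTorch, not the authors' implementation: the class name, the `alpha` gate, and the `knowledge_bias` input are assumptions introduced here for illustration.

```python
# Hypothetical sketch of a knowledge-biased self-attention layer (not the KSAT release).
# A precomputed knowledge_bias matrix shifts the attention scores, and a learnable
# gate `alpha` controls the trade-off between data-driven and knowledge-guided attention.
import math
import torch
import torch.nn as nn

class KnowledgeBiasedSelfAttention(nn.Module):
    def __init__(self, d_model: int):
        super().__init__()
        self.q_proj = nn.Linear(d_model, d_model)
        self.k_proj = nn.Linear(d_model, d_model)
        self.v_proj = nn.Linear(d_model, d_model)
        self.out_proj = nn.Linear(d_model, d_model)
        # Gate in (0, 1) after sigmoid: near 0 -> mostly data-driven attention,
        # near 1 -> mostly knowledge-guided attention.
        self.alpha = nn.Parameter(torch.zeros(1))
        self.d_model = d_model

    def forward(self, x: torch.Tensor, knowledge_bias: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model)
        # knowledge_bias: (batch, seq_len, seq_len), e.g. token-pair affinities
        # derived from one external knowledge source.
        q, k, v = self.q_proj(x), self.k_proj(x), self.v_proj(x)
        scores = q @ k.transpose(-2, -1) / math.sqrt(self.d_model)
        gate = torch.sigmoid(self.alpha)
        # Convex mix of data-driven scores and knowledge-derived scores.
        mixed = (1 - gate) * scores + gate * knowledge_bias
        attn = torch.softmax(mixed, dim=-1)
        return self.out_proj(attn @ v)

# Usage: one such layer per knowledge source, stacked with standard transformer blocks.
x = torch.randn(2, 16, 64)
bias = torch.randn(2, 16, 16)          # placeholder knowledge affinities
layer = KnowledgeBiasedSelfAttention(64)
out = layer(x, bias)                   # -> (2, 16, 64)
```

Under this reading, dedicating one such biased layer to each knowledge source and inspecting the per-layer gate would give a directly measurable, visualizable account of how much each infused domain context contributes, which is the kind of interpretability the abstract claims.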
