Contextual information integration for stance detection via cross-attention

11/03/2022
by   Tilman Beck, et al.
0

Stance detection deals with the identification of an author's stance towards a target and is applied on various text domains like social media and news. In many cases, inferring the stance is challenging due to insufficient access to contextual information. Complementary context can be found in knowledge bases but integrating the context into pretrained language models is non-trivial due to their graph structure. In contrast, we explore an approach to integrate contextual information as text which aligns better with transformer architectures. Specifically, we train a model consisting of dual encoders which exchange information via cross-attention. This architecture allows for integrating contextual information from heterogeneous sources. We evaluate context extracted from structured knowledge sources and from prompting large language models. Our approach is able to outperform competitive baselines (1.9pp on average) on a large and diverse stance detection benchmark, both (1) in-domain, i.e. for seen targets, and (2) out-of-domain, i.e. for targets unseen during training. Our analysis shows that it is able to regularize for spurious label correlations with target-specific cue words.

READ FULL TEXT
research
06/27/2022

Few-Shot Stance Detection via Target-Aware Prompt Distillation

Stance detection aims to identify whether the author of a text is in fav...
research
06/12/2019

Putting words in context: LSTM language models and lexical ambiguity

In neural network models of language, words are commonly represented usi...
research
03/16/2022

CUE Vectors: Modular Training of Language Models Conditioned on Diverse Contextual Signals

We propose a framework to modularize the training of neural language mod...
research
01/09/2022

Medication Error Detection Using Contextual Language Models

Medication errors most commonly occur at the ordering or prescribing sta...
research
11/23/2020

Sarcasm detection from user-generated noisy short text

Sentiment analysis of social media comments is very important for review...
research
10/03/2022

Détection de petites cibles par apprentissage profond et critère a contrario

Small target detection is an essential yet challenging task in defense a...
research
08/22/2023

Generalising sequence models for epigenome predictions with tissue and assay embeddings

Sequence modelling approaches for epigenetic profile prediction have rec...

Please sign up or login with your details

Forgot password? Click here to reset