Variation and generality in encoding of syntactic anomaly information in sentence embeddings

11/12/2021
by   Qinxuan Wu, et al.
0

While sentence anomalies have been applied periodically for testing in NLP, we have yet to establish a picture of the precise status of anomaly information in representations from NLP models. In this paper we aim to fill two primary gaps, focusing on the domain of syntactic anomalies. First, we explore fine-grained differences in anomaly encoding by designing probing tasks that vary the hierarchical level at which anomalies occur in a sentence. Second, we test not only models' ability to detect a given anomaly, but also the generality of the detected anomaly signal, by examining transfer between distinct anomaly types. Results suggest that all models encode some information supporting anomaly detection, but detection performance varies between anomalies, and only representations from more recent transformer models show signs of generalized knowledge of anomalies. Follow-up analyses support the notion that these models pick up on a legitimate, general notion of sentence oddity, while coarser-grained word position information is likely also a contributor to the observed anomaly detection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/09/2018

Precision and Recall for Range-Based Anomaly Detection

Classical anomaly detection is principally concerned with point-based an...
research
06/14/2023

SaliencyCut: Augmenting Plausible Anomalies for Open-set Fine-Grained Anomaly Detection

Open-set fine-grained anomaly detection is a challenging task that requi...
research
11/25/2022

A Deep Learning Anomaly Detection Method in Textual Data

In this article, we propose using deep learning and transformer architec...
research
04/28/2021

PANDA : Perceptually Aware Neural Detection of Anomalies

Semi-supervised methods of anomaly detection have seen substantial advan...
research
01/13/2021

Anomaly Detection Support Using Process Classification

Anomaly detection systems need to consider a lot of information when sca...
research
05/16/2021

How is BERT surprised? Layerwise detection of linguistic anomalies

Transformer language models have shown remarkable ability in detecting w...
research
09/21/2017

AutoPerf: A Generalized Zero-Positive Learning System to Detect Software Performance Anomalies

In this paper, we present AutoPerf, a generalized software performance a...

Please sign up or login with your details

Forgot password? Click here to reset