Syntax and Semantics Meet in the "Middle": Probing the Syntax-Semantics Interface of LMs Through Agentivity

05/29/2023
by   Lindia Tjuatja, et al.
0

Recent advances in large language models have prompted researchers to examine their abilities across a variety of linguistic tasks, but little has been done to investigate how models handle the interactions in meaning across words and larger syntactic forms – i.e. phenomena at the intersection of syntax and semantics. We present the semantic notion of agentivity as a case study for probing such interactions. We created a novel evaluation dataset by utilitizing the unique linguistic properties of a subset of optionally transitive English verbs. This dataset was used to prompt varying sizes of three model classes to see if they are sensitive to agentivity at the lexical level, and if they can appropriately employ these word-level priors given a specific syntactic context. Overall, GPT-3 text-davinci-003 performs extremely well across all experiments, outperforming all other models tested by far. In fact, the results are even better correlated with human judgements than both syntactic and semantic corpus statistics. This suggests that LMs may potentially serve as more useful tools for linguistic annotation, theory testing, and discovery than select corpora for certain tasks.

READ FULL TEXT
research
10/24/2022

The Better Your Syntax, the Better Your Semantics? Probing Pretrained Language Models for the English Comparative Correlative

Construction Grammar (CxG) is a paradigm from cognitive linguistics emph...
research
02/11/2018

Syntax and Semantics of Italian Poetry in the First Half of the 20th Century

In this paper we study, analyse and comment rhetorical figures present i...
research
02/02/2016

The Grail theorem prover: Type theory for syntax and semantics

As the name suggests, type-logical grammars are a grammar formalism base...
research
09/22/2021

Cross-linguistically Consistent Semantic and Syntactic Annotation of Child-directed Speech

While corpora of child speech and child-directed speech (CDS) have enabl...
research
09/11/2016

Divide and...conquer? On the limits of algorithmic approaches to syntactic semantic structure

In computer science, divide and conquer (D&C) is an algorithm design par...
research
10/22/2018

Predictive Linguistic Features of Schizophrenia

Schizophrenia is one of the most disabling and difficult to treat of all...
research
07/31/2015

Spin Glass Models of Syntax and Language Evolution

Using the SSWL database of syntactic parameters of world languages, and ...

Please sign up or login with your details

Forgot password? Click here to reset