Architectures of Meaning, A Systematic Corpus Analysis of NLP Systems

07/16/2021
by Oskar Wysocki, et al.

This paper proposes a novel statistical corpus analysis framework for interpreting Natural Language Processing (NLP) architectural patterns at scale. The approach combines saturation-based lexicon construction, statistical corpus analysis methods, and graph collocations to induce a synthesized representation of NLP architectural patterns from corpora. The framework is validated on the full corpus of SemEval tasks and yields coherent architectural patterns that can be used to answer architectural questions in a data-driven fashion, providing a systematic mechanism for interpreting a highly dynamic and exponentially growing field.

research
05/28/2020

What is SemEval evaluating? A Systematic Analysis of Evaluation Campaigns in NLP

SemEval is the primary venue in the NLP community for the proposal of ne...
research
09/25/2022

Corpus-based Metaphor Analysis through Graph Theoretical Methods

As a contribution to metaphor analysis, we introduce a statistical, data...
research
05/22/2023

A Diachronic Analysis of the NLP Research Paradigm Shift: When, How, and Why?

Understanding the fundamental concepts and trends in a scientific field ...
research
12/11/2021

Architectural Form and Affect: A Spatiotemporal Study of Arousal

How does the form of our surroundings impact the ways we feel? This pape...
research
10/15/2020

Improving Natural Language Processing Tasks with Human Gaze-Guided Neural Attention

A lack of corpora has so far limited advances in integrating human gaze ...
research
10/27/2022

Creating a morphological and syntactic tagged corpus for the Uzbek language

Nowadays, creation of the tagged corpora is becoming one of the most imp...
