Language Invariant Properties in Natural Language Processing

09/27/2021
by   Federico Bianchi, et al.
2

Meaning is context-dependent, but many properties of language (should) remain the same even if we transform the context. For example, sentiment, entailment, or speaker properties should be the same in a translation and original of a text. We introduce language invariant properties: i.e., properties that should not change when we transform text, and how they can be used to quantitatively evaluate the robustness of transformation algorithms. We use translation and paraphrasing as transformation examples, but our findings apply more broadly to any transformation. Our results indicate that many NLP transformations change properties like author characteristics, i.e., make them sound more male. We believe that studying these properties will allow NLP to address both social factors and pragmatic aspects of language. We also release an application suite that can be used to evaluate the invariance of transformation applications.

READ FULL TEXT

page 4

page 5

research
01/16/2013

A Rhetorical Analysis Approach to Natural Language Processing

The goal of this research was to find a way to extend the capabilities o...
research
03/21/2021

TextFlint: Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing

Various robustness evaluation methodologies from different perspectives ...
research
11/07/2019

Transformation of Dense and Sparse Text Representations

Sparsity is regarded as a desirable property of representations, especia...
research
03/19/2018

Dynamic Natural Language Processing with Recurrence Quantification Analysis

Writing and reading are dynamic processes. As an author composes a text,...
research
11/10/2022

An Inclusive Notion of Text

Natural language processing researchers develop models of grammar, meani...
research
04/24/2023

Topological properties and organizing principles of semantic networks

Interpreting natural language is an increasingly important task in compu...
research
08/05/2019

Processamento de linguagem natural em Português e aprendizagem profunda para o domínio de Óleo e Gás

Over the last few decades, institutions around the world have been chall...

Please sign up or login with your details

Forgot password? Click here to reset