Inducing Meaningful Units from Character Sequences with Slot Attention

02/01/2021
by   Melika Behjati, et al.

Characters do not convey meaning, but sequences of characters do. We propose an unsupervised distributional method to learn the abstract meaning-bearing units in a sequence of characters. Rather than segmenting the sequence, this model discovers continuous representations of the "objects" in the sequence, using a recently proposed architecture for object discovery in images called Slot Attention. We train our model on different languages and evaluate the quality of the obtained representations with probing classifiers. Our experiments show promising results in the ability of our units to capture meaning at a higher level of abstraction.
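To make the mechanism named in the abstract concrete, the following is a minimal NumPy sketch of the core Slot Attention loop applied to a sequence of character vectors. It is an illustration under stated assumptions, not the authors' implementation: the function name `slot_attention`, the random projection matrices (standing in for learned weights), the slot count, and the iteration count are all hypothetical, and the GRU, MLP, and layer normalization of the published architecture are replaced by a plain weighted-mean update.

```python
import numpy as np

def slot_attention(inputs, num_slots=4, num_iters=3, seed=0, eps=1e-8):
    """Simplified Slot Attention over a sequence of character embeddings.

    inputs: array of shape (seq_len, dim), one vector per character.
    Returns (slots, attn) with shapes (num_slots, dim) and (num_slots, seq_len).
    """
    rng = np.random.default_rng(seed)
    seq_len, dim = inputs.shape
    # Assumption: random projections stand in for learned query/key/value weights.
    w_q, w_k, w_v = (rng.normal(scale=dim ** -0.5, size=(dim, dim)) for _ in range(3))
    keys, values = inputs @ w_k, inputs @ w_v
    slots = rng.normal(size=(num_slots, dim))
    for _ in range(num_iters):
        logits = (slots @ w_q) @ keys.T / np.sqrt(dim)  # (num_slots, seq_len)
        # Softmax over the slot axis: slots compete for each character.
        logits = logits - logits.max(axis=0, keepdims=True)
        attn = np.exp(logits)
        attn = attn / attn.sum(axis=0, keepdims=True)
        # Each slot becomes the attention-weighted mean of the value vectors.
        weights = attn / (attn.sum(axis=1, keepdims=True) + eps)
        # Simplification: direct replacement instead of a GRU + MLP update.
        slots = weights @ values
    return slots, attn

# Usage: 12 hypothetical character embeddings of dimension 16.
chars = np.random.default_rng(1).normal(size=(12, 16))
slots, attn = slot_attention(chars)
print(slots.shape, attn.shape)
```

The point of the sketch is the normalization axis: because the softmax runs over slots rather than over characters, each character distributes its attention across the slots, which is what lets slots act as soft "objects" rather than hard segments.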

research
05/28/2017

Neural Semantic Parsing by Character-based Translation: Experiments with Abstract Meaning Representations

We evaluate the character-level translation method for neural semantic p...
research
11/02/2022

Neural Block-Slot Representations

In this paper, we propose a novel object-centric representation, called ...
research
07/13/2020

A theory of interaction semantics

The aim of this article is to delineate a theory of interaction semantic...
research
03/05/2022

Extracting linguistic speech patterns of Japanese fictional characters using subword units

This study extracted and analyzed the linguistic speech patterns that ch...
research
12/19/2022

Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training

Language tasks involving character-level manipulations (e.g., spelling c...
research
11/02/2020

Sequence-to-Sequence Networks Learn the Meaning of Reflexive Anaphora

Reflexive anaphora present a challenge for semantic interpretation: thei...
research
08/12/2019

LSTM vs. GRU vs. Bidirectional RNN for script generation

Scripts are an important part of any TV series. They narrate movements, ...
