Linguistic Matrix Theory

03/28/2017
by Dimitrios Kartsaklis, et al.

Recent research in computational linguistics has developed algorithms that associate matrices with adjectives and verbs, based on the distribution of words in a corpus of text. These matrices are linear operators on a vector space of context words. They are used to construct the meaning of composite expressions from that of their elementary constituents, forming part of a compositional distributional approach to semantics. We propose a Matrix Theory approach to this data, based on permutation symmetry along with Gaussian weights and their perturbations. A simple Gaussian model is tested against word matrices created from a large corpus of text. We characterize the cubic and quartic departures from the model, which we propose, alongside the Gaussian parameters, as signatures for comparing linguistic corpora. We propose that perturbed Gaussian models with permutation symmetry provide a promising framework for characterizing the nature of universality in the statistical properties of word matrices. The matrix theory framework developed here exploits the view of statistics as zero-dimensional perturbative quantum field theory. It perceives language as a physical system realizing a universality class of matrix statistics characterized by permutation symmetry.
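To make the idea of permutation-symmetric statistics concrete, the following is a minimal sketch, not the authors' code, of how one might average a few permutation-invariant polynomials over an ensemble of word matrices. The function names `invariant_observables` and `permute`, the particular choice of invariants, and the random stand-in data are all assumptions made for illustration. The abstract does not specify which observables the paper uses.

```python
import numpy as np


def invariant_observables(mats):
    """Ensemble averages of a few polynomials in the matrix entries that are
    unchanged when rows and columns are permuted simultaneously.

    mats: array of shape (K, D, D), one D x D matrix per word (hypothetical input).
    """
    mats = np.asarray(mats, dtype=float)
    diag = np.einsum("kii->ki", mats)  # diagonal entries of each matrix
    return {
        # linear invariants
        "sum_diag": diag.sum(axis=1).mean(),
        "sum_all": mats.sum(axis=(1, 2)).mean(),
        # quadratic invariants (illustrative subset)
        "sum_diag_sq": (diag ** 2).sum(axis=1).mean(),
        "sum_all_sq": (mats ** 2).sum(axis=(1, 2)).mean(),
        # one cubic invariant: trace of the matrix cube
        "trace_cube": np.einsum("kij,kjl,kli->k", mats, mats, mats).mean(),
    }


def permute(mats, perm):
    """Apply the same permutation to the rows and columns of every matrix."""
    return mats[:, perm][:, :, perm]


# Stand-in data: random matrices, NOT corpus-derived word matrices.
rng = np.random.default_rng(0)
mats = rng.normal(size=(5, 4, 4))
perm = rng.permutation(4)

before = invariant_observables(mats)
after = invariant_observables(permute(mats, perm))
unchanged = all(np.isclose(before[k], after[k]) for k in before)
```

Here `unchanged` checks that relabelling the context-word basis leaves each observable the same, which is the symmetry the abstract refers to. In a comparison of the kind described, the linear and quadratic averages would be used to fix the Gaussian parameters, and higher-order averages such as `trace_cube` would then be compared with the Gaussian prediction.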

