The flow of ideas in word embeddings

07/26/2023
by Debayan Dasgupta, et al.

The flow of ideas has been extensively studied by physicists, psychologists, and machine learning engineers. This paper adopts specific tools from microrheology to investigate the similarity-based flow of ideas. We introduce a random walker in word embeddings and study its behavior. Such similarity-mediated random walks through the embedding space show signatures of anomalous diffusion commonly observed in complex structured systems such as biological cells and complex fluids. The paper concludes by proposing that popular tools from the study of random walks and the diffusion of particles under Brownian motion be applied to quantitatively assess how diverse ideas are incorporated in a document. Overall, this paper presents a self-referenced method combining microrheology and machine learning concepts to explore the meandering tendencies of language models and their potential association with creativity.
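As a rough illustration of the kind of analysis the abstract describes, the sketch below runs a similarity-mediated random walk over a set of embedding vectors and estimates the diffusion exponent from the mean squared displacement (MSD), where MSD(τ) ∝ τ^α and α ≠ 1 indicates anomalous diffusion. It is not the authors' method: the hopping rule (a softmax over the `k` most cosine-similar words), the parameters `k` and `temperature`, and all function names are assumptions made for this example, and the embeddings in the usage note are random placeholders rather than a trained model.

```python
import numpy as np


def similarity_walk(emb, n_steps, k=10, temperature=0.1, start=0, rng=None):
    """Random walk over rows of `emb` (hypothetical hopping rule).

    At each step the walker jumps to one of the k rows most cosine-similar
    to its current row, chosen with softmax probabilities over similarity.
    """
    rng = np.random.default_rng() if rng is None else rng
    unit = emb / np.linalg.norm(emb, axis=1, keepdims=True)
    path = [start]
    for _ in range(n_steps):
        sims = unit @ unit[path[-1]]
        sims[path[-1]] = -np.inf  # assumption: the walker may not stay put
        nearest = np.argpartition(-sims, k)[:k]  # indices of the k most similar rows
        logits = sims[nearest] / temperature
        probs = np.exp(logits - logits.max())
        probs /= probs.sum()
        path.append(int(rng.choice(nearest, p=probs)))
    return np.array(path)


def mean_squared_displacement(positions, max_lag):
    """Time-averaged MSD of a trajectory of shape (n_steps, n_dims)."""
    lags = np.arange(1, max_lag + 1)
    msd = np.array([
        np.mean(np.sum((positions[lag:] - positions[:-lag]) ** 2, axis=1))
        for lag in lags
    ])
    return lags, msd


def diffusion_exponent(lags, msd):
    """Slope of log(MSD) against log(lag), i.e. alpha in MSD ~ lag**alpha."""
    slope, _ = np.polyfit(np.log(lags), np.log(msd), 1)
    return slope
```

A possible usage, with placeholder embeddings standing in for a real vocabulary: build `emb = np.random.default_rng(0).normal(size=(200, 16))`, compute `path = similarity_walk(emb, 500)`, then pass `emb[path]` to `mean_squared_displacement` and the result to `diffusion_exponent`. An exponent near 1 would correspond to ordinary Brownian diffusion, below 1 to subdiffusion, and above 1 to superdiffusion.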

research
04/23/2018

Bilingual Embeddings with Random Walks over Multilingual Wordnets

Bilingual word embeddings represent words of two languages in the same s...
research
09/05/2020

Bio-inspired Structure Identification in Language Embeddings

Word embeddings are a popular way to improve downstream performances in ...
research
06/16/2021

Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study

Several variants of deep neural networks have been successfully employed...
research
10/13/2020

BRUMS at SemEval-2020 Task 3: Contextualised Embeddings for Predicting the (Graded) Effect of Context in Word Similarity

This paper presents the team BRUMS submission to SemEval-2020 Task 3: Gr...
research
01/21/2018

A Survey of Word Embeddings Evaluation Methods

Word embeddings are real-valued word representations able to capture lex...
research
12/01/2020

Intrinsic analysis for dual word embedding space models

Recent word embeddings techniques represent words in a continuous vector...
research
09/18/2015

Word, graph and manifold embedding from Markov processes

Continuous vector representations of words and objects appear to carry s...
