A Novel Method of Extracting Topological Features from Word Embeddings

03/29/2020
by Shafie Gholizadeh, et al.

In recent years, topological data analysis has been applied to a wide range of problems involving high-dimensional, noisy data. Although text representations are often high-dimensional and noisy, there are only a few works on the application of topological data analysis to natural language processing. In this paper, we introduce a novel algorithm that extracts topological features from word embedding representations of text for use in text classification. Working on word embeddings, topological data analysis can interpret the high-dimensional embedding space and discover the relations among different embedding dimensions. We use persistent homology, the most commonly used tool from topological data analysis, for our experiments. Examining our topological algorithm on long textual documents, we show that the topological features we define may outperform conventional text mining features.
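
To make the idea of persistent homology on embedding vectors concrete, here is a minimal sketch of its simplest case: 0-dimensional persistence of a Vietoris-Rips filtration over a small point cloud. It is an illustration only and not the algorithm introduced in the paper, whose details the abstract does not give. The function name `zero_dim_persistence` and the toy 3-dimensional "embeddings" are assumptions made for this example.

```python
import math
from itertools import combinations


def zero_dim_persistence(points):
    """Return the death scales of 0-dimensional classes, in increasing order.

    In a Vietoris-Rips filtration every point is born at scale 0, and a
    connected component dies at the scale where it merges into another one.
    Those merge scales are the edge lengths of a minimum spanning tree,
    found here with Kruskal's algorithm and a union-find structure.
    """
    n = len(points)
    parent = list(range(n))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # All pairwise Euclidean distances, shortest first.
    edges = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i, j in combinations(range(n), 2)
    )

    deaths = []
    for length, i, j in edges:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            # Two components merge at this scale, so one class dies.
            parent[root_i] = root_j
            deaths.append(length)
    return deaths


# Hypothetical toy "embeddings": three nearby vectors and one distant vector.
toy_vectors = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (5.0, 5.0, 5.0)]
deaths = zero_dim_persistence(toy_vectors)
```

With these toy vectors, the two short death scales come from the tight cluster and the single long one from the distant vector. A sorted list of death scales like this is one simple way such output might be turned into a fixed-order feature vector for a classifier. Higher-dimensional features such as loops require a full persistent homology library.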


research
03/29/2020

Topological Data Analysis in Text Classification: Extracting Features with Additive Information

While the strength of Topological Data Analysis has been explored in man...
research
06/03/2019

An Introduction to a New Text Classification and Visualization for Natural Language Processing Using Topological Data Analysis

Topological Data Analysis (TDA) is a novel new and fast growing field of...
research
03/01/2022

Topological Data Analysis for Word Sense Disambiguation

We develop and test a novel unsupervised algorithm for word sense induct...
research
08/22/2022

Dialogue Term Extraction using Transfer Learning and Topological Data Analysis

Goal oriented dialogue systems were originally designed as a natural lan...
research
09/02/2021

Knot invariants and their relations: a topological perspective

This work brings methods from topological data analysis to knot theory a...
research
11/17/2020

Argumentative Topology: Finding Loop(holes) in Logic

Advances in natural language processing have resulted in increased capab...
research
02/07/2021

A Note on Argumentative Topology: Circularity and Syllogisms as Unsolved Problems

In the last couple of years there were a few attempts to apply topologic...
