Word Embedding based Edit Distance

10/25/2018
by   Yilin Niu, et al.
0

Text similarity calculation is a fundamental problem in natural language processing and related fields. In recent years, deep neural networks have been developed to perform the task and high performances have been achieved. The neural networks are usually trained with labeled data in supervised learning, and creation of labeled data is usually very costly. In this short paper, we address unsupervised learning for text similarity calculation. We propose a new method called Word Embedding based Edit Distance (WED), which incorporates word embedding into edit distance. Experiments on three benchmark datasets show WED outperforms state-of-the-art unsupervised methods including edit distance, TF-IDF based cosine, word embedding based cosine, Jaccard index, etc.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/22/2019

Unsupervised Lemmatization as Embeddings-Based Word Clustering

We focus on the task of unsupervised lemmatization, i.e. grouping togeth...
research
10/24/2018

Local Homology of Word Embeddings

Topological data analysis (TDA) has been widely used to make progress on...
research
07/05/2018

A Review of Different Word Embeddings for Sentiment Classification using Deep Learning

The web is loaded with textual content, and Natural Language Processing ...
research
01/31/2021

Introduction of a novel word embedding approach based on technology labels extracted from patent data

Diversity in patent language is growing and makes finding synonyms for c...
research
08/13/2018

Angular-Based Word Meta-Embedding Learning

Ensembling word embeddings to improve distributed word representations h...
research
11/01/2019

Finding the most similar textual documents using Case-Based Reasoning

In recent years, huge amounts of unstructured textual data on the Intern...
research
04/14/2020

Extending Text Informativeness Measures to Passage Interestingness Evaluation (Language Model vs. Word Embedding)

Standard informativeness measures used to evaluate Automatic Text Summar...

Please sign up or login with your details

Forgot password? Click here to reset