Evaluating Word Embedding Models: Methods and Experimental Results

01/28/2019
by Bin Wang, et al.

This work conducts an extensive evaluation of a large number of word embedding models for language processing applications. First, we introduce popular word embedding models and discuss the desired properties of word models and of evaluation methods (or evaluators). Then, we categorize evaluators into two types: intrinsic and extrinsic. Intrinsic evaluators test the quality of a representation independently of specific natural language processing tasks, while extrinsic evaluators use word embeddings as input features to a downstream task and measure changes in performance metrics specific to that task. We report experimental results of intrinsic and extrinsic evaluators on six word embedding models. The results show that different evaluators focus on different aspects of word models, and that some are more correlated with natural language processing tasks than others. Finally, we adopt correlation analysis to study the performance consistency of extrinsic and intrinsic evaluators.
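
To make the idea of an intrinsic evaluator concrete, here is a minimal sketch of one common kind, a word similarity test, in which the cosine similarity between word vectors is compared with human similarity ratings through Spearman rank correlation. The abstract does not say which evaluators or datasets the paper uses, so this is an illustration under stated assumptions, not the authors' procedure. The function names (`cosine`, `average_ranks`, `spearman`, `similarity_score`) are hypothetical, and the toy vectors and ratings are invented purely for demonstration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def average_ranks(values):
    """1-based ranks in ascending order; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (std_x * std_y)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def similarity_score(embeddings, rated_pairs):
    """Correlate model similarities with human ratings.

    Pairs containing a word missing from the embeddings are skipped.
    """
    model_sims, human_scores = [], []
    for w1, w2, rating in rated_pairs:
        if w1 in embeddings and w2 in embeddings:
            model_sims.append(cosine(embeddings[w1], embeddings[w2]))
            human_scores.append(rating)
    return spearman(model_sims, human_scores)


# Toy data, invented for illustration only (not from the paper).
toy_embeddings = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.8, 0.2, 0.1],
    "car": [0.1, 0.9, 0.2],
    "truck": [0.0, 0.8, 0.3],
}
toy_pairs = [
    ("cat", "dog", 9.0),
    ("car", "truck", 8.5),
    ("cat", "car", 2.0),
    ("dog", "truck", 1.5),
]
score = similarity_score(toy_embeddings, toy_pairs)
print(f"Spearman correlation with human ratings: {score:.3f}")
```

The same `spearman` helper could, in principle, be applied to per-model scores from two different evaluators to gauge how consistently they rank a set of embedding models, which is one plausible reading of the correlation analysis mentioned above.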
