Downsampling Strategies are Crucial for Word Embedding Reliability

08/21/2018
by Johannes Hellrich, et al.

The reliability of word embedding algorithms, i.e., their ability to provide consistent computational judgments of word similarity when trained repeatedly on the same data set, has recently raised concerns. We compared the effect of two downsampling strategies, probabilistic downsampling and deterministic weighting, and found the latter to provide superior reliability while being competitive in accuracy when applied to singular value decomposition-based embeddings.
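To make the contrast concrete, here is a minimal Python sketch (not the authors' implementation; the function names, window size, and 1e-5 threshold are illustrative assumptions, the threshold being word2vec's default). Probabilistic downsampling randomly deletes occurrences of frequent words with word2vec's keep probability sqrt(t / f(w)), so every training run sees a different corpus; the weighting strategy instead keeps all tokens and deterministically scales each co-occurrence count by the keep probabilities of both words, yielding identical input to the SVD on every run.

import numpy as np
from collections import Counter, defaultdict

def keep_probability(freq, total, threshold=1e-5):
    # word2vec-style keep probability: sqrt(t / f(w)), capped at 1
    return min(1.0, np.sqrt(threshold * total / freq))

def probabilistic_downsample(tokens, threshold=1e-5, seed=None):
    # Probabilistic strategy: randomly drop occurrences of frequent
    # words, so repeated runs produce different training corpora.
    rng = np.random.default_rng(seed)
    counts = Counter(tokens)
    total = len(tokens)
    return [t for t in tokens
            if rng.random() < keep_probability(counts[t], total, threshold)]

def weighted_cooccurrence(tokens, window=2, threshold=1e-5):
    # Weighting strategy: keep every token but scale each co-occurrence
    # count by both words' keep probabilities; the result is deterministic.
    counts = Counter(tokens)
    total = len(tokens)
    p = {w: keep_probability(c, total, threshold) for w, c in counts.items()}
    cooc = defaultdict(float)
    for i, target in enumerate(tokens):
        for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
            if j != i:
                cooc[(target, tokens[j])] += p[target] * p[tokens[j]]
    return cooc

An SVD-based embedding would then be built by turning these (weighted) counts into a PPMI matrix and truncating its SVD; since the weighted counts do not depend on a random seed, the resulting vectors are identical across runs, which is the reliability advantage the abstract describes.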

Related research

11/30/2020
Blind signal decomposition of various word embeddings based on joint and individual variance explained
In recent years, natural language processing (NLP) has become one of the...

11/22/2020
DiaLex: A Benchmark for Evaluating Multidialectal Arabic Word Embeddings
Word embeddings are a core component of modern natural language processi...

09/10/2021
Assessing the Reliability of Word Embedding Gender Bias Measures
Various measures have been proposed to quantify human-like social biases...

06/16/2022
TransDrift: Modeling Word-Embedding Drift using Transformer
In modern NLP applications, word embeddings are a crucial backbone that ...

10/21/2020
PBoS: Probabilistic Bag-of-Subwords for Generalizing Word Embedding
We look into the task of generalizing word embeddings: given a set of pr...

08/16/2015
A Generative Word Embedding Model and its Low Rank Positive Semidefinite Solution
Most existing word embedding methods can be categorized into Neural Embe...

04/27/2022
Extremal GloVe: Theoretically Accurate Distributed Word Embedding by Tail Inference
Distributed word embeddings such as Word2Vec and GloVe have been widely ...
