Efficient distributed representations beyond negative sampling

03/30/2023
by   Lorenzo Dall'Amico, et al.
0

This article describes an efficient method to learn distributed representations, also known as embeddings. This is accomplished minimizing an objective function similar to the one introduced in the Word2Vec algorithm and later adopted in several works. The optimization computational bottleneck is the calculation of the softmax normalization constants for which a number of operations scaling quadratically with the sample size is required. This complexity is unsuited for large datasets and negative sampling is a popular workaround, allowing one to obtain distributed representations in linear time with respect to the sample size. Negative sampling consists, however, in a change of the loss function and hence solves a different optimization problem from the one originally proposed. Our contribution is to show that the sotfmax normalization constants can be estimated in linear time, allowing us to design an efficient optimization strategy to learn distributed representations. We test our approximation on two popular applications related to word and node embeddings. The results evidence competing performance in terms of accuracy with respect to negative sampling with a remarkably lower computational time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2021

Minimizing Nonsmooth Convex Functions with Variable Accuracy

We consider unconstrained optimization problems with nonsmooth and conve...
research
01/01/2015

Communication-Efficient Distributed Optimization of Self-Concordant Empirical Loss

We consider distributed convex optimization problems originated from sam...
research
12/31/2020

A Constant-time Adaptive Negative Sampling

Softmax classifiers with a very large number of classes naturally occur ...
research
02/03/2021

Cleora: A Simple, Strong and Scalable Graph Embedding Scheme

The area of graph embeddings is currently dominated by contrastive learn...
research
06/23/2020

An improved sample size calculation method for score tests in generalized linear models

Self and Mauritsen (1988) developed a sample size determination procedur...
research
09/24/2019

Improving Collaborative Metric Learning with Efficient Negative Sampling

Distance metric learning based on triplet loss has been applied with suc...
research
12/20/2021

String Sampling with Bidirectional String Anchors

The minimizers sampling mechanism is a popular mechanism for string samp...

Please sign up or login with your details

Forgot password? Click here to reset