Sketching Word Vectors Through Hashing

05/11/2017
by Behrang QasemiZadeh, et al.

We propose a new, fast word embedding technique based on hash functions. The method is a derandomization of a new type of random projection: by dropping the classic constraint used in designing random projections (i.e., preserving pairwise distances in a particular normed space), our solution exploits extremely sparse non-negative random projections. Our experiments show that the proposed method achieves results competitive with neural embedding learning techniques at only a fraction of their computational cost. The derandomization reduces the time and space complexity of the method, while the possibility of applying weighting methods such as positive pointwise mutual information (PPMI) to the models after their construction (and at a reduced dimensionality) gives the resulting embeddings high discriminatory power. The method also retains other known benefits of random projection-based techniques, such as ease of update.
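To make the idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes that each context word is mapped by a deterministic hash to exactly one of `dim` columns (one way to read "extremely sparse non-negative" projections without storing a random matrix), and that PPMI is then computed on the already reduced count matrix. The names `bucket`, `sketch_counts`, and `ppmi`, the use of MD5, the window size, and the dimensionality are all illustrative assumptions.

```python
import hashlib

import numpy as np


def bucket(context: str, dim: int) -> int:
    """Deterministically map a context word to one column index.

    Assumption: MD5 stands in for whatever hash function is actually used.
    """
    digest = hashlib.md5(context.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "little") % dim


def sketch_counts(tokens, vocab, dim=8, window=2):
    """Accumulate co-occurrence counts directly into `dim` hashed columns.

    Each context word contributes +1 to a single column, so the implied
    projection is sparse and non-negative and is never materialized.
    """
    index = {w: i for i, w in enumerate(vocab)}
    M = np.zeros((len(vocab), dim))
    for pos, word in enumerate(tokens):
        if word not in index:
            continue
        lo, hi = max(0, pos - window), min(len(tokens), pos + window + 1)
        for j in range(lo, hi):
            if j != pos:
                M[index[word], bucket(tokens[j], dim)] += 1.0
    return M


def ppmi(M):
    """Positive pointwise mutual information of a non-negative count matrix."""
    total = M.sum()
    row = M.sum(axis=1, keepdims=True)
    col = M.sum(axis=0, keepdims=True)
    with np.errstate(divide="ignore", invalid="ignore"):
        pmi = np.log(M * total / (row * col))
    pmi[~np.isfinite(pmi)] = 0.0  # zero counts and empty rows/columns
    return np.maximum(pmi, 0.0)


tokens = "the cat sat on the mat the dog sat on the rug".split()
vocab = sorted(set(tokens))
counts = sketch_counts(tokens, vocab, dim=8, window=2)
vectors = ppmi(counts)  # weighting applied after construction, at reduced dimensionality
```

Because the counts are plain sums, new text can be folded in by adding further increments to `counts` and recomputing the weighting, which illustrates the ease of update mentioned above.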


