Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations

06/21/2018
by Ting Chen, et al.

Conventional embedding methods directly associate each symbol with a continuous embedding vector, which is equivalent to applying a linear transformation to a "one-hot" encoding of the discrete symbols. Despite its simplicity, this approach yields a number of parameters that grows linearly with the vocabulary size and can lead to overfitting. In this work, we propose a much more compact K-way D-dimensional discrete encoding scheme to replace the "one-hot" encoding. In the proposed "KD encoding", each symbol is represented by a D-dimensional code with a cardinality of K, and the final symbol embedding vector is generated by composing the code embedding vectors. To learn semantically meaningful codes end-to-end, we derive a relaxed discrete optimization approach based on stochastic gradient descent, which can be applied to any differentiable computational graph with an embedding layer. In our experiments with various applications, from natural language processing to graph convolutional networks, the total size of the embedding layer can be reduced by up to 98% while achieving similar or better performance.
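To make the encoding concrete, the following is a minimal NumPy sketch of how a symbol embedding could be composed from a K-way D-dimensional code. It rests on several assumptions that are not taken from the abstract: the composition is a plain sum of the D selected code embedding vectors, the codes are drawn at random rather than learned, and the sizes (`N`, `d`, `K`, `D`) as well as the names `kd_embed` and `storage_summary` are hypothetical and chosen only for illustration.

```python
import numpy as np

def kd_embed(codes, code_tables):
    """Compose symbol embeddings from discrete codes (assumed: summation).

    codes:       (N, D) integer array, each entry in [0, K)
    code_tables: (D, K, d) array, one K-row embedding table per code dimension
    returns:     (N, d) array of symbol embeddings
    """
    num_dims = codes.shape[1]
    # For code dimension j, select row codes[n, j] of table j -> shape (N, D, d).
    picked = code_tables[np.arange(num_dims), codes]
    # Assumed composition function: sum over the D code embedding vectors.
    return picked.sum(axis=1)

def storage_summary(num_symbols, embed_dim, K, D):
    """Return (one-hot table floats, KD table floats, bits to store all codes)."""
    one_hot_floats = num_symbols * embed_dim
    kd_floats = D * K * embed_dim
    bits_per_digit = (K - 1).bit_length()  # bits needed for one K-way digit
    code_bits = num_symbols * D * bits_per_digit
    return one_hot_floats, kd_floats, code_bits

# Hypothetical sizes, not taken from the paper.
N, d, K, D = 10_000, 64, 32, 16
rng = np.random.default_rng(0)
codes = rng.integers(0, K, size=(N, D))   # random stand-in for learned codes
code_tables = rng.normal(size=(D, K, d))  # random stand-in for learned tables

embeddings = kd_embed(codes, code_tables)
summary = storage_summary(N, d, K, D)
```

In this sketch the float parameters no longer scale with the number of symbols; only the integer codes do, at D small K-way digits per symbol. The learning procedure described in the abstract (the relaxed discrete optimization) is not shown here.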

research
11/08/2017

Learning K-way D-dimensional Discrete Code For Compact Embedding Representations

Embedding methods such as word embedding have become pillars for many ap...
research
08/26/2019

Differentiable Product Quantization for End-to-End Embedding Compression

Embedding layer is commonly used to map discrete symbols into continuous...
research
03/07/2019

Multi-Hot Compact Network Embedding

Network embedding, as a promising way of the network representation lear...
research
10/25/2018

Bayesian Compression for Natural Language Processing

In natural language processing, a lot of the tasks are successfully solv...
research
04/04/2019

DAGCN: Dual Attention Graph Convolutional Networks

Graph convolutional networks (GCNs) have recently become one of the most...
research
01/23/2017

dna2vec: Consistent vector representations of variable-length k-mers

One of the ubiquitous representation of long DNA sequence is dividing it...
research
09/14/2022

On Layer-Rainbow Latin Cubes Containing Layer-Rainbow Latin Cubes

Despite the fact that latin cubes have been studied since in the 1940's,...
