Projected Hamming Dissimilarity for Bit-Level Importance Coding in Collaborative Filtering

by Christian Hansen, et al.

When reasoning about tasks that involve large amounts of data, a common approach is to represent data items as objects in the Hamming space, where operations can be done efficiently and effectively. Object similarity can then be computed by learning binary representations (hash codes) of the objects and computing their Hamming distance. While this is highly efficient, each bit dimension is equally weighted, which means that potentially discriminative information in the data is lost. A more expressive alternative is to use real-valued vector representations and compute their inner product; this allows varying the weight of each dimension but is orders of magnitude slower. To address this, we derive a new way of measuring the dissimilarity between two objects in the Hamming space with binary weighting of each dimension (i.e., disabling bits): we consider a field-agnostic dissimilarity that projects the vector of one object onto the vector of the other. When working in the Hamming space, this results in a novel projected Hamming dissimilarity, which, by choice of projection, effectively allows a binary importance weighting of the hash code of one object through the hash code of the other. We propose a variational hashing model for learning hash codes optimized for this projected Hamming dissimilarity, and experimentally evaluate it in collaborative filtering experiments. The resultant hash codes lead to effectiveness gains of up to +7% in NDCG and +14% in MRR compared to state-of-the-art hashing-based collaborative filtering baselines, while requiring no additional storage and no computational overhead compared to using the Hamming distance.
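
To make the idea of "disabling bits" concrete, the following is a minimal sketch that contrasts the standard Hamming distance with one possible bitwise form of a projected dissimilarity. The abstract does not state the formula, so the expression `a AND (a XOR b)` used here is an assumption for illustration, as are the function names and the choice to store hash codes as Python integers.

```python
def hamming_distance(a: int, b: int) -> int:
    """Standard Hamming distance: count the bit positions where a and b differ."""
    return bin(a ^ b).count("1")


def projected_hamming(a: int, b: int) -> int:
    """Illustrative projected dissimilarity (assumed form, not taken from the abstract).

    Differences between a and b are counted only at positions where a has
    a 1-bit, so the 0-bits of a act as disabled dimensions.
    """
    return bin(a & (a ^ b)).count("1")


if __name__ == "__main__":
    user_code = 0b1100  # hypothetical 4-bit hash code of a user
    item_code = 0b1010  # hypothetical 4-bit hash code of an item

    print(hamming_distance(user_code, item_code))
    print(projected_hamming(user_code, item_code))
    print(projected_hamming(item_code, user_code))
```

Under this assumed form, the measure is asymmetric: with codes `0b1111` and `0b0000`, the Hamming distance is 4 in both directions, while the projected value is 4 when the all-ones code does the masking and 0 when the all-zeros code does. It needs one extra AND on top of the XOR and bit count that the Hamming distance already uses.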




