Mini-Batch Consistent Slot Set Encoder for Scalable Set Encoding

03/02/2021
by Bruno Andreis, et al.

Most existing set encoding algorithms assume that all elements of the set are accessible during training and inference, and that enough computational resources are available to process sets of large cardinality concurrently. Both assumptions fail when the cardinality of the set is so large that the set cannot even be loaded into memory. In more extreme cases, the set size may be unbounded and the elements may arrive in a streaming manner, with the model receiving subsets of the full set at irregular intervals. To tackle these practical challenges in large-scale set encoding, we go beyond the usual constraints of invariance and equivariance and introduce a new property, termed Mini-Batch Consistency, that is required for large-scale mini-batch set encoding. We present a scalable and efficient set encoding mechanism that is amenable to mini-batch processing with respect to set elements and can update set representations as more data arrives. The proposed method respects the required symmetries of invariance and equivariance and is Mini-Batch Consistent for random partitions of the input set. We perform extensive experiments and show that our method is computationally efficient and yields rich set encoding representations for set-structured data.
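
As a rough illustration of the Mini-Batch Consistency property described above, the following minimal sketch encodes a set by summing a per-element feature map, then checks that encoding a random partition chunk by chunk gives the same result as encoding the full set at once. This is not the slot-based encoder proposed in the paper; the feature map and the names `encode_chunk`, `encode_streaming`, and `random_partition` are hypothetical and chosen only for illustration.

```python
import math
import random

def encode_chunk(chunk):
    """Sum a fixed per-element feature map over one chunk of the set."""
    # Hypothetical feature map for scalar elements: (tanh(x), x^2).
    return [sum(math.tanh(x) for x in chunk), sum(x * x for x in chunk)]

def encode_streaming(chunks):
    """Accumulate chunk encodings one at a time, keeping only a running state."""
    state = [0.0, 0.0]
    for chunk in chunks:
        part = encode_chunk(chunk)
        state = [s + p for s, p in zip(state, part)]
    return state

def random_partition(elements, num_chunks, rng):
    """Shuffle the elements and deal them into num_chunks disjoint subsets."""
    shuffled = elements[:]
    rng.shuffle(shuffled)
    return [shuffled[i::num_chunks] for i in range(num_chunks)]

rng = random.Random(0)
full_set = [rng.gauss(0.0, 1.0) for _ in range(1000)]

# Encoding of the whole set in one pass.
full_encoding = encode_chunk(full_set)

# Encoding of the same set, seen as 7 randomly partitioned mini-batches.
streamed_encoding = encode_streaming(random_partition(full_set, 7, rng))

# The two encodings agree up to floating-point rounding.
matches = all(
    math.isclose(a, b, rel_tol=1e-9, abs_tol=1e-9)
    for a, b in zip(full_encoding, streamed_encoding)
)
print(matches)
```

Sum pooling satisfies the property trivially because addition is associative and commutative; the sketch only shows what the property asks for, namely that the streamed encoding needs a fixed-size running state rather than the whole set in memory.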

research
06/02/2017

Streaming Bayesian inference: theoretical limits and mini-batch approximate message-passing

In statistical learning for real-world large-scale data problems, one mu...
research
08/26/2022

Universal Mini-Batch Consistency for Set Encoding Functions

Previous works have established solid foundations for neural set functio...
research
02/09/2019

Mini-batch learning of exponential family finite mixture models

Mini-batch algorithms have become increasingly popular due to the requir...
research
08/22/2021

An Efficient Mini-batch Method via Partial Transportation

Mini-batch optimal transport (m-OT) has been widely used recently to dea...
research
10/28/2021

Cross-Batch Negative Sampling for Training Two-Tower Recommenders

The two-tower architecture has been widely applied for learning item and...
research
04/02/2023

Mini-batch k-means terminates within O(d/ε) iterations

We answer the question: "Does local progress (on batches) imply global p...
research
07/10/2022

NGAME: Negative Mining-aware Mini-batching for Extreme Classification

Extreme Classification (XC) seeks to tag data points with the most relev...