Retrieving relevant items that match users' queries from billion-scale c...
BERT-style models pre-trained on the general corpus (e.g., Wikipedia) an...
Retrieving relevant targets from an extremely large target set under
com...
It is known that the Langevin dynamics used in MCMC is the gradient flow...
We consider doing Bayesian inference by minimizing the KL divergence on ...
The kernel embedding algorithm is an important component for adapting ke...
Stein variational gradient descent (SVGD) is a nonparametric inference
m...
Thompson sampling has impressive empirical performance for many multi-ar...