The Gumbel-max trick is a method to draw a sample from a categorical
dis...
Training large-scale mixture of experts models efficiently on modern har...
Routing problems are a class of combinatorial problems with many practic...
We derive an unbiased estimator for expectations over discrete random
va...
The well-known Gumbel-Max trick for sampling from a categorical distribu...