Large-scale clinical data is invaluable to driving many computational
sc...
Self-normalizing discriminative models approximate the normalized probab...
In this study, we introduce a new approach for learning language models ...
The negative sampling (NEG) objective function, used in word2vec, is a
s...
We provide the first extensive evaluation of how using different types o...