Fine-tuned transformer models have shown superior performances in many
n...
Deep learning (DL) models for medical image segmentation are highly
infl...
This paper studies how to learn parameters in diagonal Gaussian mixture
...
Traditional (unstructured) pruning methods for a Transformer model focus...
We present Meena, a multi-turn open-domain chatbot trained end-to-end on...
Hermitian tensors are generalizations of Hermitian matrices, but they ha...
We propose a generalization of the best arm identification problem in
st...
Deep neural networks for machine comprehension typically utilizes only w...