Pretrained neural models such as BERT, when fine-tuned to perform natura...
BERTs of a feather do not generalize together: Large variability in generalization across models with similar test set performance
If the same neural architecture is trained multiple times on the same da...
Junghyun Minis this you? claim profile