While recent language models have the ability to take long contexts as i...
Internet links enable users to deepen their understanding of a topic by
...
Generative search engines directly generate responses to user queries, a...
Recent work has observed that pre-trained models have higher
out-of-dist...
For interpreting the behavior of a probabilistic model, it is useful to
...
There is growing evidence that pretrained language models improve
task-s...
Datasets are not only resources for training accurate, deployable system...
Segmentation and (segment) labeling are generally treated separately in
...
Standard test sets for supervised learning evaluate in-distribution
gene...
Machine comprehension of texts longer than a single sentence often requi...
Several datasets have recently been constructed to expose brittleness in...
Contextual word representations derived from large-scale neural language...
Most statistical machine translation systems cannot translate words that...
While recurrent neural networks have found success in a variety of natur...
We present a novel method for obtaining high-quality, domain-targeted
mu...