Over recent years, an increasing amount of compute and data has been pou...
In-context learning (ICL) improves language models' performance on a var...
To study social, economic, and historical questions, researchers in the
...
Autoregressive transformers are spectacular models for short sequences b...
Progress in machine learning has been driven in large part by massive
in...
Recent work has shown that fine-tuning large pre-trained language models...
In NLP, models are usually evaluated by reporting single-number performa...
The extreme multi-label classification (XMC) task aims at tagging conten...
Large language models, which are often trained for hundreds of thousands...
Large-scale autoregressive language models such as GPT-3 are few-shot
le...