Recent research in mechanistic interpretability has attempted to
reverse...
Transformer-based language models (LMs) are powerful and widely-applicab...
Many NLP datasets have been found to contain shortcuts: simple decision ...
It has become standard to solve NLP tasks by fine-tuning pre-trained lan...
Masked language models conventionally use a masking rate of 15
belief th...
Dense retrieval methods have shown great promise over sparse retrieval
m...