Language models often exhibit behaviors that improve performance on a
pr...
Reinforcement learning from human feedback (RLHF) is a technique for tra...
Recent work has shown that computation in language models may be
human-u...
Continual learning is a problem for artificial neural networks that thei...
A principled understanding of generalization in deep learning may requir...