The current trend of scaling language models involves increasing both
pa...
The BigCode community, an open-scientific collaboration working on the
r...
Recent works successfully leveraged Large Language Models' (LLM) abiliti...
The BigScience Workshop was a value-driven initiative that spanned one a...
Large Language Models (LLMs) play an ever-increasing role in the field o...
The infrastructure necessary for training state-of-the-art models is bec...
Large language models have recently been shown to attain reasonable zero...
The scale, variety, and quantity of publicly-available NLP datasets has ...
Video understanding relies on perceiving the global content and modeling...
Modern deep learning applications require increasingly more compute to t...
State-of-the-art natural language processing (NLP) models often learn to...
Magnitude pruning is a widely used strategy for reducing model size in p...
Natural Language Generation (NLG) models are prone to generating repetit...
Recent advances in modern Natural Language Processing (NLP) research hav...
Recent advances in modern Natural Language Processing (NLP) research hav...
As Transfer Learning from large-scale pre-trained models becomes more
pr...
We introduce a new approach to generative data-driven dialogue systems (...
Much efforts has been devoted to evaluate whether multi-task learning ca...
We reformulate the problem of encoding a multi-scale representation of a...
We consider the task of word-level language modeling and study the
possi...
Convolutional Neural Networks (CNNs) define an exceptionally powerful cl...
In this paper we report on an application of computer algebra in which
m...
In the paper arguments are given why the concept of static evaluation ha...
GoTools is a program which solves life & death problems in the game of G...