Curriculum learning (CL) posits that machine learning models – similar t...
At the staggering pace with which the capabilities of large language mod...
Transformer language models encode the notion of word order using positi...
In NLP, models are usually evaluated by reporting single-number performa...
Interaction between caregivers and children plays a critical role in hum...
There has been a lot of interest in understanding what information is
ca...
Recursive processing is considered a hallmark of human linguistic abilit...
Training data memorization in NLP can both be beneficial (e.g., closed-b...
Moving towards human-like linguistic performance is often argued to requ...
We investigate the semantic knowledge of language models (LMs), focusing...
The field of explainable AI has recently seen an explosion in the number...
A possible explanation for the impressive performance of masked language...
In this paper, we propose to study language modelling as a multi-task
pr...
In this paper, we consider the syntactic properties of languages emerged...
Recursive processing in sentence comprehension is considered a hallmark ...
In previous work, artificial agents were shown to achieve almost perfect...
Referential games offer a grounded learning environment for neural agent...
Neural networks are surprisingly good at interpolating and perform remar...
Extensive research has recently shown that recurrent neural language mod...
Despite a multitude of empirical studies, little consensus exists on whe...
Since their inception, encoder-decoder models have successfully been app...
We present a detailed comparison of two types of sequence to sequence mo...
While sequence-to-sequence models have shown remarkable generalization p...
Recent work has shown that LSTMs trained on a generic language modeling
...
Human language, music and a variety of animal vocalisations constitute w...
Learning to follow human instructions is a challenging task because whil...
In this paper, we attempt to link the inner workings of a neural languag...
We investigate how encoder-decoder models trained on a synthetic dataset...
How do neural language models keep track of number agreement between sub...
In this paper, we introduce Attentive Guidance (AG), a new mechanism to
...
We investigate how neural networks can learn and process languages with
...