Recent efforts at explaining the interplay of memorization and generaliz...
Studying data memorization in neural language models helps us understand...
Machine learning models trained on private datasets have been shown to l...
This paper considers the Pointer Value Retrieval (PVR) benchmark introdu...
With the increasing adoption of NLP models in real-world products, it be...
Large language models (LMs) have been shown to memorize parts of their
t...
Modern neural language models widely used in tasks across NLP risk memor...
We investigate the robustness of vision transformers (ViTs) through the ...
Convolutional neural networks (CNNs) have so far been the de-facto model...
The successes of deep learning critically rely on the ability of neural
...
We find that existing language modeling datasets contain many near-dupli...
A discriminatively trained neural net classifier achieves optimal perfor...
In many machine learning applications, the training data can contain hig...
One desired capability for machines is the ability to transfer their
kno...
Deep learning algorithms are well-known to have a propensity for fitting...
Human learners appreciate that some facts demand memorization whereas ot...
With the increasingly varied applications of deep learning, transfer lea...
We study the interplay between memorization and generalization of
overpa...
Understanding learning and generalization of deep architectures has been...
We introduce a two-player contest for evaluating the safety and robustne...
Recent years have witnessed significant progresses in deep Reinforcement...
Theory of mind (ToM; Premack & Woodruff, 1978) broadly refers to humans'...
In Theory IIb we characterize with a mix of theory and experiments the
o...
MXNet is a multi-language machine learning (ML) library to ease the
deve...
Learning to predict multi-label outputs is challenging, but in many prob...
Representations in the auditory cortex might be based on mechanisms simi...