This paper focuses on predicting the occurrence of grokking in neural ne...
Large language models (LLMs) have shown increasing in-context learning c...
Forgetting is often seen as an unwanted characteristic in both human and...
Neural networks enjoy widespread use, but many aspects of their training...
The recent "Lottery Ticket Hypothesis" paper by Frankle & Carbin showed ...