In this study, we investigate whether the representations learned by neu...
We demonstrate the potential of few-shot translation systems, trained wi...
The “Neural Tangent Kernel” (NTK) (Jacot et al., 2018), and its empirical
...
In this work, we study the effect of varying the architecture and traini...
We revisit and extend model stitching (Lenc & Vedaldi 2015) as a metho...
Current autoencoder-based disentangled representation learning methods
a...
We prove a new upper bound on the generalization gap of classifiers that...
We introduce a new notion of generalization – Distributional Generalizat...
We show that a variety of modern deep learning tasks exhibit a
"double-d...
In this work, we propose a new training method for finding minimum weigh...