Most learning algorithms in machine learning rely on gradient descent to...
Many learning algorithms used as normative models in neuroscience or as
...
Pre-trained deep image representations are useful for post-training task...
Conscious states (states that there is something it is like to be in) se...
When presented with a data stream of two statistically dependent variabl...
Comparing learned neural representations in neural networks is a challen...
Diffusion-based generative models learn to iteratively transfer unstruct...
Among attempts at giving a theoretical account of the success of deep ne...
Current deep learning approaches have shown good in-distribution
general...
Inspired from human cognition, machine learning systems are gradually
re...
To unveil how the brain learns, ongoing work seeks biologically-plausibl...
Drawing inspiration from gradient-based meta-learning methods with infin...
The Energy-Based Model (EBM) framework is a very general approach to
gen...
A key challenge in building theoretical foundations for deep learning is...
Multi-head, key-value attention is the backbone of the widely successful...
In modern relational machine learning it is common to encounter large gr...
Inducing causal relationships from observations is a classic problem in
...
In a real-world setting biological agents do not have infinite resources...
We identify and formalize a fundamental gradient descent phenomenon resu...
Adversarial formulations such as generative adversarial networks (GANs) ...
We approach the problem of implicit regularization in deep learning from...
Robust perception relies on both bottom-up and top-down signals. Bottom-...
Recurrent neural networks (RNNs) have been successfully applied to a var...
Dynamic adaptation in single-neuron response plays a fundamental role in...
Attention and self-attention mechanisms, inspired by cognitive processes...
The efficiency of recurrent neural networks (RNNs) in dealing with seque...
Datasets such as images, text, or movies are embedded in high-dimensiona...
A recent strategy to circumvent the exploding and vanishing gradient pro...