Giancarlo Kerg

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Yoshua Bengio
448 publications
Kyunghyun Cho
218 publications
Caiming Xiong
210 publications
Richard Socher
111 publications
Huan Wang
82 publications
Anirudh Goyal
61 publications
Guy Wolf
50 publications
Ioannis Mitliagkas
46 publications
Gauthier Gidel
46 publications
Nan Rosemary Ke
33 publications
David Rolnick
31 publications

research

∙ 06/09/2022

On Neural Architecture Inductive Biases for Relational Tasks

Current deep learning approaches have shown good in-distribution general...

20 Giancarlo Kerg, et al. ∙

research

∙ 03/02/2022

Continuous-Time Meta-Learning with Forward Mode Differentiation

Drawing inspiration from gradient-based meta-learning methods with infin...

0 Tristan Deleu, et al. ∙

research

∙ 12/28/2020

Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization

The early phase of training has been shown to be important in two ways f...

16 Stanisław Jastrzębski, et al. ∙

research

∙ 06/22/2020

Advantages of biologically-inspired adaptive neural activation in RNNs during learning

Dynamic adaptation in single-neuron response plays a fundamental role in...

18 Victor Geadah, et al. ∙

research

∙ 06/16/2020

Untangling tradeoffs between recurrence and self-attention in neural networks

Attention and self-attention mechanisms, inspired by cognitive processes...

0 Giancarlo Kerg, et al. ∙

research

∙ 05/28/2019

Non-normal Recurrent Neural Network (nnRNN): learning long time dependencies while improving expressivity with transient dynamics

A recent strategy to circumvent the exploding and vanishing gradient pro...

1 Giancarlo Kerg, et al. ∙

research

∙ 10/06/2018

h-detach: Modifying the LSTM Gradient Towards Better Optimization

Recurrent neural networks are known for their notorious exploding and va...

0 Devansh Arpit, et al. ∙

Giancarlo Kerg

Featured Co-authors

On Neural Architecture Inductive Biases for Relational Tasks

Continuous-Time Meta-Learning with Forward Mode Differentiation

Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization

Advantages of biologically-inspired adaptive neural activation in RNNs during learning

Untangling tradeoffs between recurrence and self-attention in neural networks

Non-normal Recurrent Neural Network (nnRNN): learning long time dependencies while improving expressivity with transient dynamics

h-detach: Modifying the LSTM Gradient Towards Better Optimization

Sign in with Google

Consider DeepAI Pro