-
Self-Tuning Stochastic Optimization with Curvature-Aware Gradient Filtering
Standard first-order stochastic optimization algorithms base their updat...
-
Limitations of the Empirical Fisher Approximation
Natural gradient descent, which preconditions a gradient descent update ...
-
DeepOBS: A Deep Learning Optimizer Benchmark Suite
Because the choice and tuning of the optimizer affects the speed, and ul...
-
Follow the Signs for Robust Stochastic Optimization
Stochastic noise on gradients is now a common feature in machine learnin...
-
Early Stopping without a Validation Set
Early stopping is a widely used technique to prevent poor generalization...
-
Coupling Adaptive Batch Sizes with Learning Rates
Mini-batch stochastic gradient descent and variants thereof have become ...

Lukas Balles