
-
Lifelong Learning with Sketched Structural Regularization
Preventing catastrophic forgetting while continually learning new tasks ...
-
Benign Overfitting of Constant-Stepsize SGD for Linear Regression
There is an increasing realization that algorithmic inductive biases are...
-
Accommodating Picky Customers: Regret Bound and Exploration Complexity for Multi-Objective Reinforcement Learning
In this paper we consider multi-objective reinforcement learning where t...
-
Direction Matters: On the Implicit Regularization Effect of Stochastic Gradient Descent with Moderate Learning Rate
Understanding the algorithmic regularization effect of stochastic gradie...
-
Obtaining Adjustable Regularization for Free via Iterate Averaging
Regularization for optimization is a crucial technique to avoid overfitt...
-
The Multiplicative Noise in Stochastic Gradient Descent: Data-Dependent Regularization, Continuous and Discrete Approximation
The randomness in Stochastic Gradient Descent (SGD) is considered to pla...
-
Tangent-Normal Adversarial Regularization for Semi-supervised Learning
The ever-increasing size of modern datasets combined with the difficulty...
-
The Regularization Effects of Anisotropic Noise in Stochastic Gradient Descent
Understanding the generalization of deep learning has raised lots of con...