
Neural Thompson Sampling
Thompson Sampling (TS) is one of the most effective algorithms for solvi...
read it

Minimax Optimal Reinforcement Learning for Discounted MDPs
We study the reinforcement learning problem for discounted Markov Decisi...
read it

Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping
Modern tasks in reinforcement learning are always with large state and a...
read it

Neural Contextual Bandits with Upper Confidence BoundBased Exploration
We study the stochastic contextual bandit problem, where the reward is g...
read it

Stochastic Recursive VarianceReduced Cubic Regularization Methods
Stochastic VarianceReduced Cubic regularization (SVRC) algorithms have ...
read it

Lower Bounds for Smooth Nonconvex FiniteSum Optimization
Smooth finitesum optimization has been widely studied in both convex an...
read it

Sample Efficient Stochastic VarianceReduced Cubic Regularization Method
We propose a sample efficient stochastic variancereduced cubic regulari...
read it

Stochastic Gradient Descent Optimizes Overparameterized Deep ReLU Networks
We study the problem of training deep neural networks with Rectified Lin...
read it

On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization
Adaptive gradient methods are workhorses in deep learning. However, the ...
read it

Finding Local Minima via Stochastic Nested Variance Reduction
We propose two algorithms that can find local minima faster than the sta...
read it

Stochastic Nested Variance Reduction for Nonconvex Optimization
We study finitesum nonconvex optimization problems, where the objective...
read it

Stochastic VarianceReduced Cubic Regularized Newton Method
We propose a stochastic variancereduced cubic regularized Newton method...
read it
Dongruo Zhou
is this you? claim profile