
When is Particle Filtering Efficient for POMDP Sequential Planning?
Particle filtering is a popular method for inferring latent states in st...

Few-Shot Learning via Learning the Representation, Provably
This paper studies few-shot learning via representation learning, where ...

Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks
The selection of initial parameter values for gradient-based optimizatio...

Enhanced Convolutional Neural Tangent Kernels
Recent research shows that for training with ℓ_2 loss, convolutional neu...

Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets
Mode connectivity is a surprising phenomenon in the loss landscape of de...

Implicit Regularization in Deep Matrix Factorization
Efforts to understand the generalization mystery in deep learning have l...

Understanding Generalization of Deep Neural Networks Trained with Noisy Labels
Overparameterized deep neural networks trained by simple first-order me...

On Exact Computation with an Infinitely Wide Neural Net
How well does a classic deep net architecture like AlexNet or VGG19 clas...

Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks
Recent works have cast some light on the mystery of why deep nets fit an...

Width Provably Matters in Optimization for Deep Linear Neural Networks
We prove that for an L-layer fully-connected linear neural network, if t...

A Convergence Analysis of Gradient Descent for Deep Linear Neural Networks
We analyze speed of convergence to global optimum for gradient descent t...

Algorithmic Regularization in Learning Deep Homogeneous Models: Layers are Automatically Balanced
We study the implicit regularization imposed by gradient descent for lea...

Online Improper Learning with an Approximation Oracle
We revisit the question of reducing online learning to approximate optim...

An Analysis of the t-SNE Algorithm for Data Visualization
A first line of attack in exploratory data analysis is data visualizatio...

Linear Convergence of the Primal-Dual Gradient Method for Convex-Concave Saddle Point Problems without Strong Convexity
We consider the convex-concave saddle point problem min_x max_y f(x) + y^⊤ A x − g(y...

Linear Convergence of a Frank-Wolfe Type Algorithm over Trace-Norm Balls
We propose a rank-k variant of the classical Frank-Wolfe algorithm to so...

Combinatorial Multi-Armed Bandit with General Reward Functions
In this paper, we study the stochastic combinatorial multi-armed bandit ...
Wei Hu
verified profile