
Provably Correct Optimization and Exploration with Nonlinear Policies
Policy optimization methods remain a powerful workhorse in empirical Rei...
read it

Provably Breaking the Quadratic Error Compounding Barrier in Imitation Learning, Optimally
We study the statistical limits of Imitation Learning (IL) in episodic M...
read it

A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost
Many realworld applications, such as those in medical domains, recommen...
read it

Minimax Sample Complexity for Turnbased Stochastic Game
The empirical success of Multiagent reinforcement learning is encouragi...
read it

Accommodating Picky Customers: Regret Bound and Exploration Complexity for MultiObjective Reinforcement Learning
In this paper we consider multiobjective reinforcement learning where t...
read it

Episodic Linear Quadratic Regulators with Lowrank Transitions
Linear Quadratic Regulators (LQR) achieve enormous successful realworld...
read it

Random Walk Bandits
Bandit learning problems find important applications ranging from medica...
read it

Is Plugin Solver SampleEfficient for Featurebased Reinforcement Learning?
It is believed that a modelbased approach for reinforcement learning (R...
read it

Toward the Fundamental Limits of Imitation Learning
Imitation learning (IL) aims to mimic the behavior of an expert policy i...
read it

Obtaining Adjustable Regularization for Free via Iterate Averaging
Regularization for optimization is a crucial technique to avoid overfitt...
read it

ModelBased MultiAgent RL in ZeroSum Markov Games with NearOptimal Sample Complexity
Modelbased reinforcement learning (RL), which finds an optimal policy u...
read it

On RewardFree Reinforcement Learning with Linear Function Approximation
Rewardfree reinforcement learning (RL) is a framework which is suitable...
read it

Qlearning with Logarithmic Regret
This paper presents the first nonasymptotic result showing that a model...
read it

Preferencebased Reinforcement Learning with FiniteTime Guarantees
Preferencebased Reinforcement Learning (PbRL) replaces reward values in...
read it

ModelBased Reinforcement Learning with ValueTargeted Regression
This paper studies modelbased reinforcement learning (RL) for regret mi...
read it

Provably Efficient Reinforcement Learning with General Value Function Approximation
Value function approximation has demonstrated phenomenal empirical succe...
read it

Is Long Horizon Reinforcement Learning More Difficult Than Short Horizon Reinforcement Learning?
Learning to plan for long horizons is a central challenge in episodic re...
read it

Provably Efficient Exploration for RL with Unsupervised Learning
We study how to use unsupervised learning for efficient exploration in r...
read it

Sketching Transformed Matrices with Applications to Natural Language Processing
Suppose we are given a large matrix A=(a_i,j) that cannot be stored in m...
read it

Does Knowledge Transfer Always Help to Learn a Better Policy?
One of the key approaches to save samples when learning a policy for a r...
read it

Continuous Control with Contexts, Provably
A fundamental challenge in artificial intelligence is to build an agent ...
read it

Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?
Modern deep learning methods provide an effective means to learn good re...
read it

Efficient Symmetric Norm Regression via Linear Sketching
We provide efficient algorithms for overconstrained linear regression pr...
read it

Solving Discounted Stochastic TwoPlayer Games with NearOptimal Time and Sample Complexity
In this paper, we settle the sampling complexity of solving discounted t...
read it

On the Optimality of Sparse ModelBased Planning for Markov Decision Processes
This work considers the sample complexity of obtaining an ϵoptimal poli...
read it

FeatureBased QLearning for TwoPlayer Stochastic Games
Consider a twoplayer zerosum stochastic game where the transition func...
read it

Reinforcement Leaning in Feature Space: Matrix Bandit, Kernels, and Regret Bound
Exploration in reinforcement learning (RL) suffers from the curse of dim...
read it

Learning to Control in Metric Space with Optimal Regret
We study online reinforcement learning for finitehorizon deterministic ...
read it

The OneWay Communication Complexity of Dynamic Time Warping Distance
We resolve the randomized oneway communication complexity of Dynamic Ti...
read it

SampleOptimal Parametric QLearning with Linear Transition Models
Consider a Markov decision process (MDP) that admits a set of stateacti...
read it

Towards a Theoretical Understanding of HashingBased Neural Nets
Parameter reduction has been an important topic in deep learning due to ...
read it

Universal Streaming of Subset Norms
Most known algorithms in the streaming model of computation aim to appro...
read it

On Landscape of Lagrangian Functions and Stochastic Search for Constrained Nonconvex Optimization
We study constrained nonconvex optimization problems in machine learning...
read it

Revisiting Frequency Moment Estimation in Random Order Streams
We revisit one of the classic problems in the data stream literature, na...
read it

Variance Reduction Methods for Sublinear Reinforcement Learning
This work considers the problem of provably optimal reinforcement learni...
read it

Sensitivity Sampling Over Dynamic Geometric Data Streams with Applications to kClustering
Sensitivity based sampling is crucial for constructing nearlyoptimal co...
read it

Misspecified Nonconvex Statistical Optimization for Phase Retrieval
Existing nonconvex statistical optimization theory and methods crucially...
read it

Approximate Convex Hull of Data Streams
Given a finite set of points P ⊆R^d, we would like to find a small subse...
read it

On Quadratic Convergence of DC Proximal Newton Algorithm for Nonconvex Sparse Learning in High Dimensions
We propose a DC proximal Newton algorithm for solving nonconvex regulari...
read it

Online Factorization and Partition of Complex Networks From Random Walks
Finding the reduceddimensional structure is critical to understanding c...
read it

Dropping Convexity for More Efficient and Scalable Online Multiview Learning
Multiview representation learning is very popular for latent factor anal...
read it
Lin F. Yang
is this you? claim profile