
Reinforcement Learning under Model Mismatch
We study reinforcement learning under model misspecification, where we d...
read it

Bayesian Poolbased Active Learning With Abstention Feedbacks
We study poolbased active learning with abstention feedbacks, where a l...
read it

SAGA and Restricted Strong Convexity
SAGA is a fast incremental gradient method on the finite sum problem and...
read it

Linear convergence of SDCA in statistical estimation
In this paper, we consider stochastic dual coordinate (SDCA) without st...
read it

Outlier Robust Online Learning
We consider the problem of learning from noisy data in practical setting...
read it

Linear Convergence of SVRG in Statistical Estimation
SVRG and its variants are among the state of art optimization algorithms...
read it

Online Nonnegative Matrix Factorization with General Divergences
We develop a unified and systematic framework for performing online nonn...
read it

Accelerated Stochastic Mirror Descent Algorithms For Composite Nonstrongly Convex Optimization
We consider the problem of minimizing the sum of an average function of ...
read it

Adaptive Maximization of Pointwise Submodular Functions With Budget Constraint
We study the worstcase adaptive optimization problem with budget constr...
read it

Social Trust Prediction via Maxnorm Constrained 1bit Matrix Completion
Social trust prediction addresses the significant problem of exploring i...
read it

Efficient Online Minimization for LowRank Subspace Clustering
Lowrank representation (LRR) has been a significant method for segmenti...
read it

Distributed Robust Learning
We propose a framework for distributed robust statistical learning on b...
read it

Ensemble Robustness and Generalization of Stochastic Deep Learning Algorithms
The question why deep learning algorithms generalize so well has attract...
read it

Noisy Sparse Subspace Clustering
This paper considers the problem of subspace clustering under noise. Spe...
read it

Scaling Up Robust MDPs by Reinforcement Learning
We consider largescale Markov decision processes (MDPs) with parameter ...
read it

Improved Graph Clustering
Graph clustering involves the task of dividing nodes into clusters, so t...
read it

Clustering Partially Observed Graphs via Convex Optimization
This paper considers the problem of clustering a partially observed unwe...
read it

Matrix completion with column manipulation: Nearoptimal samplerobustnessrank tradeoffs
This paper considers the problem of matrix completion when some number o...
read it

Robust PCA via Outlier Pursuit
Singular Value Decomposition (and Principal Component Analysis) is one o...
read it

Exact Subspace Segmentation and Outlier Detection by LowRank Representation
In this work, we address the following matrix recovery problem: suppose ...
read it

Fast Global Convergence via Landscape of Empirical Loss
While optimizing convex objective (loss) functions has been a powerhouse...
read it

Deep Mean Field Games for Learning Optimal Behavior Policy of Large Populations
We consider the problem of representing a large population's behavior po...
read it

A MultiState Diagnosis and Prognosis Framework with Feature Learning for Tool Condition Monitoring
In this paper, a multistate diagnosis and prognosis (MDP) framework is ...
read it

ProjectionFree Algorithms in Statistical Estimation
FrankWolfe algorithm (FW) and its variants have gained a surge of inter...
read it

Nonlinear Distributional Gradient TemporalDifference Learning
We devise a distributional variant of gradient temporaldifference (TD) ...
read it

Robust Hypothesis Testing Using Wasserstein Uncertainty Sets
We develop a novel computationally efficient and general framework for r...
read it

Online Saddle Point Problem with Applications to Constrained Online Convex Optimization
We study an online saddle point problem where at each iteration a pair o...
read it

CommunicationEfficient ProjectionFree Algorithm for Distributed Optimization
Distributed optimization has gained a surge of interest in recent years....
read it

Learning Deep Mean Field Games for Modeling Large Population Behavior
We consider the problem of representing collective behavior of large pop...
read it

RiskAverse Stochastic Convex Bandit
Motivated by applications in clinical trials and finance, we study the p...
read it

Value Propagation for Decentralized Networked Deep Multiagent Reinforcement Learning
We consider the networked multiagent reinforcement learning (MARL) prob...
read it

RobustSTL: A Robust SeasonalTrend Decomposition Algorithm for Long Time Series
Decomposing complex time series into trend, seasonality, and remainder c...
read it

LSwarm: Efficient Collision Avoidance for Large Swarms with Coverage Constraints in Complex Urban Scenes
In this paper, we address the problem of collision avoidance for a swarm...
read it

A Unified Framework for Marketing Budget Allocation
While marketing budget allocation has been studied for decades in tradit...
read it

Large Scale Markov Decision Processes with Changing Rewards
We consider Markov Decision Processes (MDPs) where the rewards are unkno...
read it

Inductive Bias of Gradient Descent based Adversarial Training on Separable Data
Adversarial training is a principled approach for training robust neural...
read it

Bayesian Active Learning With Abstention Feedbacks
We study poolbased active learning with abstention feedbacks where a la...
read it

Competing Against Equilibria in ZeroSum Games with Evolving Payoffs
We study the problem of repeated play in a zerosum game in which the pa...
read it

Robustness and Tractability for Nonconvex Mestimators
We investigate two important properties of Mestimator, namely, robustne...
read it