
-
Adaptive Federated Learning and Digital Twin for Industrial Internet of Things
Industrial Internet of Things (IoT) enables distributed intelligent serv...
read it
-
Learning the Linear Quadratic Regulator from Nonlinear Observations
We introduce a new problem setting for continuous control called the LQR...
read it
-
PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Direct policy gradient methods for reinforcement learning are a successf...
read it
-
Information Theoretic Regret Bounds for Online Nonlinear Control
This work studies the problem of sequential control in an unknown, nonli...
read it
-
FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs
In order to deal with the curse of dimensionality in reinforcement learn...
read it
-
Provably Efficient Model-based Policy Adaptation
The high sample complexity of reinforcement learning challenges its use ...
read it
-
Constrained episodic reinforcement learning in concave-convex and knapsack settings
We propose an algorithm for tabular episodic reinforcement learning with...
read it
-
Arbitrary Style Transfer via Multi-Adaptation Network
Arbitrary style transfer is a significant topic with both research value...
read it
-
Exploration in Action Space
Parameter space exploration methods with black-box optimization have rec...
read it
-
Corruption Robust Exploration in Episodic Reinforcement Learning
We initiate the study of multi-stage episodic reinforcement learning und...
read it
-
Policy Poisoning in Batch Reinforcement Learning and Control
We study a security threat to batch reinforcement learning and control w...
read it
-
Optimal Sketching for Kronecker Product Regression and Low Rank Approximation
We study the Kronecker product regression problem, in which the design m...
read it
-
Imitation Learning as f-Divergence Minimization
We address the problem of imitation learning with multi-modal demonstrat...
read it
-
Provably Efficient Imitation Learning from Observation Alone
We study Imitation Learning (IL) from Observations alone (ILFO) in large...
read it
-
Efficient Model-free Reinforcement Learning in Metric Spaces
Model-free Reinforcement Learning (RL) algorithms such as Q-learning [Wa...
read it
-
Outcome-Driven Clustering of Acute Coronary Syndrome Patients using Multi-Task Neural Network with Attention
Cluster analysis aims at separating patients into phenotypically heterog...
read it
-
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective
Black-box optimizers that explore in parameter space have often been sho...
read it
-
Model-Based Reinforcement Learning in Contextual Decision Processes
We study the sample complexity of model-based reinforcement learning in ...
read it
-
Contextual Memory Trees
We design and study a Contextual Memory Tree (CMT), a learning memory co...
read it
-
Truncated Horizon Policy Search: Combining Reinforcement Learning & Imitation Learning
In this paper, we propose to combine imitation and reinforcement learnin...
read it
-
Dual Policy Iteration
Recently, a novel class of Approximate Policy Iteration (API) algorithms...
read it
-
Recurrent Predictive State Policy Networks
We introduce Recurrent Predictive State Policy (RPSP) networks, a recurr...
read it
-
Sketching for Kronecker Product Regression and P-splines
TensorSketch is an oblivious linear sketch introduced in Pagh'13 and lat...
read it
-
Predictive-State Decoders: Encoding the Future into Recurrent Networks
Recurrent neural networks (RNNs) are a vital modeling technique that rel...
read it
-
Risk-Aware Algorithms for Adversarial Contextual Bandits
In this work we consider adversarial contextual bandits with risk constr...
read it