
-
Derivative-Free Policy Optimization for Risk-Sensitive and Robust Control Design: Implicit Regularization and Sample Complexity
Direct policy search serves as one of the workhorses in modern reinforce...
read it
-
Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs
We consider model-free reinforcement learning (RL) in non-stationary Mar...
read it
-
Reinforcement Learning in Non-Stationary Discrete-Time Linear-Quadratic Mean-Field Games
In this paper, we study large population multi-agent reinforcement learn...
read it
-
Model-Based Multi-Agent RL in Zero-Sum Markov Games with Near-Optimal Sample Complexity
Model-based reinforcement learning (RL), which finds an optimal policy u...
read it
-
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
Monte-Carlo planning, as exemplified by Monte-Carlo Tree Search (MCTS), ...
read it
-
Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
Multi-agent reinforcement learning (MARL) under partial observability ha...
read it
-
Approximate Equilibrium Computation for Discrete-Time Linear-Quadratic Mean-Field Games
While the topic of mean-field games (MFGs) has a relatively long history...
read it
-
Asynchronous Policy Evaluation in Distributed Reinforcement Learning over Networks
This paper proposes a fully asynchronous scheme for policy evaluation of...
read it
-
Distributed Adaptive Newton Methods with Globally Superlinear Convergence
This paper considers the distributed optimization problem over a network...
read it
-
Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances
Multi-agent reinforcement learning (MARL) has long been a significant an...
read it
-
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Recent years have witnessed significant advances in reinforcement learni...
read it
-
Non-Cooperative Inverse Reinforcement Learning
Making decisions in the presence of a strategic opponent requires one to...
read it
-
Policy Optimization for H_2 Linear Control with H_∞ Robustness Guarantee: Implicit Regularization and Global Convergence
Policy optimization (PO) is a key ingredient for reinforcement learning ...
read it
-
Strategic Inference with a Single Private Sample
Motivated by applications in cyber security, we develop a simple game mo...
read it
-
Online Planning for Decentralized Stochastic Control with Partial History Sharing
In decentralized stochastic control, standard approaches for sequential ...
read it
-
Optimal Hierarchical Signaling for Quadratic Cost Measures and General Distributions: A Copositive Program Characterization
In this paper, we address the problem of optimal hierarchical signaling ...
read it
-
A Communication-Efficient Multi-Agent Actor-Critic Algorithm for Distributed Reinforcement Learning
This paper considers a distributed reinforcement learning problem in whi...
read it
-
Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies
Policy gradient (PG) methods are a widely used reinforcement learning me...
read it
-
Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games
We study the global convergence of policy optimization for finding the N...
read it
-
A Multi-Agent Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning
This paper extends off-policy reinforcement learning to the multi-agent ...
read it
-
A Game of Drones: Cyber-Physical Security of Time-Critical UAV Applications with Cumulative Prospect Theory Perceptions and Valuations
The effective deployment of unmanned aerial vehicle (UAV) systems and se...
read it
-
Deception-As-Defense Framework for Cyber-Physical Systems
We introduce deceptive signaling framework as a new defense measure agai...
read it
-
A Game Theoretical Error-Correction Framework for Secure Traffic-Sign Classification
We introduce a game theoretical error-correction framework to design cla...
read it
-
Robust Sensor Design Against Multiple Attackers with Misaligned Control Objectives
We introduce a robust sensor design framework to provide defense against...
read it
-
Communication-Efficient Distributed Reinforcement Learning
This paper studies the distributed reinforcement learning (DRL) problem ...
read it
-
Finite-Sample Analyses for Fully Decentralized Multi-Agent Reinforcement Learning
Despite the increasing interest in multi-agent reinforcement learning (M...
read it
-
Distributed Learning of Average Belief Over Networks Using Sequential Observations
This paper addresses the problem of distributed learning of average beli...
read it
-
Revisiting Client Puzzles for State Exhaustion Attacks Resilience
In this paper, we address the challenges facing the adoption of client p...
read it
-
Resilient Output Synchronization of Heterogeneous Multi-agent Systems under Cyber-Physical Attacks
In this paper, we first describe, supported with analysis, the adverse e...
read it
-
On Remote Estimation with Multiple Communication Channels
This paper considers a sequential sensor scheduling and remote estimatio...
read it
-
Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents
We consider the problem of fully decentralized multi-agent reinforcement...
read it
-
Reliable Intersection Control in Non-cooperative Environments
We propose a reliable intersection control mechanism for strategic auton...
read it
-
Graph-Theoretic Framework for Unified Analysis of Observability and Data Injection Attacks in the Smart Grid
In this paper, a novel graph-theoretic framework is proposed to generali...
read it
-
Countries' Survival in Networked International Environments
This paper applies a recently developed power allocation game in Li and ...
read it
-
Generalized Colonel Blotto Game
Competitive resource allocation between adversarial decision makers aris...
read it
-
A Game-Theoretic Method for Multi-Period Demand Response: Revenue Maximization, Power Allocation, and Asymptotic Behavior
We study a multi-period demand response management problem in the smart ...
read it
-
Discrete-Time Polar Opinion Dynamics with Susceptibility
This paper considers a discrete-time opinion dynamics model in which eac...
read it
-
Strategic Communication Between Prospect Theoretic Agents over a Gaussian Test Channel
In this paper, we model a Stackelberg game in a simple Gaussian test cha...
read it
-
Evolution of Social Power in Social Networks with Dynamic Topology
The recently proposed DeGroot-Friedkin model describes the dynamical evo...
read it
-
On the Analysis of the DeGroot-Friedkin Model with Dynamic Relative Interaction Matrices
This paper analyses the DeGroot-Friedkin model for evolution of the indi...
read it
-
Adaptive-Rate Compressive Sensing Using Side Information
We provide two novel adaptive-rate compressive sensing (CS) strategies f...
read it