
Robust Generative Adversarial Imitation Learning via Local Lipschitzness
We explore methodologies to improve the robustness of generative adversa...
Convex Optimization for Parameter Synthesis in MDPs
Probabilistic model checking aims to prove whether a Markov decision pro...
Probabilistic Control of Heterogeneous Swarms Subject to Graph Temporal Logic Specifications: A Decentralized and Scalable Approach
We develop a probabilistic control algorithm for swarms of agents wit...
Non-Parametric Neuro-Adaptive Control Subject to Task Specifications
We develop a learning-based algorithm for the control of robotic systems...
Learning to Reach, Swim, Walk and Fly in One Trial: Data-Driven Control with Scarce Data and Side Information
We develop a learning-based control algorithm for unknown dynamical syst...
Robust Training in High Dimensions via Block Coordinate Geometric Median Descent
Geometric median (Gm) is a classical method in statistics for achieving ...
Verifiable and Compositional Reinforcement Learning Systems
We propose a novel framework for verifiable and compositional reinforcem...
Task-Guided Inverse Reinforcement Learning Under Partial Information
We study the problem of inverse reinforcement learning (IRL), where the ...
Uncertainty-Aware Signal Temporal Logic Inference
Temporal logic inference is the process of extracting formal description...
Identity Concealment Games: How I Learned to Stop Revealing and Love the Coincidences
In an adversarial environment, a hostile player performing a task may be...
Efficient Strategy Synthesis for MDPs with Resource Constraints
We consider qualitative strategy synthesis for the formalism called cons...
Polynomial-Time Algorithms for Multi-Agent Minimal-Capacity Planning
We study the problem of minimizing the resource capacity of autonomous a...
Learning Linear Temporal Properties from Noisy Data: A MaxSAT Approach
We address the problem of inferring descriptions of system behavior usin...
Self-Supervised Online Reward Shaping in Sparse-Reward Environments
We propose a novel reinforcement learning framework that performs self-s...
Function Approximation via Sparse Random Features
Random feature methods have been successful in various machine learning ...
A 3D Printing Hexacopter: Design and Demonstration
3D printing using robots has garnered significant interest in manufactur...
Safe Multi-Agent Reinforcement Learning via Shielding
Multiagent reinforcement learning (MARL) has been increasingly used in ...
Multiple Plans are Better than One: Diverse Stochastic Planning
In planning problems, it is often challenging to fully model the desired...
Towards online monitoring and data-driven control: a study of segmentation algorithms for infrared images of the powder bed
An increasing number of selective laser sintering and selective laser me...
Assured Autonomy: Path Toward Living With Autonomous Systems We Can Trust
The challenge of establishing assurance in autonomy is rapidly attractin...
Minimum-Violation Planning for Autonomous Systems: Theoretical and Practical Considerations
This paper considers the problem of computing an optimal trajectory for ...
Robust Finite-State Controllers for Uncertain POMDPs
Uncertain partially observable Markov decision processes (uPOMDPs) allow...
BP-RRT: Barrier Pair Synthesis for Temporal Logic Motion Planning
For a nonlinear system (e.g. a robot) with its continuous state space tr...
Near-Optimal Reactive Synthesis Incorporating Runtime Information
We consider the problem of optimal reactive synthesis: compute a strate...
Distributed Policy Synthesis of Multi-Agent Systems With Graph Temporal Logic Specifications
We study the distributed synthesis of policies for multiagent systems t...
Qualitative Controller Synthesis for Consumption Markov Decision Processes
Consumption Markov Decision Processes (CMDPs) are probabilistic decision...
Privacy-Preserving Policy Synthesis in Markov Decision Processes
In decision-making problems, the actions of an agent may reveal sensitiv...
Scalable Synthesis of Minimum-Information Linear-Gaussian Control by Distributed Optimization
We consider a discrete-time linear-quadratic Gaussian control problem in...
Distributed Beamforming for Agents with Localization Errors
We consider a scenario in which a group of agents aim to collectively tr...
Verifiable RNN-Based Policies for POMDPs Under Temporal Logic Constraints
Recurrent neural networks (RNNs) have emerged as an effective representa...
Adaptive Teaching of Temporal Logic Formulas to Learners with Preferences
Machine teaching is an algorithmic framework for teaching a target hypot...
Active Task-Inference-Guided Deep Inverse Reinforcement Learning
In inverse reinforcement learning (IRL), given a Markov decision process...
Policy Synthesis for Factored MDPs with Graph Temporal Logic Specifications
We study the synthesis of policies for multiagent systems to implement ...
Scenario-Based Verification of Uncertain MDPs
We consider Markov decision processes (MDPs) in which the transition pro...
Learning and Planning for Time-Varying MDPs Using Maximum Likelihood Estimation
This paper proposes a formal approach to learning and planning for agent...
Controller Synthesis of Wind Turbine Generator and Energy Storage System with Stochastic Wind Variations under Temporal Logic Specifications
In this paper, we present a controller synthesis approach for wind turbi...
Strategy Synthesis for Surveillance-Evasion Games with Learning-Enabled Visibility Optimization
This paper studies a two-player game with a quantitative surveillance re...
Online Synthesis for Runtime Enforcement of Safety in Multi-Agent Systems
A shield is attached to a system to guarantee safety by correcting the s...
Decentralized Runtime Synthesis of Shields for Multi-Agent Systems
A shield is attached to a system to guarantee safety by correcting the s...
Online Active Perception for Partially Observable Markov Decision Processes with Limited Budget
Active perception strategies enable an agent to selectively gather infor...
The Dirichlet Mechanism for Differential Privacy on the Unit Simplex
As members of a network share more information with each other and netwo...
Differentially Private Controller Synthesis With Metric Temporal Logic Specifications
Privacy is an important concern in various multiagent systems in which d...
Identifying Low-Dimensional Structures in Markov Chains: A Nonnegative Matrix Factorization Approach
A variety of queries about stochastic systems boil down to study of Mark...
Controller Synthesis for Multi-Agent Systems With Intermittent Communication: A Metric Temporal Logic Approach
This paper develops a controller synthesis approach for a multiagent sy...
Joint Inference of Reward Machines and Policies for Reinforcement Learning
Incorporating high-level knowledge is an effective way to expedite reinf...
Transfer of Temporal Logic Formulas in Reinforcement Learning
Transferring high-level knowledge from a source task to a target task is...
An Encoder-Decoder Based Approach for Anomaly Detection with Application in Additive Manufacturing
We present a novel unsupervised deep learning approach that utilizes the...
Synthesis of Provably Correct Autonomy Protocols for Shared Control
We synthesize shared control protocols subject to probabilistic temporal...
Reward-Based Deception with Cognitive Bias
Deception plays a key role in adversarial or strategic interactions for ...
Ufuk Topcu
Assistant Professor, Department of Aerospace Engineering and Engineering Mechanics at The University of Texas