
Robust Generative Adversarial Imitation Learning via Local Lipschitzness
We explore methodologies to improve the robustness of generative adversa...
read it

Convex Optimization for Parameter Synthesis in MDPs
Probabilistic model checking aims to prove whether a Markov decision pro...
read it

Probabilistic Control of Heterogeneous Swarms Subject to Graph Temporal Logic Specifications: A Decentralized and Scalable Approach
We develop a probabilistic control algorithm, , for swarms of agents wit...
read it

NonParametric NeuroAdaptive Control Subject to Task Specifications
We develop a learningbased algorithm for the control of robotic systems...
read it

Learning to Reach, Swim, Walk and Fly in One Trial: DataDriven Control with Scarce Data and Side Information
We develop a learningbased control algorithm for unknown dynamical syst...
read it

Robust Training in High Dimensions via Block Coordinate Geometric Median Descent
Geometric median (Gm) is a classical method in statistics for achieving ...
read it

Verifiable and Compositional Reinforcement Learning Systems
We propose a novel framework for verifiable and compositional reinforcem...
read it

TaskGuided Inverse Reinforcement Learning Under Partial Information
We study the problem of inverse reinforcement learning (IRL), where the ...
read it

UncertaintyAware Signal Temporal Logic Inference
Temporal logic inference is the process of extracting formal description...
read it

Identity Concealment Games: How I Learned to Stop Revealing and Love the Coincidences
In an adversarial environment, a hostile player performing a task may be...
read it

Efficient Strategy Synthesis for MDPs with Resource Constraints
We consider qualitative strategy synthesis for the formalism called cons...
read it

PolynomialTime Algorithms for MultiAgent MinimalCapacity Planning
We study the problem of minimizing the resource capacity of autonomous a...
read it

Learning Linear Temporal Properties from Noisy Data: A MaxSAT Approach
We address the problem of inferring descriptions of system behavior usin...
read it

SelfSupervised Online Reward Shaping in SparseReward Environments
We propose a novel reinforcement learning framework that performs selfs...
read it

Function Approximation via Sparse Random Features
Random feature methods have been successful in various machine learning ...
read it

A 3D Printing Hexacopter: Design and Demonstration
3D printing using robots has garnered significant interest in manufactur...
read it

Safe MultiAgent Reinforcement Learning via Shielding
Multiagent reinforcement learning (MARL) has been increasingly used in ...
read it

Multiple Plans are Better than One: Diverse Stochastic Planning
In planning problems, it is often challenging to fully model the desired...
read it

Towards online monitoring and datadriven control: a study of segmentation algorithms for infrared images of the powder bed
An increasing number of selective laser sintering and selective laser me...
read it

Assured Autonomy: Path Toward Living With Autonomous Systems We Can Trust
The challenge of establishing assurance in autonomy is rapidly attractin...
read it

MinimumViolation Planning for Autonomous Systems: Theoretical and Practical Considerations
This paper considers the problem of computing an optimal trajectory for ...
read it

Robust FiniteState Controllers for Uncertain POMDPs
Uncertain partially observable Markov decision processes (uPOMDPs) allow...
read it

BPRRT: Barrier Pair Synthesis for Temporal Logic Motion Planning
For a nonlinear system (e.g. a robot) with its continuous state space tr...
read it

NearOptimal Reactive Synthesis Incorporating Runtime Information
We consider the problem of optimal reactive synthesis  compute a strate...
read it

Distributed Policy Synthesis of MultiAgent Systems With Graph Temporal Logic Specifications
We study the distributed synthesis of policies for multiagent systems t...
read it

Qualitative Controller Synthesis for Consumption Markov Decision Processes
Consumption Markov Decision Processes (CMDPs) are probabilistic decision...
read it

PrivacyPreserving Policy Synthesis in Markov Decision Processes
In decisionmaking problems, the actions of an agent may reveal sensitiv...
read it

Scalable Synthesis of MinimumInformation LinearGaussian Control by Distributed Optimization
We consider a discretetime linearquadratic Gaussian control problem in...
read it

Scalable Synthesis of MinimumInformation LinearGaussianControl by Distributed Optimization
We consider a discretetime linearquadratic Gaussian control problem in...
read it

Distributed Beamforming for Agents with Localization Errors
We consider a scenario in which a group of agents aim to collectively tr...
read it

Verifiable RNNBased Policies for POMDPs Under Temporal Logic Constraints
Recurrent neural networks (RNNs) have emerged as an effective representa...
read it

Adaptive Teaching of Temporal Logic Formulas to Learners with Preferences
Machine teaching is an algorithmic framework for teaching a target hypot...
read it

Active TaskInferenceGuided Deep Inverse Reinforcement Learning
In inverse reinforcement learning (IRL), given a Markov decision process...
read it

Policy Synthesis for Factored MDPs with Graph Temporal Logic Specifications
We study the synthesis of policies for multiagent systems to implement ...
read it

ScenarioBased Verification of Uncertain MDPs
We consider Markov decision processes (MDPs) in which the transition pro...
read it

Learning and Planning for TimeVarying MDPs Using Maximum Likelihood Estimation
This paper proposes a formal approach to learning and planning for agent...
read it

Controller Synthesis of Wind Turbine Generator and Energy Storage System with Stochastic Wind Variations under Temporal Logic Specifications
In this paper, we present a controller synthesis approach for wind turbi...
read it

Strategy Synthesis for SurveillanceEvasion Games with LearningEnabled Visibility Optimization
This paper studies a twoplayer game with a quantitative surveillance re...
read it

Online Synthesis for Runtime Enforcement of Safety in MultiAgent Systems
A shield is attached to a system to guarantee safety by correcting the s...
read it

Decentralized Runtime Synthesis of Shields for MultiAgent Systems
A shield is attached to a system to guarantee safety by correcting the s...
read it

Online Active Perception for Partially Observable Markov Decision Processes with Limited Budget
Active perception strategies enable an agent to selectively gather infor...
read it

The Dirichlet Mechanism for Differential Privacy on the Unit Simplex
As members of a network share more information with each other and netwo...
read it

Differentially Private Controller Synthesis With Metric Temporal Logic Specifications
Privacy is an important concern in various multiagent systems in which d...
read it

Identifying LowDimensional Structures in Markov Chains: A Nonnegative Matrix Factorization Approach
A variety of queries about stochastic systems boil down to study of Mark...
read it

Controller Synthesis for MultiAgent Systems With Intermittent Communication: A Metric Temporal Logic Approach
This paper develops a controller synthesis approach for a multiagent sy...
read it

Joint Inference of Reward Machines and Policies for Reinforcement Learning
Incorporating highlevel knowledge is an effective way to expedite reinf...
read it

Transfer of Temporal Logic Formulas in Reinforcement Learning
Transferring highlevel knowledge from a source task to a target task is...
read it

An EncoderDecoder Based Approach for Anomaly Detection with Application in Additive Manufacturing
We present a novel unsupervised deep learning approach that utilizes the...
read it

Synthesis of Provably Correct Autonomy Protocols for Shared Control
We synthesize shared control protocols subject to probabilistic temporal...
read it

RewardBased Deception with Cognitive Bias
Deception plays a key role in adversarial or strategic interactions for ...
read it
Ufuk Topcu
is this you? claim profile
Assistant Professor Department of Aerospace Engineering and Engineering Mechanics at The University of Texas