
Markov Decision Processes with Long-Term Average Constraints
We consider the problem of constrained Markov Decision Process (CMDP) wh...
Energy-Efficient and Federated Meta-Learning via Projected Stochastic Gradient Ascent
In this paper, we propose an energy-efficient federated meta-learning fr...
Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm
Many engineering problems have multiple objectives, and the overall aim ...
Quantum Causal Inference in the Presence of Hidden Common Causes: an Entropic Approach
Quantum causality is an emerging field of study which has the potential ...
AdaPool: A Diurnal-Adaptive Fleet Management Framework using Model-Free Deep Reinforcement Learning and Change Point Detection
This paper introduces an adaptive model-free deep reinforcement approach...
DeepFreight: A Model-free Deep-reinforcement-learning-based Algorithm for Multi-transfer Freight Delivery
With the freight delivery demands and shipping costs increasing rapidly,...
Learning Monopoly Gameplay: A Hybrid Model-Free Deep Reinforcement Learning and Imitation Learning Approach
Learning how to adapt and make real-time informed decisions in dynamic a...
Quantum Entropic Causal Inference
As quantum computing and networking nodes scale-up, important open quest...
Communication Efficient Parallel Reinforcement Learning
We consider the problem where M agents interact with M identical and ind...
Multi-Agent Multi-Armed Bandits with Limited Communication
We consider the problem where N agents collaboratively interact with an ...
A Supervised Learning Approach for Robust Health Monitoring using Face Videos
Monitoring of cardiovascular activity is highly desired and can enable n...
Model Free Reinforcement Learning Algorithm for Stationary Mean field Equilibrium for Multiple Types of Agents
We consider a multi-agent Markov strategic interaction over an infinite ...
A multi-agent evolutionary robotics framework to train spiking neural networks
A novel multi-agent evolutionary robotics (ER) based framework, inspired...
PassGoodPool: Joint Passengers and Goods Fleet Management with Reinforcement Learning aided Pricing, Matching, and Route Planning
In this paper, we present a dynamic, demand aware, and pricing-based mat...
Blind Decision Making: Reinforcement Learning with Delayed Observations
Reinforcement learning typically assumes that the state update from the ...
DART: aDaptive Accept RejecT for non-linear top-K subset identification
We consider the bandit problem of selecting K out of N arms at each time...
Cross Layer Optimization and Distributed Reinforcement Learning Approach for Tile-Based 360 Degree Wireless Video Streaming
Wirelessly streaming high quality 360 degree videos is still a challengi...
Secure Regenerating Codes for Reducing Storage and Bootstrap Costs in Sharded Blockchains
Blockchain is a distributed ledger with wide applications. Due to the in...
A Distributed Model-Free Ride-Sharing Approach for Joint Matching, Pricing, and Dispatching using Deep Reinforcement Learning
Significant development of ridesharing services presents a plethora of ...
An Embedded Index Code Construction Using Subpacketization
A variant of the index coding problem (ICP), the embedded index coding p...
FlexPool: A Distributed Model-Free Deep Reinforcement Learning Algorithm for Joint Passengers & Goods Transportation
The growth in online goods delivery is causing a dramatic surge in urban...
Multi-Stage Hybrid Federated Learning over Large-Scale Wireless Fog Networks
One of the popular methods for distributed machine learning (ML) is fede...
Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints
In the optimization of dynamical systems, the variables typically have c...
From Federated Learning to Fog Learning: Towards Large-Scale Distributed Machine Learning in Heterogeneous Wireless Networks
Contemporary network architectures are pushing computing tasks from the ...
Modeling and Optimization of Latency in Erasure-coded Storage Systems
As consumers are increasingly engaged in social networking and E-commerc...
Efficient Gaussian Process Bandits by Believing only Informative Actions
Bayesian optimization is a framework for global search via maximum a pos...
Model-Free Algorithm and Regret Analysis for MDPs with Peak Constraints
In the optimization of dynamic systems, the variables typically have con...
Grand Challenges of Resilience: Autonomous System Resilience through Design and Runtime Measures
A set of about 80 researchers, practitioners, and federal agency program...
Optimal Server Selection for Straggler Mitigation
The performance of large-scale distributed compute systems is adversely ...
A Distributed Model-Free Algorithm for Multi-hop Ride-sharing using Deep Reinforcement Learning
The growth of autonomous vehicles, ridesharing systems, and self driving...
Q-GADMM: Quantized Group ADMM for Communication Efficient Decentralized Machine Learning
In this paper, we propose a communication-efficient decentralized machin...
Escaping Saddle Points for Zeroth-order Nonconvex Optimization using Estimated Gradient Descent
Gradient descent and its variants are widely used in machine learning. H...
Encoders and Decoders for Quantum Expander Codes Using Machine Learning
Quantum key distribution (QKD) allows two distant parties to share encry...
A Reinforcement Learning Based Approach for Joint Multi-Agent Decision Making
Reinforcement Learning (RL) is being increasingly applied to optimize co...
Straggler Mitigation with Tiered Gradient Codes
Coding theoretic techniques have been proposed for synchronous Gradient ...
GADMM: Fast and Communication Efficient Framework for Distributed Machine Learning
When the data is distributed across multiple servers, efficient data exc...
Reinforcement Learning for Mean Field Game
Stochastic games provide a framework for interactions among multi-agents...
DeepPool: Distributed Model-free Algorithm for Ride-sharing using Deep Reinforcement Learning
The success of modern ridesharing platforms crucially depends on the pr...
A Proximal Jacobian ADMM Approach for Fast Massive MIMO Signal Detection in Low-Latency Communications
One of the 5G promises is to provide Ultra Reliable Low Latency Communic...
Multi-tier Caching Analysis in CDN-based Over-the-top Video Streaming Systems
Internet video traffic has been rapidly increasing and is further e...
Joint Information Freshness and Completion Time Optimization for Vehicular Networks
The demand for real-time cloud applications has seen an unprecedented gr...
A Robust Algorithm for Tile-based 360-degree Video Streaming with Uncertain FoV Estimation
We propose a robust scheme for streaming 360-degree immersive videos to ...
Optimized Portfolio Contracts for Bidding the Cloud
Amazon EC2 provides two most popular pricing schemes: i) the costly on...
Regret Bounds for Stochastic Combinatorial Multi-Armed Bandits with Linear Space Complexity
Many real-world problems face the dilemma of choosing best K out of N op...
GroupCast: Preference-Aware Cooperative Video Streaming with Scalable Video Coding
In this paper, we propose a preference-aware cooperative video streaming...
Covfefe: A Computer Vision Approach For Estimating Force Exertion
Cumulative exposure to repetitive and forceful activities may lead to mu...
On the Optimal Broadcast Rate of the Two-Sender Unicast Index Coding Problem with Fully-Participated Interactions
The problem of two-sender unicast index coding consists of two senders a...
A Low Complexity Detection Algorithm Based on Alternating Minimization
In this paper, we propose an algorithm based on the Alternating Minimiza...
Optimal Linear Broadcast Rates of the Two-Sender Unicast Index Coding Problem with Fully-Participated Interactions
The two-sender unicast index coding problem consists of finding optimal ...
QoE-Aware Resource Allocation for Small Cells
In this paper, we study the problem of Quality of Experience (QoE) aware...
Vaneet Aggarwal
Assistant Professor with the School of Industrial Engineering at Purdue University