
-
Risk-Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance
This paper investigates the optimization problem of an infinite stage di...
read it
-
SOAC: The Soft Option Actor-Critic Architecture
The option framework has shown great promise by automatically extracting...
read it
-
Embedding-based Retrieval in Facebook Search
Search in social networks such as Facebook poses different challenges th...
read it
-
Wasserstein Distance guided Adversarial Imitation Learning with Reward Shape Exploration
The generative adversarial imitation learning (GAIL) has provided an adv...
read it
-
Distributional Soft Actor Critic for Risk Sensitive Learning
Most of reinforcement learning (RL) algorithms aim at maximizing the exp...
read it
-
Multi-action Offline Policy Learning with Bayesian Optimization
We study an offline multi-action policy learning algorithm based on doub...
read it
-
An Overview for Markov Decision Processes in Queues and Networks
Markov decision processes (MDPs) in queues and networks have been an int...
read it
-
Optimal Asynchronous Dynamic Policies in Energy-Efficient Data Centers
In this paper, we use a Markov decision process to find optimal asynchro...
read it
-
Group-Server Queues
By analyzing energy-efficient management of data centers, this paper pro...
read it