
Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep ActorCritic Reinforcement Learning
Constructing agents with planning capabilities has long been one of the ...
read it

EventTriggered Multiagent Reinforcement Learning with Communication under Limitedbandwidth Constraint
Communicating with each other in a distributed manner and behaving as a ...
read it

Dynamic Horizon Value Estimation for Modelbased Reinforcement Learning
Existing modelbased value expansion methods typically leverage a world ...
read it

Dynamic Knapsack Optimization Towards Efficient MultiChannel Sequential Advertising
In Ecommerce, advertising is essential for merchants to reach their tar...
read it

TripleGAIL: A MultiModal Imitation Learning Framework with Generative Adversarial Nets
Generative adversarial imitation learning (GAIL) has shown promising res...
read it

Continuous Multiagent Control using Collective Behavior Entropy for LargeScale Home Energy Management
With the increasing popularity of electric vehicles, distributed energy ...
read it

Learning to Accelerate Heuristic Searching for LargeScale Maximum Weighted bMatching Problems in Online Advertising
Bipartite bmatching is fundamental in algorithm design, and has been wi...
read it

CausalVAE: Structured Causal Disentanglement in Variational Autoencoder
Learning disentanglement aims at finding a low dimensional representatio...
read it

Efficient Deep Reinforcement Learning through Policy Transfer
Transfer Learning (TL) has shown great potential to accelerate Reinforce...
read it

Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework
Transfer Learning has shown great potential to enhance the singleagent ...
read it

KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge
Reinforcement learning agents usually learn from scratch, which requires...
read it

Qvalue Path Decomposition for Deep Multiagent Reinforcement Learning
Recently, deep multiagent reinforcement learning (MARL) has become a hig...
read it

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning
In many realworld settings, a team of cooperative agents must learn to ...
read it

There is Limited Correlation between Coverage and Robustness for Deep Neural Networks
Deep neural networks (DNN) are increasingly applied in safetycritical s...
read it

Diverse Behavior Is What Game AI Needs: Generating Varied HumanLike Playing Styles Using Evolutionary MultiObjective Deep Reinforcement Learning
Designing artificial intelligence for games (Game AI) has been long reco...
read it

Efficient meta reinforcement learning via meta goal generation
Meta reinforcement learning (metaRL) is able to accelerate the acquisit...
read it

Independent Generative Adversarial SelfImitation Learning in Cooperative Multiagent Systems
Many tasks in practice require the collaboration of multiple agents thro...
read it

From Few to More: Largescale Dynamic Multiagent Curriculum Learning
A lot of efforts have been devoted to investigating how agents can learn...
read it

Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
In multiagent systems (MASs), each agent makes individual decisions but ...
read it

Spectralbased Graph Convolutional Network for Directed Graphs
Graph convolutional networks(GCNs) have become the most popular approach...
read it

Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction
Value functions are crucial for modelfree Reinforcement Learning (RL) t...
read it

Deep MultiAgent Reinforcement Learning with DiscreteContinuous Hybrid Action Spaces
Deep Reinforcement Learning (DRL) has been applied to address a variety ...
read it

Hierarchical Deep Multiagent Reinforcement Learning
Despite deep reinforcement learning has recently achieved great successe...
read it

SCCrFMQ Learning in Cooperative Markov Games with Continuous Actions
Although many reinforcement learning methods have been proposed for lear...
read it

BayesToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents
Multiagent algorithms often aim to accurately predict the behaviors of o...
read it

Learning to Advertise with Adaptive Exposure via Constrained TwoLevel Reinforcement Learning
For online advertising in ecommerce, the traditional problem is to assi...
read it

An Optimal Rewiring Strategy for Reinforcement Social Learning in Cooperative Multiagent Systems
Multiagent coordination in cooperative multiagent systems (MASs) has bee...
read it

Falsification of CyberPhysical Systems Using Deep Reinforcement Learning
With the rapid development of software and distributed computing, Cyber...
read it

Hierarchical Heuristic Learning towards Effcient Norm Emergence
Social norms serve as an important mechanism to regulate the behaviors o...
read it

SAIGA: A Multiagent Reinforcement Learning Method Towards Socially Optimal Outcomes
In multiagent environments, the capability of learning is important for ...
read it

Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach
The Iterated Prisoner's Dilemma has guided research on social dilemmas f...
read it

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
Despite single agent deep reinforcement learning has achieved significan...
read it

Blind Image Denoising via Dependent Dirichlet Process Tree
Most existing image denoising approaches assumed the noise to be homogen...
read it
Jianye Hao
is this you? claim profile