In the past few years, AlphaZero's exceptional capability in mastering
i...
Truss layout design, namely finding a lightweight truss layout satisfyin...
The paradigm of pre-training followed by fine-tuning on downstream tasks...
We aim to control a robot to physically behave in the real world followi...
Aiming at promoting the safe real-world deployment of Reinforcement Lear...
Reinforcement Learning (RL) has achieved tremendous development in recent...
This paper investigates the multi-agent navigation problem, which requir...
There is a recent trend of applying multi-agent reinforcement learning (...
We consider the problem of cooperative exploration where multiple robots...
Deep reinforcement learning (DRL) requires the collection of plenty of
i...
Data compression has been widely adopted to release mobile devices from
...
In this paper, we study the robust optimal investment and risk control
p...
Insider information and model uncertainty are two unavoidable problems f...
Many advances in cooperative multi-agent reinforcement learning (MARL) a...
Accurate estimation of post-click conversion rate is critical for buildi...
Hierarchical Text Classification (HTC) is a challenging task where a doc...
Discovering hazardous scenarios is crucial in testing and further improv...
We present Coordinated Proximal Policy Optimization (CoPPO), an algorith...
We consider the task of visual indoor exploration with multiple agents, ...
Although many methods for computing the Greeks of discrete-time Asian op...
In recent years, quantitative investment methods combined with artificia...
Proximal Policy Optimization (PPO) is a popular on-policy reinforcement
...
Aspect-based sentiment analysis (ABSA) involves three fundamental subtas...
Autonomous driving vehicles (ADVs) are implemented with rich software
fu...
In this paper, a novel neural network activation function, called Symmet...
As a subfield of machine learning, reinforcement learning (RL) aims at
e...
Providing reinforcement learning agents with informationally rich human
...
Achieving coordination is crucial in organizational control. This paper
...
Simultaneous Localization and Mapping (SLAM) is considered to be a
funda...