Ad hoc teamwork requires an agent to cooperate with unknown teammates wi...
Simulating the effects of skincare products on face is a potential new w...
Recent advances in deep reinforcement learning (DRL) have largely promot...
Automatic assessment and understanding of facial skin condition have sev...
The goal of policy-based reinforcement learning (RL) is to search the ma...
Humans regularly interact with their surrounding objects. Such interacti...
Classifying the human emotion through facial expressions is a big topic ...
Full-sampling (e.g., Q-learning) and pure-expectation (e.g., Expected Sa...
This paper solves the Sparse Photometric stereo through Lighting
Interpo...
Unsupervised domain adaptation methods aim to alleviate performance
degr...
In this paper, we focus on policy discrepancy in return-based deep Q-net...
Recently, a new multi-step temporal learning algorithm, called Q(σ),
uni...