Qian Zheng

research

∙ 06/19/2023

Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification

Ad hoc teamwork requires an agent to cooperate with unknown teammates wi...

0 Dong Xing, et al. ∙

research

∙ 02/23/2023

Evaluating the Efficacy of Skincare Product: A Realistic Short-Term Facial Pore Simulation

Simulating the effects of skincare products on face is a potential new w...

0 Ling Li, et al. ∙

research

∙ 05/01/2022

TinyLight: Adaptive Traffic Signal Control on Devices with Extremely Limited Resources

Recent advances in deep reinforcement learning (DRL) have largely promot...

0 Dong Xing, et al. ∙

research

∙ 03/30/2022

Automatic Facial Skin Feature Detection for Everyone

Automatic assessment and understanding of facial skin condition have sev...

0 Qian Zheng, et al. ∙

research

∙ 12/02/2020

Sample Complexity of Policy Gradient Finding Second-Order Stationary Points

The goal of policy-based reinforcement learning (RL) is to search the ma...

0 Long Yang, et al. ∙

research

∙ 08/20/2020

Object Properties Inferring from and Transfer for Human Interaction Motions

Humans regularly interact with their surrounding objects. Such interacti...

0 Qian Zheng, et al. ∙

research

∙ 10/24/2019

Emotion recognition with 4kresolution database

Classifying the human emotion through facial expressions is a big topic ...

19 Qian Zheng, et al. ∙

research

∙ 09/06/2019

Gradient Q(σ, λ): A Unified Algorithm with Function Approximation for Reinforcement Learning

Full-sampling (e.g., Q-learning) and pure-expectation (e.g., Expected Sa...

0 Long Yang, et al. ∙

research

∙ 05/10/2019

SPLINE-Net: Sparse Photometric Stereo through Lighting Interpolation and Normal Estimation Networks

This paper solves the Sparse Photometric stereo through Lighting Interpo...

5 Qian Zheng, et al. ∙

research

∙ 11/12/2018

Exploiting Local Feature Patterns for Unsupervised Domain Adaptation

Unsupervised domain adaptation methods aim to alleviate performance degr...

0 Jun Wen, et al. ∙

research

∙ 06/14/2018

Qualitative Measurements of Policy Discrepancy for Return-based Deep Q-Network

In this paper, we focus on policy discrepancy in return-based deep Q-net...

0 Wenjia Meng, et al. ∙

research

∙ 02/09/2018

A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning

Recently, a new multi-step temporal learning algorithm, called Q(σ), uni...

0 Long Yang, et al. ∙

