Canzhe Zhao | DeepAI

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Cheng Chen
78 publications
Kun Wang
72 publications
Jing Dong
67 publications
Tong Yu
38 publications
Baoxiang Wang
25 publications
Shuo Shao
17 publications
Fang Kong
8 publications
Zhihui Xie
5 publications
Yanjie Ze
3 publications

research

∙ 08/19/2023

DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning

Communication lays the foundation for cooperation in human society and i...

0 Canzhe Zhao, et al. ∙

research

∙ 03/13/2023

Best-of-three-worlds Analysis for Linear Bandits with Follow-the-regularized-leader Algorithm

The linear bandit problem has been studied for many years in both stocha...

0 Fang Kong, et al. ∙

research

∙ 08/21/2022

Comparison-based Conversational Recommender System with Relative Bandit Feedback

With the recent advances of conversational recommendations, the recommen...

0 Zhihui Xie, et al. ∙

research

∙ 07/12/2022

Simultaneously Learning Stochastic and Adversarial Bandits under the Position-Based Model

Online learning to rank (OLTR) interactively learns to choose lists of i...

0 Cheng Chen, et al. ∙

research

∙ 01/25/2022

Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization

Temporal difference (TD) learning is a widely used method to evaluate po...

0 Canzhe Zhao, et al. ∙

research

∙ 04/17/2021

Conservative Contextual Combinatorial Cascading Bandit

Conservative mechanism is a desirable property in decision-making proble...

5 Kun Wang, et al. ∙