Weitong Zhang

research

∙ 07/11/2023

DNAGPT: A Generalized Pretrained Tool for Multiple DNA Sequence Analysis Tasks

The success of the GPT series proves that GPT can extract general inform...

0 Daoan Zhang, et al. ∙

research

∙ 05/15/2023

Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs

Recent studies have shown that episodic reinforcement learning (RL) is n...

4 Kaixuan Ji, et al. ∙

research

∙ 05/09/2023

DynamicKD: An Effective Knowledge Distillation via Dynamic Entropy Correction-Based Distillation for Gap Optimizing

The knowledge distillation uses a high-performance teacher network to gu...

2 Songling Zhu, et al. ∙

research

∙ 03/17/2023

Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs

We study reward-free reinforcement learning (RL) with linear function ap...

4 Junkai Zhang, et al. ∙

research

∙ 03/16/2023

On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits

We study linear contextual bandits in the misspecified setting, where th...

1 Weitong Zhang, et al. ∙

research

∙ 01/24/2022

Learning Contextual Bandits Through Perturbed Rewards

Thanks to the power of representation learning, neural contextual bandit...

4 Yiling Jia, et al. ∙

research

∙ 10/12/2021

Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation

We study the model-based reward-free reinforcement learning with linear ...

8 Weitong Zhang, et al. ∙

research

∙ 06/22/2021

Provably Efficient Representation Learning in Low-rank Markov Decision Processes

The success of deep reinforcement learning (DRL) is due to the power of ...

27 Weitong Zhang, et al. ∙

research

∙ 10/02/2020

Neural Thompson Sampling

Thompson Sampling (TS) is one of the most effective algorithms for solvi...

7 Weitong Zhang, et al. ∙

research

∙ 05/04/2020

A Finite Time Analysis of Two Time-Scale Actor Critic Methods

Actor-critic (AC) methods have exhibited great empirical success compare...

5 Yue Wu, et al. ∙

research

∙ 07/27/2018

Characters Detection on Namecard with faster RCNN

We apply Faster R-CNN to the detection of characters in namecard, in ord...

0 Weitong Zhang, et al. ∙

Weitong Zhang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro