b'Kaixin Wang'

research

∙ 08/03/2023

ClassEval: A Manually-Crafted Benchmark for Evaluating LLMs on Class-level Code Generation

In this work, we make the first attempt to evaluate LLMs in a more chall...

0 Xueying Du, et al. ∙

research

∙ 06/09/2023

Robust Reinforcement Learning via Adversarial Kernel Approximation

Robust Markov Decision Processes (RMDPs) provide a framework for sequent...

0 Kaixin Wang, et al. ∙

research

∙ 05/07/2023

No More Manual Tests? Evaluating and Improving ChatGPT for Unit Test Generation

Unit testing is essential in detecting bugs in functionally-discrete pro...

0 Zhiqiang Yuan, et al. ∙

research

∙ 01/31/2023

An Efficient Solution to s-Rectangular Robust Markov Decision Processes

We present an efficient robust value iteration for -rectangular robust M...

0 Navdeep Kumar, et al. ∙

research

∙ 11/13/2022

Reinforcement Learning Enhanced Weighted Sampling for Accurate Subgraph Counting on Fully Dynamic Graph Streams

As the popularity of graph data increases, there is a growing need to co...

0 Kaixin Wang, et al. ∙

research

∙ 10/24/2022

Reachability-Aware Laplacian Representation in Reinforcement Learning

In Reinforcement Learning (RL), Laplacian Representation (LapRep) is a t...

0 Kaixin Wang, et al. ∙

research

∙ 10/03/2022

Policy Gradient for Reinforcement Learning with General Utilities

In Reinforcement Learning (RL), the goal of agents is to discover an opt...

0 Navdeep Kumar, et al. ∙

research

∙ 09/20/2022

Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL

The cooperative Multi-A gent R einforcement Learning (MARL) with permuta...

1 Fengzhuo Zhang, et al. ∙

research

∙ 05/28/2022

Efficient Policy Iteration for Robust Markov Decision Processes via Regularization

Robust Markov decision processes (MDPs) provide a general framework to m...

0 Navdeep Kumar, et al. ∙

research

∙ 05/23/2022

Tyger: Task-Type-Generic Active Learning for Molecular Property Prediction

How to accurately predict the properties of molecules is an essential pr...

8 Kuangqi Zhou, et al. ∙

research

∙ 01/30/2022

The Geometry of Robust Value Functions

The space of value functions is a fundamental concept in reinforcement l...

0 Kaixin Wang, et al. ∙

research

∙ 07/12/2021

Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing

The Laplacian representation recently gains increasing attention for rei...

26 Kaixin Wang, et al. ∙

research

∙ 10/21/2020

Improving Generalization in Reinforcement Learning with Mixture Regularization

Deep reinforcement learning (RL) agents trained in a limited set of envi...

0 Kaixin Wang, et al. ∙

research

∙ 08/18/2019

PANet: Few-Shot Image Semantic Segmentation with Prototype Alignment

Despite the great progress made by deep CNNs in image semantic segmentat...

13 Kaixin Wang, et al. ∙

research

∙ 07/12/2019

Deep Model Compression via Filter Auto-sampling

The recent WSNet [1] is a new model compression method through sampling ...

1 Daquan Zhou, et al. ∙

Kaixin Wang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro