Nan Jiang

research

∙ 09/16/2023

Solving Satisfiability Modulo Counting for Symbolic and Statistical AI Integration With Provable Guarantees

Satisfiability Modulo Counting (SMC) encompasses problems that require b...

0 Jinzhao Li, et al. ∙

research

∙ 09/13/2023

Racing Control Variable Genetic Programming for Symbolic Regression

Symbolic regression, as one of the most crucial tasks in AI for science,...

0 Nan Jiang, et al. ∙

research

∙ 09/04/2023

Marginalized Importance Sampling for Off-Environment Policy Evaluation

Reinforcement Learning (RL) methods are typically sample-inefficient, ma...

0 Pulkit Katdare, et al. ∙

research

∙ 07/25/2023

The Optimal Approximation Factors in Misspecified Off-Policy Value Function Estimation

Theoretical guarantees in reinforcement learning (RL) are known to suffe...

0 Philip Amortila, et al. ∙

research

∙ 07/09/2023

Ultrasonic Image's Annotation Removal: A Self-supervised Noise2Noise Approach

Accurately annotated ultrasonic images are vital components of a high-qu...

0 Yuanheng Zhang, et al. ∙

research

∙ 06/06/2023

Impact of Large Language Models on Generating Software Specifications

Software specifications are essential for ensuring the reliability of so...

0 Danning Xie, et al. ∙

research

∙ 06/05/2023

LmPa: Improving Decompilation by Synergy of Large Language Model and Program Analysis

Decompilation aims to recover the source code form of a binary executabl...

0 Xiangzhe Xu, et al. ∙

research

∙ 05/29/2023

How Effective Are Neural Networks for Fixing Security Vulnerabilities

Security vulnerability repair is a difficult task that is in dire need o...

0 Yi Wu, et al. ∙

research

∙ 05/25/2023

Symbolic Regression via Control Variable Genetic Programming

Learning symbolic expressions directly from experiment data is a vital s...

0 Nan Jiang, et al. ∙

research

∙ 05/22/2023

LM-Switch: Lightweight Language Model Conditioning in Word Embedding Space

In recent years, large language models (LMs) have achieved remarkable pr...

0 Chi Han, et al. ∙

research

∙ 05/06/2023

Explaining RL Decisions with Trajectories

Explanation is a key component for the adoption of reinforcement learnin...

11 Shripad Vilasrao Deshmukh, et al. ∙

research

∙ 03/11/2023

Secure and Multi-Step Computation Offloading and Resource Allocation in Ultra-Dense Multi-Task NOMA-Enabled IoT Networks

Ultra-dense networks are widely regarded as a promising solution to expl...

0 Tianqing Zhou, et al. ∙

research

∙ 02/21/2023

Adversarial Model for Offline Reinforcement Learning

We propose a novel model-based offline Reinforcement Learning (RL) frame...

0 Mohak Bhardwaj, et al. ∙

research

∙ 02/10/2023

Impact of Code Language Models on Automated Program Repair

Automated program repair (APR) aims to help developers improve software ...

0 Nan Jiang, et al. ∙

research

∙ 02/06/2023

Offline Learning in Markov Games with General Function Approximation

We study offline multi-agent reinforcement learning (RL) in Markov games...

3 Yuheng Zhang, et al. ∙

research

∙ 02/04/2023

Reinforcement Learning in Low-Rank MDPs with Density Features

MDPs with low-rank transitions – that is, the transition matrix can be f...

0 Audrey Huang, et al. ∙

research

∙ 02/03/2023

KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair

Automated Program Repair (APR) improves software reliability by generati...

3 Nan Jiang, et al. ∙

research

∙ 12/19/2022

Semantics-Aware Remote Estimation via Information Bottleneck-Inspired Type Based Multiple Access

Type-based multiple access (TBMA) is a semantics-aware multiple access p...

0 Meiyi Zhu, et al. ∙

research

∙ 12/07/2022

Toward Multi-Service Edge-Intelligence Paradigm: Temporal-Adaptive Prediction for Time-Critical Control over Wireless

Time-critical control applications typically pose stringent connectivity...

0 Adnan Aijaz, et al. ∙

research

∙ 12/01/2022

Learning Combinatorial Structures via Markov Random Fields with Sampling through Lovász Local Lemma

Generative models for learning combinatorial structures have transformat...

0 Nan Jiang, et al. ∙

research

∙ 11/08/2022

ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data

We propose a new model-based offline RL framework, called Adversarial Mo...

0 Tengyang Xie, et al. ∙

research

∙ 10/27/2022

Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions

Off-policy evaluation often refers to two related tasks: estimating the ...

0 Audrey Huang, et al. ∙

research

∙ 10/09/2022

The Role of Coverage in Online Reinforcement Learning

Coverage conditions – which assert that the data logging distribution ad...

0 Tengyang Xie, et al. ∙

research

∙ 09/06/2022

Second order, unconditionally stable, linear ensemble algorithms for the magnetohydrodynamics equations

We propose two unconditionally stable, linear ensemble algorithms with p...

0 John Carter, et al. ∙

research

∙ 08/11/2022

On the Value of Behavioral Representations for Dense Retrieval

We consider text retrieval within dense representational space in real-w...

0 Nan Jiang, et al. ∙

research

∙ 07/26/2022

Future-Dependent Value-Based Off-Policy Evaluation in POMDPs

We study off-policy evaluation (OPE) for partially observable MDPs (POMD...

3 Masatoshi Uehara, et al. ∙

research

∙ 07/18/2022

A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation

The current paper studies sample-efficient Reinforcement Learning (RL) i...

0 Philip Amortila, et al. ∙

research

∙ 06/21/2022

On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL

We study reward-free reinforcement learning (RL) under general non-linea...

0 Jinglin Chen, et al. ∙

research

∙ 06/16/2022

Interaction-Grounded Learning with Action-inclusive Feedback

Consider the problem setting of Interaction-Grounded Learning (IGL), in ...

2 Tengyang Xie, et al. ∙

research

∙ 05/25/2022

Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret

We propose a new learning framework that captures the tiered structure o...

0 Jiawei Huang, et al. ∙

research

∙ 03/25/2022

Offline Reinforcement Learning Under Value and Density-Ratio Realizability: the Power of Gaps

We consider a challenging theoretical problem in offline reinforcement l...

0 Jinglin Chen, et al. ∙

research

∙ 03/15/2022

ActFormer: A GAN Transformer Framework towards General Action-Conditioned 3D Human Motion Generation

We present a GAN Transformer framework for general action-conditioned 3D...

0 Ziyang Song, et al. ∙

research

∙ 02/14/2022

Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality

Deployment efficiency is an important criterion for many real-world appl...

0 Jiawei Huang, et al. ∙

research

∙ 02/09/2022

Offline Reinforcement Learning with Realizability and Single-policy Concentrability

Sample-efficiency guarantees for offline reinforcement learning (RL) oft...

0 Wenhao Zhan, et al. ∙

research

∙ 02/05/2022

Adversarially Trained Actor Critic for Offline Reinforcement Learning

We propose Adversarially Trained Actor Critic (ATAC), a new model-free a...

0 Ching-An Cheng, et al. ∙

research

∙ 11/12/2021

A Minimax Learning Approach to Off-Policy Evaluation in Partially Observable Markov Decision Processes

We consider off-policy evaluation (OPE) in Partially Observable Markov D...

0 Chengchun Shi, et al. ∙

research

∙ 10/26/2021

Towards Hyperparameter-free Policy Selection for Offline Reinforcement Learning

How to select between policies and value functions produced by different...

0 Siyuan Zhang, et al. ∙

research

∙ 10/06/2021

A Fast Randomized Algorithm for Massive Text Normalization

Many popular machine learning techniques in natural language processing ...

0 Nan Jiang, et al. ∙

research

∙ 09/22/2021

A Spectral Approach to Off-Policy Evaluation for POMDPs

We consider off-policy evaluation (OPE) in Partially Observable Markov D...

0 Yash Nair, et al. ∙

research

∙ 09/03/2021

Recursive Periodicity Shifting for Semi-Persistent Scheduling of Time-Sensitive Communication in 5G

Various legacy and emerging industrial control applications create the r...

0 Nan Jiang, et al. ∙

research

∙ 08/17/2021

Two linear, unconditionally stable, second order decoupling methods for the Allen–Cahn–Navier–Stokes phase field model

Hydrodynamics coupled phase field models have intricate difficulties to ...

0 Ruonan Cao, et al. ∙

research

∙ 08/09/2021

A Self-Configurable Grouping Method for Integrated Wi-SUN FAN and TSCH-based Networks

Recent applications in large-scale wireless mesh networks (WSN), e.g., A...

0 Xinyu Ni, et al. ∙

research

∙ 07/07/2021

Group Sampling for Unsupervised Person Re-identification

Unsupervised person re-identification (re-ID) remains a challenging task...

7 Xumeng Han, et al. ∙

research

∙ 06/13/2021

Bellman-consistent Pessimism for Offline Reinforcement Learning

The use of pessimism, when reasoning about datasets lacking exhaustive e...

0 Tengyang Xie, et al. ∙

research

∙ 06/09/2021

Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning

Recent theoretical work studies sample-efficient reinforcement learning ...

10 Tengyang Xie, et al. ∙

research

∙ 06/02/2021

On the Convergence Rate of Off-Policy Policy Optimization Methods with Density-Ratio Correction

In this paper, we study the convergence properties of off-policy policy ...

0 Jiawei Huang, et al. ∙

research

∙ 04/28/2021

Evaluating the Performance of Over-the-Air Time Synchronization for 5G and TSN Integration

The IEEE 802.1 time-sensitive networking (TSN) standards aim at improvin...

0 Haochuan Shi, et al. ∙

research

∙ 04/14/2021

A second order ensemble method based on a blended BDF timestepping scheme for time dependent Navier-Stokes equations

We present a second order ensemble method based on a blended three-step ...

0 Nan Jiang, et al. ∙

research

∙ 03/02/2021

Minimax Model Learning

We present a novel off-policy loss function for learning a transition mo...

18 Cameron Voloshin, et al. ∙

research

∙ 02/26/2021

CURE: Code-Aware Neural Machine Translation for Automatic Program Repair

Automatic program repair (APR) is crucial to improve software reliabilit...

0 Nan Jiang, et al. ∙

Nan Jiang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro