b'Yu Bai'

research

∙ 09/11/2023

An Empirical Study of NetOps Capability of Pre-Trained Large Language Models

Nowadays, the versatile capabilities of Pre-trained Large Language Model...

0 Yukai Miao, et al. ∙

research

∙ 07/21/2023

What can a Single Attention Layer Learn? A Study Through the Random Features Lens

Attention layers – which map a sequence of inputs to a sequence of outpu...

0 Hengyu Fu, et al. ∙

research

∙ 07/06/2023

Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight

This paper studies the sample-efficiency of learning in Partially Observ...

0 Jiacheng Guo, et al. ∙

research

∙ 06/29/2023

Automatic Speech Recognition of Non-Native Child Speech for Language Learning Applications

Voicebots have provided a new avenue for supporting the development of l...

0 Simone Wills, et al. ∙

research

∙ 06/07/2023

Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection

Neural sequence models based on the transformer architecture have demons...

6 Yu Bai, et al. ∙

research

∙ 06/07/2023

An ASR-Based Tutor for Learning to Read: How to Optimize Feedback to First Graders

The interest in employing automatic speech recognition (ASR) in applicat...

5 Yu Bai, et al. ∙

research

∙ 06/02/2023

Efficient RL with Impaired Observability: Learning to Act with Delayed and Missing State Observations

In real-world reinforcement learning (RL) systems, various forms of impa...

5 Minshuo Chen, et al. ∙

research

∙ 02/15/2023

Improved Online Conformal Prediction via Strongly Adaptive Online Learning

We study the problem of uncertainty quantification via prediction sets, ...

1 Aadyot Bhatnagar, et al. ∙

research

∙ 02/13/2023

Breaking the Curse of Multiagency: Provably Efficient Decentralized Multi-Agent RL with Function Approximation

A unique challenge in Multi-Agent Reinforcement Learning (MARL) is the c...

1 Yuanhao Wang, et al. ∙

research

∙ 02/06/2023

Offline Learning in Markov Games with General Function Approximation

We study offline multi-agent reinforcement learning (RL) in Markov games...

3 Yuheng Zhang, et al. ∙

research

∙ 02/02/2023

Lower Bounds for Learning in Revealing POMDPs

This paper studies the fundamental limits of reinforcement learning (RL)...

9 Fan Chen, et al. ∙

research

∙ 10/23/2022

Conformal Predictor for Improving Zero-shot Text Classification Efficiency

Pre-trained language models (PLMs) have been shown effective for zero-sh...

0 Prafulla Kumar Choubey, et al. ∙

research

∙ 10/20/2022

Learning Rationalizable Equilibria in Multiplayer Games

A natural goal in multiagent learning besides finding equilibria is to l...

2 Yuanhao Wang, et al. ∙

research

∙ 10/09/2022

The Role of Coverage in Online Reinforcement Learning

Coverage conditions – which assert that the data logging distribution ad...

0 Tengyang Xie, et al. ∙

research

∙ 09/29/2022

Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms

Partial Observability – where agents can only observe partial informatio...

4 Fan Chen, et al. ∙

research

∙ 09/23/2022

Unified Algorithms for RL with Decision-Estimation Coefficients: No-Regret, PAC, and Reward-Free Learning

Finding unified complexity measures and algorithms for sample-efficient ...

1 Fan Chen, et al. ∙

research

∙ 06/08/2022

Identifying good directions to escape the NTK regime and efficiently learn low-degree plus sparse polynomials

A recent goal in the theory of deep learning is to identify how neural n...

6 Eshaan Nichani, et al. ∙

research

∙ 06/06/2022

Policy Optimization for Markov Games: Unified Framework and Faster Convergence

This paper studies policy optimization algorithms for multi-agent reinfo...

2 Runyu Zhang, et al. ∙

research

∙ 05/30/2022

Efficient Φ-Regret Minimization in Extensive-Form Games via Online Mirror Descent

A conceptually appealing approach for learning Extensive-Form Games (EFG...

3 Yu Bai, et al. ∙

research

∙ 05/15/2022

Sample-Efficient Learning of Correlated Equilibria in Extensive-Form Games

Imperfect-Information Extensive-Form Games (IIEFGs) is a prevalent model...

1 Ziang Song, et al. ∙

research

∙ 02/22/2022

Efficient and Differentiable Conformal Prediction with General Function Classes

Quantifying the data uncertainty in learning tasks is often done by lear...

0 Yu Bai, et al. ∙

research

∙ 02/13/2022

Application of Color Block Code in Image Scaling

Aiming at the high cost of embedding annotation watermark in a narrow sm...

0 Hao Wang, et al. ∙

research

∙ 02/13/2022

Privacy protection based on mask template

Powerful recognition algorithms are widely used in the Internet or impor...

2 Hao Wang, et al. ∙

research

∙ 02/03/2022

Near-Optimal Learning of Extensive-Form Games with Imperfect Information

This paper resolves the open question of designing near-optimal algorith...

0 Yu Bai, et al. ∙

research

∙ 01/03/2022

Analyzing Micro-Founded General Equilibrium Models with Many Agents using Deep Reinforcement Learning

Real economies can be modeled as a sequential imperfect-information game...

3 Michael Curry, et al. ∙

research

∙ 10/08/2021

When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?

Multi-agent reinforcement learning has made substantial empirical progre...

3 Ziang Song, et al. ∙

research

∙ 09/23/2021

Cross-Lingual Language Model Meta-Pretraining

The success of pretrained cross-lingual language models relies on two es...

4 Zewen Chi, et al. ∙

research

∙ 06/10/2021

Understanding the Under-Coverage Bias in Uncertainty Estimation

Estimating the data uncertainty in regression tasks is often done by lea...

17 Yu Bai, et al. ∙

research

∙ 06/09/2021

Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning

Recent theoretical work studies sample-efficient reinforcement learning ...

10 Tengyang Xie, et al. ∙

research

∙ 03/30/2021

Multi-modal Trajectory Prediction for Autonomous Driving with Semantic Map and Dynamic Graph Attention Network

Predicting future trajectories of surrounding obstacles is a crucial tas...

6 Bo Dong, et al. ∙

research

∙ 03/08/2021

Exact Gap between Generalization Error and Uniform Convergence in Random Feature Models

Recent work showed that there could be a large gap between the classical...

6 Zitong Yang, et al. ∙

research

∙ 02/23/2021

Sample-Efficient Learning of Stackelberg Equilibria in General-Sum Games

Real world applications such as economics and policy making often involv...

1 Yu Bai, et al. ∙

research

∙ 02/22/2021

Localized Calibration: Metrics and Recalibration

Probabilistic classifiers output confidence scores along with their pred...

1 Rachel Luo, et al. ∙

research

∙ 02/15/2021

Don't Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification

Modern machine learning models with high accuracy are often miscalibrate...

8 Yu Bai, et al. ∙

research

∙ 02/02/2021

Near-Optimal Offline Reinforcement Learning via Double Variance Reduction

We consider the problem of offline reinforcement learning (RL) – a well-...

45 Ming Yin, et al. ∙

research

∙ 10/12/2020

How Important is the Train-Validation Split in Meta-Learning?

Meta-learning aims to perform fast adaptation on a new task through lear...

10 Yu Bai, et al. ∙

research

∙ 10/04/2020

A Sharp Analysis of Model-based Reinforcement Learning with Self-Play

Model-based algorithms—algorithms that decouple learning of the model an...

0 Qinghua Liu, et al. ∙

research

∙ 07/07/2020

Near Optimal Provable Uniform Convergence in Off-Policy Evaluation for Reinforcement Learning

The Off-Policy Evaluation aims at estimating the performance of target p...

4 Ming Yin, et al. ∙

research

∙ 06/24/2020

Towards Understanding Hierarchical Learning: Benefits of Neural Representations

Deep neural networks can empirically perform efficient hierarchical lear...

5 Minshuo Chen, et al. ∙

research

∙ 06/22/2020

Near-Optimal Reinforcement Learning with Self-Play

This paper considers the problem of designing optimal algorithms for rei...

0 Yu Bai, et al. ∙

research

∙ 02/10/2020

Provable Self-Play Algorithms for Competitive Reinforcement Learning

Self-play, where the algorithm learns by playing against itself without ...

0 Yu Bai, et al. ∙

research

∙ 02/10/2020

Taylorized Training: Towards Better Approximation of Neural Network Training at Finite Width

We propose Taylorized training as an initiative towards better understan...

20 Yu Bai, et al. ∙

research

∙ 10/21/2019

Directed-Weighting Group Lasso for Eltwise Blocked CNN Pruning

Eltwise layer is a commonly used structure in the multi-branch deep lear...

0 Ke Zhan, et al. ∙

research

∙ 10/03/2019

Beyond Linearization: On Quadratic and Higher-Order Approximation of Wide Neural Networks

Recent theoretical work has established connections between over-paramet...

0 Yu Bai, et al. ∙

research

∙ 05/30/2019

Provably Efficient Q-Learning with Low Switching Cost

We take initial steps in studying PAC-MDP algorithms with limited adapti...

3 Yu Bai, et al. ∙

research

∙ 03/01/2019

Proximal algorithms for constrained composite optimization, with applications to solving low-rank SDPs

We study a family of (potentially non-convex) constrained optimization p...

0 Yu Bai, et al. ∙

research

∙ 10/25/2018

Subgradient Descent Learns Orthogonal Dictionaries

This paper concerns dictionary learning, i.e., sparse coding, a fundamen...

0 Yu Bai, et al. ∙

research

∙ 10/01/2018

ProxQuant: Quantized Neural Networks via Proximal Operators

To make deep neural networks feasible in resource-constrained environmen...

2 Yu Bai, et al. ∙

research

∙ 06/27/2018

Approximability of Discriminators Implies Diversity in GANs

While Generative Adversarial Networks (GANs) have empirically produced i...

0 Yu Bai, et al. ∙

research

∙ 08/29/2017

CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices

Large-scale deep neural networks (DNNs) are both compute and memory inte...

0 Caiwen Ding, et al. ∙

Yu Bai

Featured Co-authors

Sign in with Google

Consider DeepAI Pro