Nowadays, the versatile capabilities of Pre-trained Large Language Model...
Attention layers – which map a sequence of inputs to a sequence of outpu...
This paper studies the sample-efficiency of learning in Partially Observ...
Voicebots have provided a new avenue for supporting the development of
l...
Neural sequence models based on the transformer architecture have
demons...
The interest in employing automatic speech recognition (ASR) in applicat...
In real-world reinforcement learning (RL) systems, various forms of impa...
We study the problem of uncertainty quantification via prediction sets, ...
A unique challenge in Multi-Agent Reinforcement Learning (MARL) is the c...
We study offline multi-agent reinforcement learning (RL) in Markov games...
This paper studies the fundamental limits of reinforcement learning (RL)...
Pre-trained language models (PLMs) have been shown effective for zero-sh...
A natural goal in multiagent learning besides finding equilibria is to l...
Coverage conditions – which assert that the data logging distribution
ad...
Partial Observability – where agents can only observe partial informatio...
Finding unified complexity measures and algorithms for sample-efficient
...
A recent goal in the theory of deep learning is to identify how neural
n...
This paper studies policy optimization algorithms for multi-agent
reinfo...
A conceptually appealing approach for learning Extensive-Form Games (EFG...
Imperfect-Information Extensive-Form Games (IIEFGs) is a prevalent model...
Quantifying the data uncertainty in learning tasks is often done by lear...
Aiming at the high cost of embedding annotation watermark in a narrow sm...
Powerful recognition algorithms are widely used in the Internet or impor...
This paper resolves the open question of designing near-optimal algorith...
Real economies can be modeled as a sequential imperfect-information game...
Multi-agent reinforcement learning has made substantial empirical progre...
The success of pretrained cross-lingual language models relies on two
es...
Estimating the data uncertainty in regression tasks is often done by lea...
Recent theoretical work studies sample-efficient reinforcement learning ...
Predicting future trajectories of surrounding obstacles is a crucial tas...
Recent work showed that there could be a large gap between the classical...
Real world applications such as economics and policy making often involv...
Probabilistic classifiers output confidence scores along with their
pred...
Modern machine learning models with high accuracy are often miscalibrate...
We consider the problem of offline reinforcement learning (RL) – a
well-...
Meta-learning aims to perform fast adaptation on a new task through lear...
Model-based algorithms—algorithms that decouple learning of the model an...
The Off-Policy Evaluation aims at estimating the performance of target p...
Deep neural networks can empirically perform efficient hierarchical lear...
This paper considers the problem of designing optimal algorithms for
rei...
Self-play, where the algorithm learns by playing against itself without
...
We propose Taylorized training as an initiative towards better
understan...
Eltwise layer is a commonly used structure in the multi-branch deep lear...
Recent theoretical work has established connections between over-paramet...
We take initial steps in studying PAC-MDP algorithms with limited adapti...
We study a family of (potentially non-convex) constrained optimization
p...
This paper concerns dictionary learning, i.e., sparse coding, a fundamen...
To make deep neural networks feasible in resource-constrained environmen...
While Generative Adversarial Networks (GANs) have empirically produced
i...
Large-scale deep neural networks (DNNs) are both compute and memory
inte...