We study robust reinforcement learning (RL) with the goal of determining...
We provide a new information-theoretic generalization error bound that i...
We study policy optimization for Markov decision processes (MDPs) with
m...
We study the problem of weakly private information retrieval (W-PIR), wh...
We study the effect of reward variance heterogeneity in the approximate
...
We analyze the performance of the Borda counting algorithm in a
non-para...
We propose a new approach to apply the chaining technique in conjunction...
We address the issue of safety in reinforcement learning. We pose the pr...
We address the issue of safety in reinforcement learning. We pose the pr...
In the conventional robust T-colluding private information retrieval (PI...
We propose a new information-theoretic bound on generalization error bas...
In a private information retrieval (PIR) system, the user needs to retri...
We consider information leakage to the user in private information retri...
We consider constructing capacity-achieving linear codes with minimum me...
In this paper, we investigate the impact of diverse user preference on
l...
In this paper, we propose a cost-aware cascading bandits model, a new va...
In this paper, we investigate cost-aware joint learning and optimization...
We consider a variant of the classic multi-armed bandit problem where th...