We propose a general purpose confidence interval procedure (CIP) for
sta...
The performance of decision policies and prediction models often deterio...
CVaR (Conditional Value at Risk) is a risk metric widely used in finance...
We establish strong laws of large numbers and central limit theorems for...
This paper introduces a new algorithm for numerically computing equilibr...
In the analysis of Markov chains and processes, it is sometimes convenie...
In the analysis of Markov chains and processes, it is sometimes convenie...
Much of the literature on optimal design of bandit algorithms is based o...
One of the most widely used methods for solving large-scale stochastic
o...
We study the problem of bounding path-dependent expectations (within any...
We propose a new unbiased estimator for estimating the utility of the op...
We study the behavior of Thompson sampling from the perspective of weak
...
Weather forecast information will very likely find increasing applicatio...
We study the sequential batch learning problem in linear contextual band...
This paper proposes a novel non-parametric multidimensional convex regre...
We present general principles for the design and analysis of unbiased Mo...
We investigate methods of change-point testing and confidence interval
c...
We present a fully nonparametric method to estimate the value function, ...