We study the trade-off between expectation and tail risk for regret
dist...
We formulate, analyze and solve the problem of best arm identification w...
Plasticity, the ability of a neural network to quickly change its predic...
In biomedical science, analyzing treatment effect heterogeneity plays an...
We study a game between liquidity provider and liquidity taker agents
in...
Constructing asymptotically valid confidence intervals through a valid
c...
Nonsmooth nonconvex optimization problems broadly emerge in machine lear...
Understanding the impact of the most effective policies or treatments on...
We design new policies that ensure both worst-case optimality for expect...
Planning with a learned model is arguably a key component of intelligenc...
We formulate selecting the best optimizing system (SBOS) problems and pr...
Motivated by emerging applications such as live-streaming e-commerce,
pr...
Conditional Generative Adversarial Networks are known to be difficult to...
How much credit (or blame) should an action taken in a state get for a f...
In this work, we study auxiliary prediction tasks defined by
temporal-di...
We propose a new framework named DS-WGAN that integrates the doubly
stoc...
Optimal transport (OT) distances are increasingly used as loss functions...
Optical multi-layer thin films are widely used in optical and energy
app...
Reinforcement learning agents can include different components, such as
...
Ethereum is an open-source, public, blockchain-based distributed computi...
In many sequential decision making tasks, it is challenging to design re...
Deep learning models can take weeks to train on a single GPU-equipped
ma...