
-
Optimal Learning for Sequential Decisions in Laboratory Experimentation
The process of discovery in the physical, biological and medical science...
read it
-
On State Variables, Bandit Problems and POMDPs
State variables are easily the most subtle dimension of sequential decis...
read it
-
Zeroth-order Stochastic Compositional Algorithms for Risk-Aware Learning
We present Free-MESSAGEp, the first zeroth-order algorithm for convex me...
read it
-
From Reinforcement Learning to Optimal Control: A unified framework for sequential decisions
There are over 15 distinct communities that work in the general area of ...
read it
-
Approximate Dynamic Programming for Planning a Ride-Sharing System using Autonomous Fleets of Electric Vehicles
Within a decade, almost every major auto company, along with fleet opera...
read it
-
Recursive Optimization of Convex Risk Measures: Mean-Semideviation Models
We develop and analyze stochastic subgradient methods for optimizing a n...
read it
-
Monte Carlo Tree Search with Sampled Information Relaxation Dual Bounds
Monte Carlo Tree Search (MCTS), most famously used in game-play artifici...
read it
-
Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures
In this paper, we consider a finite-horizon Markov decision process (MDP...
read it
-
A Knowledge Gradient Policy for Sequencing Experiments to Identify the Structure of RNA Molecules Using a Sparse Additive Belief Model
We present a sparse knowledge gradient (SpKG) algorithm for adaptively s...
read it
-
A New Optimal Stepsize For Approximate Dynamic Programming
Approximate dynamic programming (ADP) has proven itself in a wide range ...
read it
-
Dirichlet Process Mixtures of Generalized Linear Models
We propose Dirichlet Process mixtures of Generalized Linear Models (DP-G...
read it