
-
Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State
We design a simple reinforcement learning agent that, with a specificati...
read it
-
Using Undersampling with Ensemble Learning to Identify Factors Contributing to Preterm Birth
In this paper, we propose Ensemble Learning models to identify factors c...
read it
-
Provably Efficient Reinforcement Learning with Aggregated States
We establish that an optimistic variant of Q-learning applied to a finit...
read it
-
Summarizing CPU and GPU Design Trends with Product Data
Moore's Law and Dennard Scaling have guided the semiconductor industry f...
read it
-
Comments on the Du-Kakade-Wang-Yang Lower Bounds
Du, Kakade, Wang, and Yang recently established intriguing lower bounds ...
read it
-
On the Performance of Thompson Sampling on Logistic Bandits
We study the logistic bandit, in which rewards are binary with success p...
read it
-
MGSim + MGMark: A Framework for Multi-GPU System Research
The rapidly growing popularity and scale of data-parallel workloads dema...
read it
-
An Information-Theoretic Analysis for Thompson Sampling with Many Actions
Information-theoretic Bayesian regret bounds of Russo and Van Roy captur...
read it
-
An Information-Theoretic Analysis of Thompson Sampling for Large Action Spaces
Information-theoretic Bayesian regret bounds of Russo and Van Roy captur...
read it