research
∙
10/01/2021
A Cramér Distance perspective on Non-crossing Quantile Regression in Distributional Reinforcement Learning
Distributional reinforcement learning (DRL) extends the value-based appr...
research
∙
09/25/2019
PCMC-Net: Feature-based Pairwise Choice Markov Chains
Pairwise Choice Markov Chains (PCMC) have been recently introduced to ov...
research
∙
01/23/2019
kd-switch: A Universal Online Predictor with an application to Sequential Two-Sample Testing
We propose a novel online predictor for discrete labels conditioned on m...
research
∙
07/17/2018