research
∙
08/29/2023
Pure Exploration under Mediators' Feedback
Stochastic multi-armed bandits are a sequential-decision-making framewor...
research
∙
05/07/2023
Truncating Trajectories in Monte Carlo Reinforcement Learning
In Reinforcement Learning (RL), an agent acts in an unknown environment ...
research
∙
07/25/2022
Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs
With the continuous growth of the global economy and markets, resource i...
research
∙
05/18/2021
Meta-Reinforcement Learning by Tracking Task Non-stationarity
Many real-world domains are subject to a structured non-stationarity whi...
research
∙
07/01/2020