
Deep reinforcement learning for large-scale epidemic control
Epidemics of infectious diseases are an important threat to public healt...

An interpretable semi-supervised classifier using two different strategies for amended self-labeling
In the context of some machine learning applications, obtaining data ins...

A utility-based analysis of equilibria in multi-objective normal form games
In multi-objective multi-agent systems (MOMAS), agents explicitly consid...

Model-based Multi-Agent Reinforcement Learning with Cooperative Prioritized Sweeping
We present a new model-based reinforcement learning algorithm, Cooperati...

Fleet Control using Coregionalized Gaussian Process Policy Iteration
In many settings, as for example wind farms, multiple machines are insta...

Multi-Agent Thompson Sampling for Bandit Applications with Sparse Neighbourhood Structures
Multi-agent coordination is prevalent in many real-world applications. H...

Thompson Sampling for Factored Multi-Agent Bandits
Multi-agent coordination is prevalent in many real-world applications. H...

IPC-Net: 3D point-cloud segmentation using deep inter-point convolutional layers
Over the last decade, the demand for better segmentation and classificat...

Multi-Objective Multi-Agent Decision Making: A Utility-based Analysis and Survey
The majority of multi-agent system (MAS) implementations aim to optimise...

Transfer Learning Across Simulated Robots With Different Sensors
For a robot to learn a good policy, it often requires expensive equipmen...

Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics
Value-based reinforcement-learning algorithms are currently state-of-the...

The Actor-Advisor: Policy Gradient With Off-Policy Advice
Actor-critic algorithms learn an explicit policy (actor), and an accompa...

Dynamic Weights in Multi-Objective Deep Reinforcement Learning
Many real-world decision problems are characterized by multiple objectiv...

Directed Policy Gradient for Safe Reinforcement Learning with Human Advice
Many currently deployed Reinforcement Learning agents work in an environ...

Ordered Preference Elicitation Strategies for Supporting Multi-Objective Decision Making
In multi-objective decision planning and learning, much attention is pai...

Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies
Pandemic influenza has the epidemic potential to kill millions of people...

Learning with Options that Terminate Off-Policy
A temporally abstract action, or an option, is specified by a policy and...

Reinforcement Learning in POMDPs with Memoryless Options and Option-Observation Initiation Sets
Many real-world reinforcement learning problems have a hierarchical natu...

Analysing Congestion Problems in Multiagent Reinforcement Learning
Congestion problems are omnipresent in today's complex networks and repr...

Solving stable matching problems using answer set programming
Since the introduction of the stable marriage problem (SMP) by Gale and ...

Off-Policy Reward Shaping with Ensembles
Potential-based reward shaping (PBRS) is an effective and popular techni...

Off-Policy Shaping Ensembles in Reinforcement Learning
Recent advances of gradient temporal-difference methods allow to learn o...

Modeling Stable Matching Problems with Answer Set Programming
The Stable Marriage Problem (SMP) is a well-known matching problem first...
Ann Nowé