L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning

05/23/2023
by   Kibeom Kim, et al.
0

Tasks that involve interaction with various targets are called multi-target tasks. When applying general reinforcement learning approaches for such tasks, certain targets that are difficult to access or interact with may be neglected throughout the course of training - a predicament we call Under-explored Target Problem (UTP). To address this problem, we propose L-SA (Learning by adaptive Sampling and Active querying) framework that includes adaptive sampling and active querying. In the L-SA framework, adaptive sampling dynamically samples targets with the highest increase of success rates at a high proportion, resulting in curricular learning from easy to hard targets. Active querying prompts the agent to interact more frequently with under-explored targets that need more experience or exploration. Our experimental results on visual navigation tasks show that the L-SA framework improves sample efficiency as well as success rates on various multi-target tasks with UTP. Also, it is experimentally demonstrated that the cyclic relationship between adaptive sampling and active querying effectively improves the sample richness of under-explored targets and alleviates UTP.

READ FULL TEXT
research
10/25/2021

Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning

Learning in a multi-target environment without prior knowledge about the...
research
11/11/2019

Context-aware Active Multi-Step Reinforcement Learning

Reinforcement learning has attracted great attention recently, especiall...
research
02/25/2019

Deep Bayesian Multi-Target Learning for Recommender Systems

With the increasing variety of services that e-commerce platforms provid...
research
10/25/2020

Learning Multi-Agent Coordination for Enhancing Target Coverage in Directional Sensor Networks

Maximum target coverage by adjusting the orientation of distributed sens...
research
05/07/2022

Multi-Target Active Object Tracking with Monte Carlo Tree Search and Target Motion Modeling

In this work, we are dedicated to multi-target active object tracking (A...
research
12/16/2022

Reinforcement Learning for Agile Active Target Sensing with a UAV

Active target sensing is the task of discovering and classifying an unkn...
research
07/31/2022

The Search and Rescue Game on a Cycle

We consider a search and rescue game introduced recently by the first au...

Please sign up or login with your details

Forgot password? Click here to reset