Learning Transferable Reward for Query Object Localization with Policy Adaptation

02/24/2022
by   Tingfeng Li, et al.
2

We propose a reinforcement learning based approach to query object localization, for which an agent is trained to localize objects of interest specified by a small exemplary set. We learn a transferable reward signal formulated using the exemplary set by ordinal metric learning. Our proposed method enables test-time policy adaptation to new environments where the reward signals are not readily available, and outperforms fine-tuning approaches that are limited to annotated images. In addition, the transferable reward allows repurposing the trained agent from one specific class to another class. Experiments on corrupted MNIST, CU-Birds, and COCO datasets demonstrate the effectiveness of our approach.

READ FULL TEXT

page 2

page 15

page 16

page 22

page 23

research
02/13/2018

Evolved Policy Gradients

We propose a meta-learning approach for learning gradient-based reinforc...
research
05/28/2018

Reward Constrained Policy Optimization

Teaching agents to perform tasks using Reinforcement Learning is no easy...
research
03/19/2018

Automated Curriculum Learning by Rewarding Temporally Rare Events

Reward shaping allows reinforcement learning (RL) agents to accelerate l...
research
08/17/2023

META-SELD: Meta-Learning for Fast Adaptation to the new environment in Sound Event Localization and Detection

For learning-based sound event localization and detection (SELD) methods...
research
05/17/2021

Learning to Win, Lose and Cooperate through Reward Signal Evolution

Solving a reinforcement learning problem typically involves correctly pr...
research
08/31/2018

Multi-Hop Knowledge Graph Reasoning with Reward Shaping

Multi-hop reasoning is an effective approach for query answering (QA) ov...

Please sign up or login with your details

Forgot password? Click here to reset