DOM-Q-NET: Grounded RL on Structured Language

02/19/2019
by   Sheng Jia, et al.
0

Building agents to interact with the web would allow for significant improvements in knowledge understanding and representation learning. However, web navigation tasks are difficult for current deep reinforcement learning (RL) models due to the large discrete action space and the varying number of actions between the states. In this work, we introduce DOM-Q-NET, a novel architecture for RL-based web navigation to address both of these problems. It parametrizes Q functions with separate networks for different action categories: clicking a DOM element and typing a string input. Our model utilizes a graph neural network to represent the tree-structured HTML of a standard web page. We demonstrate the capabilities of our model on the MiniWoB environment where we can match or outperform existing work without the use of expert demonstrations. Furthermore, we show 2x improvements in sample efficiency when training in the multi-task setting, allowing our model to transfer learned behaviours across tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/18/2018

Reinforced Imitation: Sample Efficient Deep Reinforcement Learning for Map-less Navigation by Leveraging Prior Demonstrations

This work presents a learning-based approach for target driven map-less ...
research
09/05/2019

Learning Action-Transferable Policy with Action Embedding

Despite achieving great success on performance in various sequential dec...
research
12/06/2019

VALAN: Vision and Language Agent Navigation

VALAN is a lightweight and scalable software framework for deep reinforc...
research
09/28/2018

Robot Representing and Reasoning with Knowledge from Reinforcement Learning

Reinforcement learning (RL) agents aim at learning by interacting with a...
research
02/24/2018

Reinforcement Learning on Web Interfaces Using Workflow-Guided Exploration

Reinforcement learning (RL) agents improve through trial-and-error, but ...
research
06/14/2023

Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning

Whereas machine learning models typically learn language by directly tra...
research
06/30/2021

Decomposing the Prediction Problem; Autonomous Navigation by neoRL Agents

Navigating the world is a fundamental ability for any living entity. Acc...

Please sign up or login with your details

Forgot password? Click here to reset