LASER: LLM Agent with State-Space Exploration for Web Navigation

09/15/2023
by Kaixin Ma, et al.

Large language models (LLMs) have been successfully adapted for interactive decision-making tasks such as web navigation. Although they achieve decent performance, previous methods implicitly assume a forward-only execution mode for the model: they provide only oracle trajectories as in-context examples to teach the model how to reason in the interactive environment. Consequently, the model cannot handle more challenging scenarios that the in-context examples do not cover, such as mistakes, which leads to sub-optimal performance. To address this issue, we propose to model the interactive task as state-space exploration, in which the LLM agent transitions among a pre-defined set of states by performing actions to complete the task. This formulation enables flexible backtracking, allowing the model to recover easily from errors. We evaluate our proposed LLM Agent with State-Space ExploRation (LASER) on the WebShop task. Experimental results show that our LASER agent significantly outperforms previous methods and closes the gap with human performance on the web navigation task.
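
To make the state-space formulation concrete, here is a minimal Python sketch of an agent that moves among a fixed set of states and can backtrack. It is an illustration under stated assumptions, not the LASER implementation: the state names (`search`, `results`, `item`, `done`), the action names, the transition table, and `scripted_policy` (a stand-in for the LLM's choice of action) are all hypothetical and do not come from the abstract.

```python
from dataclasses import dataclass, field

# Hypothetical states and the actions allowed in each one (illustration only).
ALLOWED_ACTIONS = {
    "search": {"submit_query"},
    "results": {"open_item", "next_page", "back_to_search"},
    "item": {"buy", "back_to_results"},
}

# (state, action) -> next state. The "back_*" actions are what make
# backtracking an ordinary transition rather than a special case.
TRANSITIONS = {
    ("search", "submit_query"): "results",
    ("results", "open_item"): "item",
    ("results", "next_page"): "results",
    ("results", "back_to_search"): "search",
    ("item", "back_to_results"): "results",
    ("item", "buy"): "done",
}


@dataclass
class Agent:
    state: str = "search"
    history: list = field(default_factory=list)

    def step(self, action: str) -> str:
        """Apply one action, rejecting any action not valid in the current state."""
        if action not in ALLOWED_ACTIONS.get(self.state, set()):
            raise ValueError(f"action {action!r} is not allowed in state {self.state!r}")
        self.history.append((self.state, action))
        self.state = TRANSITIONS[(self.state, action)]
        return self.state


def scripted_policy(state: str, history: list) -> str:
    """Stand-in for the LLM: rejects the first item it opens, buys the second."""
    if state == "search":
        return "submit_query"
    if state == "results":
        return "open_item"
    opened = sum(1 for _, action in history if action == "open_item")
    return "back_to_results" if opened < 2 else "buy"


def run(agent: Agent, policy, max_steps: int = 10) -> str:
    """Let the policy drive the agent until it reaches "done" or runs out of steps."""
    for _ in range(max_steps):
        if agent.state == "done":
            break
        agent.step(policy(agent.state, agent.history))
    return agent.state
```

Calling `run(Agent(), scripted_policy)` walks from search to results to an item, backs out to the results, opens a second item, and buys it. Because each state exposes only its own actions, an invalid move is rejected before it is applied, and a wrong choice is undone by an ordinary transition back to an earlier state.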

research
08/21/2020

Occupancy Anticipation for Efficient Exploration and Navigation

State-of-the-art navigation methods leverage a spatial memory to general...
research
07/15/2020

Active Visual Information Gathering for Vision-Language Navigation

Vision-language navigation (VLN) is the task of entailing an agent to ca...
research
07/12/2023

VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View

Incremental decision making in real-world environments is one of the mos...
research
03/04/2019

The StreetLearn Environment and Dataset

Navigation is a rich and well-grounded problem domain that drives progre...
research
07/11/2020

Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation

The ability to perform effective planning is crucial for building an ins...
research
06/20/2023

Cooperative Multi-Agent Learning for Navigation via Structured State Abstraction

Cooperative multi-agent reinforcement learning (MARL) for navigation ena...
research
01/30/2013

Flexible and Approximate Computation through State-Space Reduction

In the real world, insufficient information, limited computation resourc...