Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors

07/21/2023
by Kolby Nottingham, et al.

Large language models (LLMs) are being applied as actors for sequential decision-making tasks in domains such as robotics and games, utilizing their general world knowledge and planning abilities. However, previous work does little to explore what environment state information is provided to LLM actors via language. Exhaustively describing high-dimensional states can impair performance and raise inference costs for LLM actors. Previous LLM actors avoid the issue by relying on hand-engineered, task-specific protocols to determine which features to communicate about a state and which to leave out. In this work, we propose Brief Language INputs for DEcision-making Responses (BLINDER), a method for automatically selecting concise state descriptions by learning a value function for task-conditioned state descriptions. We evaluate BLINDER on the challenging video game NetHack and a robotic manipulation task. Our method improves task success rate, reduces input size and compute costs, and generalizes between LLM actors.
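
The idea of choosing a concise state description with a value function can be illustrated with a small sketch. The abstract does not specify the selection procedure, so everything below is an assumption made for illustration: the greedy loop, the names select_description and toy_value_fn, and the word-overlap scoring that stands in for a learned value function. It is not the authors' implementation.

```python
from typing import Callable, List, Sequence


def select_description(
    features: Sequence[str],
    task: str,
    value_fn: Callable[[str, List[str]], float],
    max_features: int = 5,
) -> List[str]:
    """Greedily add the feature that most raises value_fn; stop when none helps.

    Hypothetical helper: the greedy strategy is an assumption, not taken
    from the paper.
    """
    selected: List[str] = []
    remaining = list(features)
    best_value = value_fn(task, selected)
    while remaining and len(selected) < max_features:
        # Score every candidate description that extends the current one.
        scored = [(value_fn(task, selected + [f]), f) for f in remaining]
        candidate_value, candidate = max(scored, key=lambda pair: pair[0])
        if candidate_value <= best_value:
            break  # no remaining feature improves the estimated value
        selected.append(candidate)
        remaining.remove(candidate)
        best_value = candidate_value
    return selected


def toy_value_fn(task: str, description: List[str]) -> float:
    """Stand-in for a learned value function (assumption for this sketch).

    Rewards words shared with the task and charges 1.0 per included feature,
    so longer descriptions must earn their place.
    """
    task_words = set(task.lower().split())
    overlap = sum(len(task_words & set(f.lower().split())) for f in description)
    return overlap - 1.0 * len(description)


if __name__ == "__main__":
    state_features = [
        "a red block is on the table",
        "the wall is painted grey",
        "a blue bowl is on the shelf",
        "the gripper is open",
    ]
    chosen = select_description(state_features, "pick up the red block", toy_value_fn)
    print(chosen)  # prints ['a red block is on the table']
```

In this toy setting only the task-relevant feature survives, which mirrors the stated goal of shorter inputs for the LLM actor. A learned, task-conditioned value function would replace toy_value_fn.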

research
08/24/2023

Large Language Model as Autonomous Decision Maker

While large language models (LLMs) exhibit impressive language understan...
research
10/05/2020

Learning to Generalize for Sequential Decision Making

We consider problems of making sequences of decisions to accomplish task...
research
12/04/2022

Learning Automata-Based Task Knowledge Representation from Large-Scale Generative Language Models

Automata-based representations play an important role in control and pla...
research
10/22/2022

LMPriors: Pre-Trained Language Models as Task-Specific Priors

Particularly in low-data regimes, an outstanding challenge in machine le...
research
03/09/2020

Learning discrete state abstractions with deep variational inference

Abstraction is crucial for effective sequential decision making in domai...
research
12/04/2015

Reuse of Neural Modules for General Video Game Playing

A general approach to knowledge transfer is introduced in which an agent...
research
10/27/2022

LAD: Language Augmented Diffusion for Reinforcement Learning

Learning skills from language provides a powerful avenue for generalizat...