Large Language Model Is Semi-Parametric Reinforcement Learning Agent

06/09/2023
by   Danyang Zhang, et al.
0

Inspired by the insights in cognitive science with respect to human memory and reasoning mechanism, a novel evolvable LLM-based (Large Language Model) agent framework is proposed as REMEMBERER. By equipping the LLM with a long-term experience memory, REMEMBERER is capable of exploiting the experiences from the past episodes even for different task goals, which excels an LLM-based agent with fixed exemplars or equipped with a transient working memory. We further introduce Reinforcement Learning with Experience Memory (RLEM) to update the memory. Thus, the whole system can learn from the experiences of both success and failure, and evolve its capability without fine-tuning the parameters of the LLM. In this way, the proposed REMEMBERER constitutes a semi-parametric RL agent. Extensive experiments are conducted on two RL task sets to evaluate the proposed framework. The average results with different initialization and training sets exceed the prior SOTA by 4 for the success rate on two task sets and demonstrate the superiority and robustness of REMEMBERER.

READ FULL TEXT
research
12/21/2022

SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning

Pre-trained large language models can efficiently interpolate human-writ...
research
04/20/2023

Two-Memory Reinforcement Learning

While deep reinforcement learning has shown important empirical success,...
research
02/17/2022

Retrieval-Augmented Reinforcement Learning

Most deep reinforcement learning (RL) algorithms distill experience into...
research
07/15/2019

Federated Reinforcement Distillation with Proxy Experience Memory

In distributed reinforcement learning, it is common to exchange the expe...
research
09/28/2018

Robot Representing and Reasoning with Knowledge from Reinforcement Learning

Reinforcement learning (RL) agents aim at learning by interacting with a...
research
05/26/2023

Acting as Inverse Inverse Planning

Great storytellers know how to take us on a journey. They direct charact...
research
09/27/2021

The Tensor Brain: A Unified Theory of Perception, Memory and Semantic Decoding

We present a unified computational theory of perception and memory. In o...

Please sign up or login with your details

Forgot password? Click here to reset