Language Models can Solve Computer Tasks

03/30/2023
by   Geunwoo Kim, et al.
2

Agents capable of carrying out general tasks on a computer can improve efficiency and productivity by automating repetitive tasks and assisting in complex problem-solving. Ideally, such agents should be able to solve new computer tasks presented to them through natural language commands. However, previous approaches to this problem require large amounts of expert demonstrations and task-specific reward functions, both of which are impractical for new tasks. In this work, we show that a pre-trained large language model (LLM) agent can execute computer tasks guided by natural language using a simple prompting scheme where the agent recursively criticizes and improves its output (RCI). The RCI approach significantly outperforms existing LLM methods for automating computer tasks and surpasses supervised learning (SL) and reinforcement learning (RL) approaches on the MiniWoB++ benchmark. RCI is competitive with the state-of-the-art SL+RL method, using only a handful of demonstrations per task rather than tens of thousands, and without a task-specific reward function. Furthermore, we demonstrate RCI prompting's effectiveness in enhancing LLMs' reasoning abilities on a suite of natural language reasoning tasks, outperforming chain of thought (CoT) prompting. We find that RCI combined with CoT performs better than either separately.

READ FULL TEXT

page 2

page 4

page 5

page 11

research
02/27/2023

Reward Design with Language Models

Reward design in reinforcement learning (RL) is challenging since specif...
research
02/01/2023

Collaborating with language models for embodied reasoning

Reasoning in a complex and ambiguous environment is a key goal for Reinf...
research
06/13/2023

Synapse: Leveraging Few-Shot Exemplars for Human-Level Computer Control

This paper investigates the design of few-shot exemplars for computer au...
research
02/16/2022

A data-driven approach for learning to control computers

It would be useful for machines to use computers as humans do so that th...
research
09/20/2021

Learning Natural Language Generation from Scratch

This paper introduces TRUncated ReinForcement Learning for Language (Tru...
research
04/26/2022

SkillNet-NLG: General-Purpose Natural Language Generation with a Sparsely Activated Approach

We present SkillNet-NLG, a sparsely activated approach that handles many...
research
03/19/2023

CLIP4MC: An RL-Friendly Vision-Language Model for Minecraft

One of the essential missions in the AI research community is to build a...

Please sign up or login with your details

Forgot password? Click here to reset