TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents

08/07/2023
by   Jingqing Ruan, et al.
0

With recent advancements in natural language processing, Large Language Models (LLMs) have emerged as powerful tools for various real-world applications. Despite their prowess, the intrinsic generative abilities of LLMs may prove insufficient for handling complex tasks which necessitate a combination of task planning and the usage of external tools. In this paper, we first propose a structured framework tailored for LLM-based AI Agents and discuss the crucial capabilities necessary for tackling intricate problems. Within this framework, we design two distinct types of agents (i.e., one-step agent and sequential agent) to execute the inference process. Subsequently, we instantiate the framework using various LLMs and evaluate their Task Planning and Tool Usage (TPTU) abilities on typical tasks. By highlighting key findings and challenges, our goal is to provide a helpful resource for researchers and practitioners to leverage the power of LLMs in their AI applications. Our study emphasizes the substantial potential of these models, while also identifying areas that need more investigation and improvement.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/15/2022

Integrating AI Planning with Natural Language Processing: A Combination of Explicit and Tacit Knowledge

Automated planning focuses on strategies, building domain models and syn...
research
09/02/2023

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models

Large language models (LLMs) have recently demonstrated remarkable capab...
research
08/20/2023

ChatEDA: A Large Language Model Powered Autonomous Agent for EDA

The integration of a complex set of Electronic Design Automation (EDA) t...
research
09/07/2023

Large Language Models as Optimizers

Optimization is ubiquitous. While derivative-based algorithms have been ...
research
06/08/2023

ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases

Enabling large language models to effectively utilize real-world tools i...
research
08/08/2023

AgentSims: An Open-Source Sandbox for Large Language Model Evaluation

With ChatGPT-like large language models (LLM) prevailing in the communit...
research
02/22/2021

Software Architecture for Next-Generation AI Planning Systems

Artificial Intelligence (AI) planning is a flourishing research and deve...

Please sign up or login with your details

Forgot password? Click here to reset