Sparks of Artificial General Intelligence: Early experiments with GPT-4

03/22/2023
by Sébastien Bubeck, et al.

Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an early version of GPT-4, when it was still in active development by OpenAI. We contend that (this early version of) GPT-4 is part of a new cohort of LLMs (along with ChatGPT and Google's PaLM for example) that exhibit more general intelligence than previous AI models. We discuss the rising capabilities and implications of these models. We demonstrate that, beyond its mastery of language, GPT-4 can solve novel and difficult tasks that span mathematics, coding, vision, medicine, law, psychology and more, without needing any special prompting. Moreover, in all of these tasks, GPT-4's performance is strikingly close to human-level performance, and often vastly surpasses prior models such as ChatGPT. Given the breadth and depth of GPT-4's capabilities, we believe that it could reasonably be viewed as an early (yet still incomplete) version of an artificial general intelligence (AGI) system. In our exploration of GPT-4, we put special emphasis on discovering its limitations, and we discuss the challenges ahead for advancing towards deeper and more comprehensive versions of AGI, including the possible need for pursuing a new paradigm that moves beyond next-word prediction. We conclude with reflections on societal influences of the recent technological leap and future research directions.


