Galactica: A Large Language Model for Science

11/16/2022
by   Ross Taylor, et al.
13

Information overload is a major obstacle to scientific progress. The explosive growth in scientific literature and data has made it ever harder to discover useful insights in a large mass of information. Today scientific knowledge is accessed through search engines, but they are unable to organize scientific knowledge alone. In this paper we introduce Galactica: a large language model that can store, combine and reason about scientific knowledge. We train on a large scientific corpus of papers, reference material, knowledge bases and many other sources. We outperform existing models on a range of scientific tasks. On technical knowledge probes such as LaTeX equations, Galactica outperforms the latest GPT-3 by 68.2 performs well on reasoning, outperforming Chinchilla on mathematical MMLU by 41.3 also sets a new state-of-the-art on downstream tasks such as PubMedQA and MedMCQA dev of 77.6 corpus, Galactica outperforms BLOOM and OPT-175B on BIG-bench. We believe these results demonstrate the potential for language models as a new interface for science. We open source the model for the benefit of the scientific community.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/03/2020

Elsevier OA CC-By Corpus

We introduce the Elsevier OA CC-BY corpus. This is the first open corpus...
research
09/30/2021

MatSciBERT: A Materials Domain Language Model for Text Mining and Information Extraction

An overwhelmingly large amount of knowledge in the materials domain is g...
research
11/29/2022

Improving astroBERT using Semantic Textual Similarity

The NASA Astrophysics Data System (ADS) is an essential tool for researc...
research
08/25/2023

DARWIN Series: Domain Specific Large Language Models for Natural Science

Emerging tools bring forth fresh approaches to work, and the field of na...
research
11/27/2021

Common Sense Knowledge Learning for Open Vocabulary Neural Reasoning: A First View into Chronic Disease Literature

In this paper, we address reasoning tasks from open vocabulary Knowledge...
research
08/02/2023

LLMs Understand Glass-Box Models, Discover Surprises, and Suggest Repairs

We show that large language models (LLMs) are remarkably good at working...
research
05/24/2023

The ACL OCL Corpus: advancing Open science in Computational Linguistics

We present a scholarly corpus from the ACL Anthology to assist Open scie...

Please sign up or login with your details

Forgot password? Click here to reset