Can large language models democratize access to dual-use biotechnology?

06/06/2023
by Emily H. Soice, et al.

Large language models (LLMs) such as those embedded in 'chatbots' are accelerating and democratizing research by providing comprehensible information and expertise from many different fields. However, these models may also confer easy access to dual-use technologies capable of inflicting great harm. To evaluate this risk, the 'Safeguarding the Future' course at MIT tasked non-scientist students with investigating whether LLM chatbots could be prompted to assist non-experts in causing a pandemic. In one hour, the chatbots suggested four potential pandemic pathogens, explained how they can be generated from synthetic DNA using reverse genetics, supplied the names of DNA synthesis companies unlikely to screen orders, identified detailed protocols and how to troubleshoot them, and recommended that anyone lacking the skills to perform reverse genetics engage a core facility or contract research organization. Collectively, these results suggest that LLMs will make pandemic-class agents widely accessible as soon as they are credibly identified, even to people with little or no laboratory training. Promising nonproliferation measures include pre-release evaluations of LLMs by third parties, curating training datasets to remove harmful concepts, and verifiably screening all DNA generated by synthesis providers or used by contract research organizations and robotic cloud laboratories to engineer organisms or viruses.

research
06/24/2023

Artificial intelligence and biological misuse: Differentiating risks of language models and biological design tools

As advancements in artificial intelligence propel progress in the life s...
research
11/30/2020

Batch Optimization for DNA Synthesis

Large pools of synthetic DNA molecules have been recently used to reliab...
research
10/18/2021

DNA Codes over the Ring ℤ_4 + wℤ_4

In this present work, we generalize the study of construction of DNA cod...
research
08/14/2023

CodeHelp: Using Large Language Models with Guardrails for Scalable Support in Programming Classes

Computing educators face significant challenges in providing timely supp...
research
05/07/2020

Coding for Optimized Writing Rate in DNA Storage

A method for encoding information in DNA sequences is described. The met...
research
11/28/2020

Cyberbiosecurity: DNA Injection Attack in Synthetic Biology

Today arbitrary synthetic DNA can be ordered online and delivered within...
research
10/19/2022

NDN-TR70 – Utilizing NDN-DPDK for Kubernetes Genomics Data Lake

As the growth of genomics samples rapidly expands due to increased acces...
