Towards a Benchmark for Scientific Understanding in Humans and Machines

04/20/2023
by   Kristian Gonzalez Barman, et al.
0

Scientific understanding is a fundamental goal of science, allowing us to explain the world. There is currently no good way to measure the scientific understanding of agents, whether these be humans or Artificial Intelligence systems. Without a clear benchmark, it is challenging to evaluate and compare different levels of and approaches to scientific understanding. In this Roadmap, we propose a framework to create a benchmark for scientific understanding, utilizing tools from philosophy of science. We adopt a behavioral notion according to which genuine understanding should be recognized as an ability to perform certain tasks. We extend this notion by considering a set of questions that can gauge different levels of scientific understanding, covering information retrieval, the capability to arrange information to produce an explanation, and the ability to infer how things would be different under different circumstances. The Scientific Understanding Benchmark (SUB), which is formed by a set of these tests, allows for the evaluation and comparison of different approaches. Benchmarking plays a crucial role in establishing trust, ensuring quality control, and providing a basis for performance evaluation. By aligning machine and human scientific understanding we can improve their utility, ultimately advancing scientific understanding and helping to discover new insights within machines.

READ FULL TEXT

page 6

page 8

research
04/04/2022

On scientific understanding with artificial intelligence

Imagine an oracle that correctly predicts the outcome of every particle ...
research
10/26/2020

Understanding understanding: a renormalization group inspired model of (artificial) intelligence

This paper is about the meaning of understanding in scientific and in ar...
research
01/18/2021

Dissonance Between Human and Machine Understanding

Complex machine learning models are deployed in several critical domains...
research
05/22/2023

Beneath Surface Similarity: Large Language Models Make Reasonable Scientific Analogies after Structure Abduction

Analogical reasoning is essential for human cognition, allowing us to co...
research
12/04/2015

What Makes it Difficult to Understand a Scientific Literature?

In the artificial intelligence area, one of the ultimate goals is to mak...
research
11/06/2016

Learning to Perform Physics Experiments via Deep Reinforcement Learning

When encountering novel objects, humans are able to infer a wide range o...
research
01/08/2021

SDRBench: Scientific Data Reduction Benchmark for Lossy Compressors

Efficient error-controlled lossy compressors are becoming critical to th...

Please sign up or login with your details

Forgot password? Click here to reset