An Evaluation of GPT-4 on the ETHICS Dataset

09/19/2023
by   Sergey Rodionov, et al.
0

This report summarizes a short study of the performance of GPT-4 on the ETHICS dataset. The ETHICS dataset consists of five sub-datasets covering different fields of ethics: Justice, Deontology, Virtue Ethics, Utilitarianism, and Commonsense Ethics. The moral judgments were curated so as to have a high degree of agreement with the aim of representing shared human values rather than moral dilemmas. GPT-4's performance is much better than that of previous models and suggests that learning to work with common human values is not the hard problem for AI ethics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2020

Deriving Commonsense Inference Tasks from Interactive Fictions

Commonsense reasoning simulates the human ability to make presumptions a...
research
08/05/2020

Aligning AI With Shared Human Values

We show how to assess a language model's knowledge of basic concepts of ...
research
01/31/2023

The Touché23-ValueEval Dataset for Identifying Human Values behind Arguments

We present the Touché23-ValueEval Dataset for Identifying Human Values b...
research
04/08/2019

CODAH: An Adversarially Authored Question-Answer Dataset for Common Sense

Commonsense reasoning is a critical AI capability, but it is difficult t...
research
04/08/2019

AQuA: An Adversarially Authored Question-Answer Dataset for Common Sense

Commonsense reasoning is a critical AI capability, but it is difficult t...
research
12/23/2021

Toward a New Science of Common Sense

Common sense has always been of interest in AI, but has rarely taken cen...
research
05/22/2022

Commonsense Knowledge Salience Evaluation with a Benchmark Dataset in E-commerce

In e-commerce, the salience of commonsense knowledge (CSK) is beneficial...

Please sign up or login with your details

Forgot password? Click here to reset