Delphi: Towards Machine Ethics and Norms

10/14/2021
by   Liwei Jiang, et al.

What would it take to teach a machine to behave ethically? While broad ethical rules may seem straightforward to state ("thou shalt not kill"), applying such rules to real-world situations is far more complex. For example, while "helping a friend" is generally a good thing to do, "helping a friend spread fake news" is not. We identify four underlying challenges towards machine ethics and norms: (1) an understanding of moral precepts and social norms; (2) the ability to perceive real-world situations visually or by reading natural language descriptions; (3) commonsense reasoning to anticipate the outcome of alternative actions in different contexts; (4) most importantly, the ability to make ethical judgments given the interplay between competing values and their grounding in different contexts (e.g., the right to freedom of expression vs. preventing the spread of fake news). Our paper begins to address these questions within the deep learning paradigm. Our prototype model, Delphi, demonstrates strong promise of language-based commonsense moral reasoning, with up to 92.1% accuracy vetted by humans. This is in stark contrast to the zero-shot performance of GPT-3 of 52.3%, which suggests that massive scale alone does not endow pre-trained neural language models with human values. Thus, we present Commonsense Norm Bank, a moral textbook customized for machines, which compiles 1.7M examples of people's ethical judgments on a broad spectrum of everyday situations. In addition to the new resources and baseline performances for future research, our study provides new insights that lead to several important open research questions: differentiating between universal human values and personal values, modeling different moral frameworks, and explainable, consistent approaches to machine ethics.

research
11/01/2020

Social Chemistry 101: Learning to Reason about Social and Moral Norms

Social norms—the unspoken commonsense rules about acceptable social beha...
research
12/11/2019

BERT has a Moral Compass: Improvements of ethical and moral values of machines

Allowing machines to choose whether to kill humans would be devastating ...
research
08/15/2019

Abductive Commonsense Reasoning

Abductive reasoning is inference to the most plausible explanation. For ...
research
12/05/2022

Fake News and Hate Speech: Language in Common

In this paper we raise the research question of whether fake news and ha...
research
03/01/2023

That's All Folks: a KG of Values as Commonsense Social Norms and Behaviors

Values, as intended in ethics, determine the shape and validity of moral...
research
09/17/2021

Repurposing of Resources: from Everyday Problem Solving through to Crisis Management

The human ability to repurpose objects and processes is universal, but i...
research
08/20/2020

Scruples: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes

As AI systems become an increasing part of people's everyday lives, it b...
