On Ensuring that Intelligent Machines Are Well-Behaved

08/17/2017
by Philip S. Thomas, et al.

Machine learning algorithms are everywhere, ranging from simple data analysis and pattern recognition tools used across the sciences to complex systems that achieve super-human performance on various tasks. Ensuring that they are well-behaved (that they do not, for example, cause harm to humans or act in a racist or sexist way) is therefore not a hypothetical problem to be dealt with in the future, but a pressing one that we address here. We propose a new framework for designing machine learning algorithms that simplifies the problem of specifying and regulating undesirable behaviors. To show the viability of this framework, we use it to create machine learning algorithms that preclude the sexist and harmful behaviors exhibited by standard machine learning algorithms in our experiments. The framework thereby simplifies the safe and responsible application of machine learning.


