Unmasking Clever Hans Predictors and Assessing What Machines Really Learn

02/26/2019
by   Sebastian Lapuschkin, et al.
0

Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly "intelligent" behavior. Here we apply recent techniques for explaining decisions of state-of-the-art learning machines and analyze various tasks from computer vision and arcade games. This showcases a spectrum of problem-solving behaviors ranging from naive and short-sighted, to well-informed and strategic. We observe that standard performance evaluation metrics can be oblivious to distinguishing these diverse problem solving behaviors. Furthermore, we propose our semi-automated Spectral Relevance Analysis that provides a practically effective way of characterizing and validating the behavior of nonlinear learning machines. This helps to assess whether a learned model indeed delivers reliably for the problem that it was conceived for. Furthermore, our work intends to add a voice of caution to the ongoing excitement about machine intelligence and pledges to evaluate and judge some of these recent successes in a more nuanced manner.

READ FULL TEXT
research
06/05/2019

A Survey of Behavior Learning Applications in Robotics -- State of the Art and Perspectives

Recent success of machine learning in many domains has been overwhelming...
research
08/17/2017

On Ensuring that Intelligent Machines Are Well-Behaved

Machine learning algorithms are everywhere, ranging from simple data ana...
research
05/28/2023

ConvGenVisMo: Evaluation of Conversational Generative Vision Models

Conversational generative vision models (CGVMs) like Visual ChatGPT (Wu ...
research
04/26/2017

The MacGyver Test - A Framework for Evaluating Machine Resourcefulness and Creative Problem Solving

Current measures of machine intelligence are either difficult to evaluat...
research
05/26/2022

YASMIN: Yet Another State MachINe library for ROS 2

State machines are a common mechanism for defining behaviors in robots, ...
research
07/02/2023

Neuro-Symbolic Sudoku Solver

Deep Neural Networks have achieved great success in some of the complex ...
research
03/28/2018

Who Let The Dogs Out? Modeling Dog Behavior From Visual Data

We introduce the task of directly modeling a visually intelligent agent....

Please sign up or login with your details

Forgot password? Click here to reset