Fixes That Fail: Self-Defeating Improvements in Machine-Learning Systems

03/22/2021
by Ruihan Wu, et al.

Machine-learning systems such as self-driving cars or virtual assistants are composed of a large number of machine-learning models that recognize image content, transcribe speech, analyze natural language, infer preferences, rank options, etc. These systems can be represented as directed acyclic graphs in which each vertex is a model, and models feed each other information over the edges. Oftentimes, the models are developed and trained independently, which raises an obvious concern: Can improving a machine-learning model make the overall system worse? We answer this question affirmatively by showing that improving a model can deteriorate the performance of downstream models, even after those downstream models are retrained. Such self-defeating improvements are the result of entanglement between the models. We identify different types of entanglement and demonstrate via simple experiments how they can produce self-defeating improvements. We also show that self-defeating improvements emerge in a realistic stereo-based object detection system.
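The effect described in the abstract can be illustrated with a toy two-model system. The sketch below is not one of the paper's experiments: the data, the `old_upstream` and `new_upstream` models, and the majority-vote `retrain_downstream` helper are all hypothetical, and the downstream model is assumed to see only the upstream model's output. For brevity, it evaluates on the same eight samples it trains on.

```python
from collections import Counter, defaultdict

# Hypothetical toy data: each input is a pair of bits (a, b).
# The upstream task is to predict a; the downstream task is to predict b.
# The two labels agree on 6 of the 8 samples.
inputs = [(0, 0), (0, 0), (0, 0), (1, 0),
          (1, 1), (1, 1), (1, 1), (0, 1)]
upstream_labels = [a for a, _ in inputs]
downstream_labels = [b for _, b in inputs]


def old_upstream(x):
    # Imperfect on the upstream task, but its output happens to track b.
    return x[1]


def new_upstream(x):
    # "Improved" model: perfect on the upstream task.
    return x[0]


def accuracy(predictions, labels):
    return sum(p == t for p, t in zip(predictions, labels)) / len(labels)


def retrain_downstream(features, labels):
    # Best possible lookup table: map each upstream output to the
    # downstream label that co-occurs with it most often.
    votes = defaultdict(Counter)
    for feature, label in zip(features, labels):
        votes[feature][label] += 1
    table = {f: counts.most_common(1)[0][0] for f, counts in votes.items()}
    return lambda feature: table[feature]


def evaluate(upstream):
    # Returns (upstream accuracy, downstream accuracy after retraining).
    features = [upstream(x) for x in inputs]
    downstream = retrain_downstream(features, downstream_labels)
    downstream_predictions = [downstream(f) for f in features]
    return (accuracy(features, upstream_labels),
            accuracy(downstream_predictions, downstream_labels))


old_scores = evaluate(old_upstream)
new_scores = evaluate(new_upstream)
print("old upstream -> upstream acc %.2f, downstream acc %.2f" % old_scores)
print("new upstream -> upstream acc %.2f, downstream acc %.2f" % new_scores)
```

In this toy setup the upstream accuracy rises from 0.75 to 1.00 while the downstream accuracy falls from 1.00 to 0.75, even though the downstream model is refit to the new upstream output. The old model's errors happened to carry information about the downstream label, and the improved model discards it.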


