Unsolved Problems in ML Safety

09/28/2021
by   Dan Hendrycks, et al.
0

Machine learning (ML) systems are rapidly increasing in size, are acquiring new capabilities, and are increasingly deployed in high-stakes settings. As with other powerful technologies, safety for ML should be a leading research priority. In response to emerging safety challenges in ML, such as those introduced by recent large-scale models, we provide a new roadmap for ML Safety and refine the technical problems that the field needs to address. We present four problems ready for research, namely withstanding hazards ("Robustness"), identifying hazards ("Monitoring"), steering ML systems ("Alignment"), and reducing risks to how ML systems are handled ("External Safety"). Throughout, we clarify each problem's motivation and provide concrete research directions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/06/2023

Concrete Safety for ML Problems: System Safety for ML Development and Assessment

Many stakeholders struggle to make reliances on ML-driven systems due to...
research
09/04/2023

MLGuard: Defend Your Machine Learning Model!

Machine Learning (ML) is used in critical highly regulated and high-stak...
research
01/14/2022

A causal model of safety assurance for machine learning

This paper proposes a framework based on a causal model of safety upon w...
research
07/20/2023

Deceptive Alignment Monitoring

As the capabilities of large machine learning models continue to grow, a...
research
08/30/2019

Cloudy with high chance of DBMS: A 10-year prediction for Enterprise-Grade ML

Machine learning (ML) has proven itself in high-value web applications s...
research
05/05/2023

All models are local: time to replace external validation with recurrent local validation

External validation is often recommended to ensure the generalizability ...
research
01/24/2020

When Wireless Security Meets Machine Learning: Motivation, Challenges, and Research Directions

Wireless systems are vulnerable to various attacks such as jamming and e...

Please sign up or login with your details

Forgot password? Click here to reset