Representational Multiplicity Should Be Exposed, Not Eliminated

06/17/2022
by   Ari Heljakka, et al.
0

It is prevalent and well-observed, but poorly understood, that two machine learning models with similar performance during training can have very different real-world performance characteristics. This implies elusive differences in the internals of the models, manifesting as representational multiplicity (RM). We introduce a conceptual and experimental setup for analyzing RM and show that certain training methods systematically result in greater RM than others, measured by activation similarity via singular vector canonical correlation analysis (SVCCA). We further correlate it with predictive multiplicity measured by the variance in i.i.d. and out-of-distribution test set predictions, in four common image data sets. We call for systematic measurement and maximal exposure, not elimination, of RM in models. Qualitative tools such as our confabulator analysis can facilitate understanding and communication of RM effects to stakeholders.

READ FULL TEXT
research
04/30/2013

Generalized Canonical Correlation Analysis for Classification

For multiple multivariate data sets, we derive conditions under which Ge...
research
06/28/2020

Modeling Generalization in Machine Learning: A Methodological and Computational Study

As machine learning becomes more and more available to the general publi...
research
09/03/2020

A general approach to bridge the reality-gap

Employing machine learning models in the real world requires collecting ...
research
12/05/2012

Making Early Predictions of the Accuracy of Machine Learning Applications

The accuracy of machine learning systems is a widely studied research to...
research
11/22/2019

Unsupervised Features Learning for Sampled Vector Fields

In this paper we introduce a new approach to computing hidden features o...
research
01/25/2019

Learning Models from Data with Measurement Error: Tackling Underreporting

Measurement error in observational datasets can lead to systematic bias ...
research
06/19/2017

SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability

We propose a new technique, Singular Vector Canonical Correlation Analys...

Please sign up or login with your details

Forgot password? Click here to reset