A Rate-Distortion Framework for Explaining Black-box Model Decisions

10/12/2021
by Stefan Kolek, et al.

We present the Rate-Distortion Explanation (RDE) framework, a mathematically well-founded method for explaining black-box model decisions. The framework is based on perturbations of the target input signal and applies to any differentiable pre-trained model, such as a neural network. Our experiments demonstrate the framework's adaptability to diverse data modalities, particularly images, audio, and physical simulations of urban environments.
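
To make the perturbation idea concrete, here is a minimal sketch of one way a rate-distortion style explanation could be computed. The abstract does not state the objective, so everything below is an assumption made for illustration: a mask s in [0, 1] keeps input components where it is near 1 and replaces them with Gaussian noise where it is near 0, distortion is the expected squared change in the model output, and an L1 penalty weighted by lam (a hypothetical parameter) favours sparse masks. The toy model f(x) = sigmoid(w · x) and the function name rde_mask are placeholders, not the authors' implementation.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def rde_mask(w, x, lam=0.005, lr=0.5, steps=400, n_samples=256, seed=0):
    """Estimate a relevance mask for the toy model f(x) = sigmoid(w @ x).

    Assumed objective (illustrative only):
        minimise over s in [0, 1]^d:  E_n[(f(x) - f(s*x + (1-s)*n))^2] + lam * sum(s)
    with n drawn from a standard Gaussian.
    """
    rng = np.random.default_rng(seed)
    s = np.full_like(x, 0.5)           # start with every component half kept
    fx = sigmoid(w @ x)                # model output on the unperturbed input
    for _ in range(steps):
        noise = rng.standard_normal((n_samples, x.size))
        y = s * x + (1.0 - s) * noise  # perturbed inputs, one per row
        fy = sigmoid(y @ w)            # model outputs on perturbed inputs
        dfy = fy * (1.0 - fy)          # derivative of the sigmoid at y @ w
        # Chain rule: d/ds (fx - fy)^2 = -2 (fx - fy) * f'(y) * w * (x - n)
        grad_d = (-2.0 * (fx - fy) * dfy)[:, None] * w * (x - noise)
        # Since s >= 0, the gradient of lam * sum(s) is simply lam.
        grad = grad_d.mean(axis=0) + lam
        s = np.clip(s - lr * grad, 0.0, 1.0)  # projected gradient step
    return s


# Toy example: only the first input component influences the model.
w = np.array([3.0, 0.0, 0.0, 0.0])
x = np.ones(4)
mask = rde_mask(w, x)
print(mask)
```

In this toy setting the mask stays high on the one component the model depends on and shrinks to zero on the rest, because the sparsity penalty is the only force acting on components that do not affect the output. For a real network, the hand-written gradient would be replaced by automatic differentiation.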
