Interpretable Model Summaries Using the Wasserstein Distance

12/18/2020
by Eric Dunipace, et al.

In the current computing age, models can have hundreds or even thousands of parameters; however, models this large make it harder to interpret and communicate individual parameters. Reducing the dimensionality of the parameter space in the estimation phase is a commonly used technique, but less work has focused on selecting subsets of the parameters to focus on for interpretation, especially in settings such as Bayesian inference or bootstrapped frequentist inference that consider a distribution of estimates. Moreover, many models are not themselves easily interpretable, which adds another layer of obfuscation. To fill this gap, we introduce a new method that uses the Wasserstein distance to generate a low-dimensional interpretable model. After estimating the main model, users can budget how many parameters they wish to interpret, and our method will estimate an interpretable model of the desired dimension that minimizes the distance to the full model. We provide simulation results demonstrating the effectiveness of the proposed method and apply the method to cancer data.
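To make the idea of "budgeting" parameters concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a linear model with posterior (or bootstrap) draws of the coefficients, pairs each full-model draw with its least-squares projection onto a candidate subset of covariates, and uses the root mean squared distance between the paired predictions. Under that one-to-one pairing the quantity is the cost of the identity coupling, so it is only an upper bound on the 2-Wasserstein distance between the two predictive distributions. The greedy forward search and the names `coupled_w2`, `project_draws`, and `greedy_summary` are all hypothetical choices made for illustration.

```python
import numpy as np


def coupled_w2(pred_full, pred_reduced):
    """Root mean squared distance between paired prediction draws.

    Rows are draws, columns are observations. Pairing draw s with draw s is
    the identity coupling, so this is an upper bound on the 2-Wasserstein
    distance between the two empirical distributions (an assumption of this
    sketch, not necessarily the paper's estimator).
    """
    sq_dist = np.sum((pred_full - pred_reduced) ** 2, axis=1)
    return float(np.sqrt(np.mean(sq_dist)))


def project_draws(X, pred_full, active):
    """Least-squares projection of every draw's predictions onto X[:, active]."""
    Xa = X[:, active]
    # Each column of pred_full.T is one draw's prediction vector (length n).
    coef, *_ = np.linalg.lstsq(Xa, pred_full.T, rcond=None)
    return (Xa @ coef).T


def greedy_summary(X, theta_draws, budget):
    """Greedily pick `budget` covariates whose projected predictions stay
    closest to the full model's predictions.

    X           : (n, p) design matrix
    theta_draws : (S, p) coefficient draws from the full model
    Returns the selected column indices and the distance after each step.
    """
    pred_full = theta_draws @ X.T  # (S, n) full-model predictions
    active, path = [], []
    for _ in range(budget):
        best_j, best_d = None, np.inf
        for j in range(X.shape[1]):
            if j in active:
                continue
            d = coupled_w2(pred_full, project_draws(X, pred_full, active + [j]))
            if d < best_d:
                best_j, best_d = j, d
        active.append(best_j)
        path.append(best_d)
    return active, path
```

Calling `greedy_summary(X, theta_draws, budget=k)` returns the `k` selected columns together with the distance reached after each addition, which gives a rough view of how much predictive fidelity each extra interpreted parameter buys under these assumptions.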


