Breiman's two cultures: You don't have to choose sides

04/25/2021
by Andrew C. Miller, et al.

Breiman's classic paper casts data analysis as a choice between two cultures: data modelers and algorithmic modelers. Broadly, data modelers analyze data with simple, interpretable models whose theoretical properties are well understood, whereas algorithmic modelers prioritize predictive accuracy and rely on more flexible function approximators. This dichotomy overlooks a third set of models: mechanistic models derived from scientific theories (e.g., ODE/SDE simulators), which encode application-specific scientific knowledge about the data. These three categories represent extreme points in model space, but modern computational and algorithmic tools enable us to interpolate between them, producing flexible, interpretable, and scientifically informed hybrids that can deliver accurate and robust predictions and resolve issues with data analysis that Breiman describes, such as the Rashomon effect and Occam's dilemma. Challenges remain in finding an appropriate point in model space: there are many choices for how to compose model components and for the degree to which each component informs inferences.
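
To make the idea of interpolating between mechanistic and flexible models concrete, here is a minimal sketch and not the authors' method. It assumes a toy mechanistic model (the closed-form solution of dy/dt = -k y), a hypothetical flexible component built from Fourier features, and a ridge penalty `lam` that controls how much the flexible component is allowed to inform the fit. The function names, the synthetic data, and all parameter values are illustrative assumptions.

```python
import numpy as np

def mechanistic(t, k):
    """Closed-form solution of the ODE dy/dt = -k * y with y(0) = 1."""
    return np.exp(-k * t)

def fit_decay_rate(t, y):
    """Choose the decay rate k by least squares over a coarse grid."""
    grid = np.linspace(0.05, 3.0, 300)
    errors = [np.mean((y - mechanistic(t, k)) ** 2) for k in grid]
    return float(grid[int(np.argmin(errors))])

def features(t, n_freq=5):
    """Fourier features: the flexible, 'algorithmic' component (assumed)."""
    freqs = np.arange(1, n_freq + 1)
    angles = np.outer(t, freqs)
    return np.hstack([np.sin(angles), np.cos(angles)])

def fit_residual(t, resid, lam):
    """Ridge regression of the mechanistic residual on the features."""
    phi = features(t)
    gram = phi.T @ phi + lam * np.eye(phi.shape[1])
    return np.linalg.solve(gram, phi.T @ resid)

def hybrid_predict(t, k, w):
    """Mechanistic prediction plus the learned flexible correction."""
    return mechanistic(t, k) + features(t) @ w

# Synthetic data (an assumption for illustration): exponential decay plus
# an oscillation that the mechanistic model cannot represent, plus noise.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 5.0, 200)
y = np.exp(-0.8 * t) + 0.3 * np.sin(2.0 * t) + 0.05 * rng.normal(size=t.size)

# Step 1: fit the mechanistic model alone.
k_hat = fit_decay_rate(t, y)
resid = y - mechanistic(t, k_hat)

# Step 2: let the flexible component explain what is left over.
w = fit_residual(t, resid, lam=1.0)

mse_mech = float(np.mean(resid ** 2))
mse_hybrid = float(np.mean((y - hybrid_predict(t, k_hat, w)) ** 2))
print(f"mechanistic-only MSE: {mse_mech:.4f}")
print(f"hybrid MSE:           {mse_hybrid:.4f}")
```

In this sketch, a very large `lam` shrinks the weights toward zero and recovers the purely mechanistic model, while a small `lam` lets the flexible term dominate. The errors shown are in-sample, so they illustrate the trade-off and say nothing about generalization.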


