Top-down Transformation Choice

06/26/2017
by Torsten Hothorn, et al.

Simple models are preferred over complex models, but overly simplistic models can lead to erroneous interpretations. The classical approach is to start with a simple model, whose shortcomings are assessed in residual-based model diagnostics. One then increases the complexity of this initial, overly simple model to obtain a better-fitting model. I illustrate how transformation analysis can be used as an alternative approach to model choice. Instead of adding complexity to simple models, step-wise complexity reduction is used to help identify simpler and better-interpretable models. As an example, body mass index distributions in Switzerland are modelled by means of transformation models to understand the impact of sex, age, smoking and other lifestyle factors on a person's body mass index. In this process, I search for a compromise between model fit and model interpretability. Special emphasis is given to understanding the connections between transformation models of increasing complexity. The models used in this analysis range from evergreens, such as the normal linear regression model with constant variance, to novel models with extremely flexible conditional distribution functions, such as transformation trees and transformation forests.
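
To make the link between the simplest and the more flexible models concrete, the following is a minimal sketch of how the normal linear regression model can be written as a transformation model. The notation is an assumption introduced here for illustration and does not appear in the abstract: $Y$ is the response (body mass index), $x$ the covariates, $h$ a transformation function that is monotonically increasing in $y$, and $\Phi$ the standard normal distribution function.

```latex
% General conditional transformation model (notation assumed for illustration):
\[
  \mathbb{P}(Y \le y \mid X = x) = \Phi\bigl(h(y \mid x)\bigr)
\]
% Normal linear regression with constant variance is the special case in
% which h is linear in y and the covariates only shift it:
\[
  h(y \mid x) = \frac{y - \alpha - x^\top \beta}{\sigma}
  \quad\Longleftrightarrow\quad
  Y \mid X = x \sim \mathcal{N}\bigl(\alpha + x^\top \beta,\ \sigma^2\bigr)
\]
```

Read this way, relaxing the linearity of $h$ in $y$, or letting $h$ depend on $x$ in more general ways, gives more flexible conditional distributions, and constraining $h$ leads back towards the simple model.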


