chemmodlab: A Cheminformatics Modeling Laboratory for Fitting and Assessing Machine Learning Models

The goal of chemmodlab is to streamline the fitting and assessment pipeline for many machine learning models in R, making it easy for researchers to compare the utility of new models. While focused on implementing methods for model fitting and assessment that have been accepted by experts in the cheminformatics field, all of the methods in chemmodlab have broad utility for the machine learning community. chemmodlab contains several assessment utilities including a plotting function that constructs accumulation curves and a function that computes many performance measures. The most novel feature of chemmodlab is the ease with which statistically significant performance differences for many machine learning models is presented by means of the multiple comparisons similarity plot. Differences are assessed using repeated k-fold cross validation where blocking increases precision and multiplicity adjustments are applied.

READ FULL TEXT

page 2

page 11

page 13

page 14

page 15

page 16

page 17

page 18

research
01/23/2020

Improving generalisation of AutoML systems with dynamic fitness evaluations

A common problem machine learning developers are faced with is overfitti...
research
02/12/2020

A Hierarchy of Limitations in Machine Learning

"All models are wrong, but some are useful", wrote George E. P. Box (197...
research
05/17/2023

Nine tips for ecologists using machine learning

Due to their high predictive performance and flexibility, machine learni...
research
01/11/2021

Distributed Double Machine Learning with a Serverless Architecture

This paper explores serverless cloud computing for double machine learni...
research
01/30/2023

MOSAIC, acomparison framework for machine learning models

We introduce MOSAIC, a Python program for machine learning models. Our f...
research
03/13/2023

Assessing the performance of spatial cross-validation approaches for models of spatially structured data

Evaluating models fit to data with internal spatial structure requires s...
research
06/02/2021

Undecidability of Learnability

Machine learning researchers and practitioners steadily enlarge the mult...

Please sign up or login with your details

Forgot password? Click here to reset