PMLB v1.0: An open source dataset collection for benchmarking machine learning methods

11/30/2020
by Joseph D. Romano et al.

Motivation: Novel machine learning and statistical modeling studies rely on standardized comparisons to existing methods using well-studied benchmark datasets. Few tools exist that provide rapid access to many of these datasets through a standardized, user-friendly interface that integrates well with popular data science workflows.

Results: This release of PMLB provides the largest collection of diverse, public benchmark datasets for evaluating new machine learning and data science methods aggregated in one location. v1.0 introduces a number of critical improvements developed following discussions with the open-source community.

Availability: PMLB is available at https://github.com/EpistasisLab/pmlb. Python and R interfaces for PMLB can be installed through the Python Package Index and Comprehensive R Archive Network, respectively.
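As a rough illustration of the Python interface mentioned above, the following sketch scores a trivial majority-class baseline across several datasets. It assumes the `pmlb` package has been installed from the Python Package Index and that its `fetch_data` function is called with `return_X_y=True`. The helper names (`load_pmlb`, `majority_baseline_accuracy`, `benchmark`) and the dataset name in the usage comment are illustrative choices, not part of the abstract.

```python
from collections import Counter
from typing import Callable, Dict, Iterable


def load_pmlb(name: str):
    """Fetch one dataset as (features, labels) through the pmlb package.

    Assumption: pmlb is installed (`pip install pmlb`) and the machine has
    network access or a local cache. The import is lazy so the rest of this
    sketch can run without pmlb.
    """
    from pmlb import fetch_data

    return fetch_data(name, return_X_y=True)


def majority_baseline_accuracy(labels) -> float:
    """Accuracy obtained by always predicting the most frequent label."""
    counts = Counter(v for v in labels)
    return max(counts.values()) / len(labels)


def benchmark(names: Iterable[str], loader: Callable = load_pmlb) -> Dict[str, float]:
    """Score the majority-class baseline on each named dataset.

    `loader` is injectable (a hypothetical design choice for this sketch) so
    the loop can be exercised offline with synthetic data.
    """
    results = {}
    for name in names:
        _, labels = loader(name)
        results[name] = majority_baseline_accuracy(labels)
    return results


# Example usage (requires pmlb and network access; dataset name is illustrative):
#   print(benchmark(["mushroom"]))
```

A baseline like this gives a floor against which a new method's accuracy on the same datasets can be compared. A real study would replace it with cross-validated models.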

research
07/05/2023

RamanSPy: An open-source Python package for integrative Raman spectroscopy data analysis

Raman spectroscopy is a non-destructive and label-free chemical analysis...
research
08/17/2018

Benchmarking Automatic Machine Learning Frameworks

AutoML serves as the bridge between varying levels of expertise when des...
research
03/04/2021

GenoML: Automated Machine Learning for Genomics

GenoML is a Python package automating machine learning workflows for gen...
research
11/22/2022

OpenFE: Automated Feature Generation beyond Expert-level Performance

The goal of automated feature generation is to liberate machine learning...
research
03/01/2023

audb – Sharing and Versioning of Audio and Annotation Data in Python

Driven by the need for larger and more diverse datasets to pre-train and...
research
12/08/2022

Graph Learning Indexer: A Contributor-Friendly and Metadata-Rich Platform for Graph Learning Benchmarks

Establishing open and general benchmarks has been a critical driving for...
research
05/16/2018

MOABB: Trustworthy algorithm benchmarking for BCIs

BCI algorithm development has long been hampered by two major issues: sm...
