Experimental Design for Bathymetry Editing

07/15/2020
by   Julaiti Alafate, et al.
11

We describe an application of machine learning to a real-world computer assisted labeling task. Our experimental results expose significant deviations from the IID assumption commonly used in machine learning. These results suggest that the common random split of all data into training and testing can often lead to poor performance.

READ FULL TEXT
research
02/07/2022

Optimal Ratio for Data Splitting

It is common to split a dataset into training and testing sets before fi...
research
12/17/2021

Quality of Data in Machine Learning

A common assumption exists according to which machine learning models im...
research
03/19/2020

Homeostasis phenomenon in predictive inference when using a wrong learning model: a tale of random split of data into training and test sets

This note uses a conformal prediction procedure to provide further suppo...
research
06/14/2023

Distribution Shift Inversion for Out-of-Distribution Prediction

Machine learning society has witnessed the emergence of a myriad of Out-...
research
05/21/2021

Nori: Concealing the Concealed Identifier in 5G

IMSI catchers have been a long standing and serious privacy problem in p...
research
06/08/2021

A critical look at the current train/test split in machine learning

The randomized or cross-validated split of training and testing sets has...
research
07/20/2018

TESSERACT: Eliminating Experimental Bias in Malware Classification across Space and Time

Academic research on machine learning-based malware classification appea...

Please sign up or login with your details

Forgot password? Click here to reset