Statistically Enhanced Learning: a feature engineering framework to boost (any) learning algorithms

06/29/2023
by   Florian Felice, et al.
0

Feature engineering is of critical importance in the field of Data Science. While any data scientist knows the importance of rigorously preparing data to obtain good performing models, only scarce literature formalizes its benefits. In this work, we will present the method of Statistically Enhanced Learning (SEL), a formalization framework of existing feature engineering and extraction tasks in Machine Learning (ML). The difference compared to classical ML consists in the fact that certain predictors are not directly observed but obtained as statistical estimators. Our goal is to study SEL, aiming to establish a formalized framework and illustrate its improved performance by means of simulations as well as applications on real life use cases.

READ FULL TEXT
research
07/29/2019

sql4ml A declarative end-to-end workflow for machine learning

We present sql4ml, a system for expressing supervised machine learning (...
research
09/30/2022

Empowering the trustworthiness of ML-based critical systems through engineering activities

This paper reviews the entire engineering process of trustworthy Machine...
research
12/14/2020

Enabling collaborative data science development with the Ballet framework

While the open-source model for software development has led to successf...
research
03/26/2021

FeatureEnVi: Visual Analytics for Feature Engineering Using Stepwise Selection and Semi-Automatic Extraction Approaches

The machine learning (ML) life cycle involves a series of iterative step...
research
10/09/2019

Provenance Data in the Machine Learning Lifecycle in Computational Science and Engineering

Machine Learning (ML) has become essential in several industries. In Com...
research
12/22/2021

Machine Learning for Computational Science and Engineering – a brief introduction and some critical questions

Artificial Intelligence (AI) is now entering every sub-field of science,...
research
09/21/2018

Lexical Bias In Essay Level Prediction

Automatically predicting the level of non-native English speakers given ...

Please sign up or login with your details

Forgot password? Click here to reset