A Survey on Semantics in Automated Data Science

05/16/2022
by   Udayan Khurana, et al.
0

Data Scientists leverage common sense reasoning and domain knowledge to understand and enrich data for building predictive models. In recent years, we have witnessed a surge in tools and techniques for automated machine learning. While data scientists can employ various such tools to help with model building, many other aspects such as feature engineering that require semantic understanding of concepts, remain manual to a large extent. In this paper we discuss important shortcomings of current automated data science solutions and machine learning. We discuss how leveraging basic semantic reasoning on data in combination with novel tools for data science automation can help with consistent and explainable data augmentation and transformation. Moreover, semantics can assist data scientists in a new manner by helping with challenges related to trust, bias, and explainability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/02/2023

A Vision for Semantically Enriched Data Science

The recent efforts in automation of machine learning or data science has...
research
03/08/2022

Model Positionality and Computational Reflexivity: Promoting Reflexivity in Data Science

Data science and machine learning provide indispensable techniques for u...
research
04/28/2019

Real numbers, data science and chaos: How to fit any dataset with a single parameter

We show how any dataset of any modality (time-series, images, sound...) ...
research
05/05/2023

GPT for Semi-Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering

As the field of automated machine learning (AutoML) advances, it becomes...
research
11/29/2018

Prediction Factory: automated development and collaborative evaluation of predictive models

In this paper, we present a data science automation system called Predic...
research
02/07/2023

Landscape of High-performance Python to Develop Data Science and Machine Learning Applications

Python has become the prime language for application development in the ...
research
04/26/2019

Survey on Automated Machine Learning

Machine learning has become a vital part in many aspects of our daily li...

Please sign up or login with your details

Forgot password? Click here to reset