Defining data science: a new field of inquiry

06/28/2023
by   Michael L Brodie, et al.
0

Data science is not a science. It is a research paradigm. Its power, scope, and scale will surpass science, our most powerful research paradigm, to enable knowledge discovery and change our world. We have yet to understand and define it, vital to realizing its potential and managing its risks. Modern data science is in its infancy. Emerging slowly since 1962 and rapidly since 2000, it is a fundamentally new field of inquiry, one of the most active, powerful, and rapidly evolving 21st century innovations. Due to its value, power, and applicability, it is emerging in over 40 disciplines, hundreds of research areas, and thousands of applications. Millions of data science publications contain myriad definitions of data science and data science problem solving. Due to its infancy, many definitions are independent, application specific, mutually incomplete, redundant, or inconsistent, hence so is data science. This research addresses this data science multiple definitions challenge by proposing the development of coherent, unified definition based on a data science reference framework using a data science journal for the data science community to achieve such a definition. This paper provides candidate definitions for essential data science artifacts that are required to discuss such a definition. They are based on the classical research paradigm concept consisting of a philosophy of data science, the data science problem solving paradigm, and the six component data science reference framework (axiology, ontology, epistemology, methodology, methods, technology) that is a frequently called for unifying framework with which to define, unify, and evolve data science. It presents challenges for defining data science, solution approaches, i.e., means for defining data science, and their requirements and benefits as the basis of a comprehensive solution.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/19/2023

A data science axiology: the nature, value, and risks of data science

Data science is not a science. It is a research paradigm with an unfatho...
research
12/27/2016

Theory-guided Data Science: A New Paradigm for Scientific Discovery from Data

Data science models, although successful in a number of commercial domai...
research
10/15/2019

Spatial Data Science: Closing the human-spatial computing-environment loop

Over the last decade, the term spatial computing has grown to have two d...
research
02/20/2020

Toward An Interdisciplinary Methodology to Solve New (Old) Transportation Problems

The rising availability of digital traces provides a fertile ground for ...
research
05/16/2022

A BenchCouncil View on Benchmarking Emerging and Future Computing

The measurable properties of the artifacts or objects in the computer, m...
research
06/08/2021

Defining definition: a Text mining Approach to Define Innovative Technological Fields

One of the first task of an innovative project is delineating the scope ...
research
11/25/2021

Federated Data Science to Break Down Silos [Vision]

Similar to Open Data initiatives, data science as a community has launch...

Please sign up or login with your details

Forgot password? Click here to reset