A compendium of data sources for data science, machine learning, and artificial intelligence

09/10/2023
by Paul Bilokon, et al.

Recent advances in data science, machine learning, and artificial intelligence, such as the emergence of large language models, are driving an increasing demand for data that such models can process. Data sources are application-specific, and an exhaustive list of them is impossible to produce; even so, it seems that a comprehensive, rather than complete, list would still benefit data scientists and machine learning experts at all levels of seniority. The goal of this publication is to provide just such an (inevitably incomplete) list, or compendium, of data sources across multiple application areas, including finance and economics, legal (laws and regulations), life sciences (medicine and drug discovery), news sentiment and social media, retail and ecommerce, satellite imagery, shipping and logistics, and sports.


