Landscape of High-performance Python to Develop Data Science and Machine Learning Applications

02/07/2023
by   Oscar Castro, et al.
0

Python has become the prime language for application development in the Data Science and Machine Learning domains. However, data scientists are not necessarily experienced programmers. While Python lets them quickly implement their algorithms, when moving at scale, computation efficiency becomes inevitable. Thus, harnessing high-performance devices such as multicore processors and Graphical Processing Units (GPUs) to their potential is generally not trivial. The present narrative survey was thought as a reference document for such practitioners to help them make their way in the wealth of tools and techniques available for the Python language. Our document revolves around user scenarios, which are meant to cover most situations they may face. We believe that this document may also be of practical use to tool developers, who may use our work to identify potential lacks in existing tools and help them motivate their contributions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2016

Want Drugs? Use Python

We describe how Python can be leveraged to streamline the curation, mode...
research
02/12/2020

Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence

Smarter applications are making better use of the insights gleaned from ...
research
03/18/2021

KGTorrent: A Dataset of Python Jupyter Notebooks from Kaggle

Computational notebooks have become the tool of choice for many data sci...
research
05/16/2022

A Survey on Semantics in Automated Data Science

Data Scientists leverage common sense reasoning and domain knowledge to ...
research
03/15/2023

PyExaFMM: an exercise in designing high-performance software with Python and Numba

Numba is a game-changing compiler for high-performance computing with Py...
research
01/07/2020

Vamsa: Tracking Provenance in Data Science Scripts

Machine learning (ML) which was initially adopted for search ranking and...
research
05/10/2021

A practical, effective calculation of gamma difference distributions with open data science tools

At present, there is still no officially accepted and extensively verifi...

Please sign up or login with your details

Forgot password? Click here to reset