Minimalist Data Wrangling with Python

11/09/2022
by   Marek Gagolewski, et al.
0

Minimalist Data Wrangling with Python is envisaged as a student's first introduction to data science, providing a high-level overview as well as discussing key concepts in detail. We explore methods for cleaning data gathered from different sources, transforming, selecting, and extracting features, performing exploratory data analysis and dimensionality reduction, identifying naturally occurring data clusters, modelling patterns in data, comparing data between groups, and reporting the results. This textbook is a non-profit project. Its online and PDF versions are freely available at https://datawranglingpy.gagolewski.com/.

READ FULL TEXT
research
05/04/2022

DADApy: Distance-based Analysis of DAta-manifolds in Python

DADApy is a python software package for analysing and characterising hig...
research
04/02/2021

DataPrep.EDA: Task-Centric Exploratory Data Analysis for Statistical Modeling in Python

Exploratory Data Analysis (EDA) is a crucial step in any data science pr...
research
12/29/2022

Deep R Programming

Deep R Programming is a comprehensive course on one of the most popular ...
research
02/08/2021

PyAutoFit: A Classy Probabilistic Programming Language for Model Composition and Fitting

A major trend in academia and data science is the rapid adoption of Baye...
research
07/14/2016

8th European Conference on Python in Science (EuroSciPy 2015)

The 8th edition of the European Conference on Python in Science, EuroSci...
research
07/02/2016

Text comparison using word vector representations and dimensionality reduction

This paper describes a technique to compare large text sources using wor...
research
03/30/2022

Error Identification Strategies for Python Jupyter Notebooks

Computational notebooks – such as Jupyter or Colab – combine text and da...

Please sign up or login with your details

Forgot password? Click here to reset