Parallel computing with task scheduling
cuDF - GPU DataFrame Library
Koalas: pandas API on Apache Spark
STUMPY is a powerful and scalable Python library for modern time series ...
Extract data from a wide range of Internet sources into a pandas DataFrame.
A distributed task scheduler for Dask
Clean APIs for data cleaning. Python implementation of R package Janitor
PyData, The Complete Works of
RFC document, tooling and other content related to the array API standard
A consistent table management library in python
Resources for Advancing into Analytics: From Excel to R and Python by Ge...
Notebooks for the Seattle PyData 2017 talk on Scattertext
Social network analysis code examples for PyCon 2019 talk
Machine learning with scikit-learn tutorial at PyData Chicago 2016
Introduction to Machine Learning with Time Series at PyData Festival Ams...