Parallel computing with task scheduling
cuDF - GPU DataFrame Library
The portable Python dataframe library
N-D labeled arrays and datasets in Python
STUMPY is a powerful and scalable Python library for modern time series ...
Mars is a tensor-based unified framework for large-scale data computatio...
A package which efficiently applies any function to a pandas dataframe o...
A distributed task scheduler for Dask
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cu...
Eliot: the logging system that tells you *why* it happened
Python package for earth-observing satellite data processing
Scalable machine 🤖 learning for time series forecasting.
Fast data store for Pandas time-series data
Engine for ML/Data tracking, visualization, explainability, drift detect...
Pandas and Spark DataFrame comparison for humans and more!