A curated list of awesome big data frameworks, ressources and other awes...
Miller is like awk, sed, cut, join, and sort for name-indexed data such ...
Fancy stream processing made operationally mundane
The data warehouse for operational workloads.
🌊 Online machine learning in Python
Readyset is a MySQL and Postgres wire-compatible caching layer that sits...
Utils for streaming large files (S3, HDFS, gzip, bz2...)
Open-source graph database, built for real-time streaming data, compatib...
Pravega - Streaming as a new software defined storage primitive
A lightweight stream processing library for Go
Trill is a single-node query processor for temporal or streaming data.
Real-time stream processing for python
Python Stream Processing
📐 Pushing the boundaries of simplicity
⚡ Single-pass algorithms for statistics