Fast, efficient, and scalable distributed map/reduce system, DAG executi...
A search engine which can hold 100 trillion lines of log data.
AIStore: scalable storage for AI applications
Kubernetes-native platform to run massively parallel data/streaming jobs
Efficient transducers for Julia
Fundamentals of Spark with Python (using PySpark), code examples
Data science and Big Data with Python
Fast & furious GroupBy operations for dask.array
Prosto is a data processing toolkit radically changing how data is proce...
Efficient and scalable parallelism using the message passing interface (...
Data-parallelism on CUDA using Transducers.jl and for loops (FLoops.jl)
The core parallel and shared memory library used by Hack, Flow, and Pyre