Best 12 Map Reduce Open Source Projects

Fast, efficient, and scalable distributed map/reduce system, DAG executi...

A search engine which can hold 100 trillion lines of log data.

AIStore: scalable storage for AI applications

Kubernetes-native platform to run massively parallel data/streaming jobs

Efficient transducers for Julia

Fundamentals of Spark with Python (using PySpark), code examples

Data science and Big Data with Python

Fast & furious GroupBy operations for dask.array

Prosto is a data processing toolkit radically changing how data is proce...

Efficient and scalable parallelism using the message passing interface (...

Data-parallelism on CUDA using Transducers.jl and for loops (FLoops.jl)

The core parallel and shared memory library used by Hack, Flow, and Pyre