t-Digest data structure in Python. Useful for percentiles and quantiles,...
Uniffle is a high performance, general purpose Remote Shuffle Service.
Dynamic execution framework for your Redis data
Cascading is a feature rich API for defining and executing complex and f...
Compass is a task diagnosis platform for bigdata
Behemoth is an open source platform for large scale document analysis ba...
Firestorm is a Remote Shuffle Service, and provides the capability for A...
🎉🎉🐳 Datawhale大数据处理导论教程 | 大数据技术方向的开篇课程🎉🎉
:zap: 6.824: Distributed Systems (Spring 2017). A course which present a...
An easy-to-use Map Reduce Go parallel-computing framework inspired by 20...
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Companion to Learning Hadoop and Learning Spark courses on Linked In Lea...
A in-process MapReduce library to help you optimizing service response t...
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Hadoop, MapReduce Distributed Crawling of Data Information from All Chin...