Real Time Analytics and Data Pipelines based on Spark Streaming
Web tool for Kafka Connect
Kafka Connect HDFS connector
Big Data Ecosystem Docker
StorageTapper is a scalable realtime MySQL change data streaming, logica...
Fundamentals of Spark with Python (using PySpark), code examples
Divolte Collector
API and command line interface for HDFS
Weather radar data processing - Python package
🎉🎉🐳 Datawhale Introduction to Big Data Processing tutorial | An introductory course for the big data technology track 🎉🎉
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries...
Python interface to the TileDB storage engine
ElasticCTR, the PaddlePaddle elastic computing recommendation system, is a Kubernetes-based enterprise-grade recommendation system, open source...
DC/OS SDK is a collection of tools, libraries, and documentation for eas...
HDFS Shell is an HDFS manipulation tool to work with functions integrated...