专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...
SeaTunnel is a next-generation super high-performance, distributed, mass...
An open-source storage framework that enables building a Lakehouse archi...
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Pla...
A Flexible and Powerful Parameter Server for large-scale machine learning
Alluxio, data orchestration for analytics and machine learning in the cloud
Web-based notebook that enables data-driven, interactive data analytics ...
macOS development environment setup: Easy-to-understand instructions wi...
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, ...
Python SQL Parser and Transpiler
Simple and Distributed Machine Learning
🧙 The modern replacement for Airflow. Build, run, and manage data pipel...
PipelineAI
the portable Python dataframe library
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.