Fast, Scientific and Numerical Computing for the JVM (NDArrays)
A large-scale entity and relation database supporting aggregation of pro...
ApacheCN 开源组织:公告、介绍、成员、活动、交流方式
Elassandra = Elasticsearch + Apache Cassandra
Official code repository for GATK versions 4 and up
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Mach...
Distributed Deep learning with Keras & Spark
A Scala kernel for Jupyter
后端 (Java Golang)全栈知识架构体系总结
:dart: :star2:[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及...
Implementing best practices for PySpark ETL jobs and applications.
Machine Learning Platform and Recommendation Engine built on Kubernetes
MLeap: Deploy ML Pipelines to Production
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cu...
The Internals of Apache Spark