Easy-to-use library to bring TensorFlow to Apache Spark
Data Accelerator for Apache Spark simplifies onboarding to Streaming of ...
Notes on Apache Spark (PySpark)
A Spark Atlas connector to track data lineage in Apache Atlas
A guide on how to set up Jupyter with PySpark painlessly on AWS EC2 clus...
A pure Python implementation of Apache Spark's RDD and DStream interfaces.
Apache Spark™ and Scala Workshops
This is an archive of the SparkRDMA project. The new repository with RDMA shuff...
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
A complete example of a big data application using: Kubernetes (kops/aw...
Various cheatsheets in PDF
Apache Spark Connector for Azure Cosmos DB
Toolkit for Apache Spark ML for feature clean-up, feature importance cal...
Profile and monitor your ML data pipeline end-to-end
Use the TPC-DS benchmark to test Spark SQL performance