Apache Spark (PySpark) Practice on Real Data
Generate relevant synthetic data quickly for your projects. The Databri...
MorphL Community Edition uses big data and machine learning to predict u...
Big Data Processing Framework - Unified Data API or SQL on Any Storage
LearningApacheSpark
Isolation Forest on Spark
Apache Spark Connector for Azure Cosmos DB
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Toolkit for Apache Spark ML for Feature clean-up, feature Importance cal...
HandySpark - bringing pandas-like capabilities to Spark dataframes
Research project aimed to classify the best stock research posts from r/...
Updated repository
Open Source Contributor Index
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google...