Implementing best practices for PySpark ETL jobs and applications.
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Wareh...
Mass processing data with a complete ETL for .net developers
Provides guidance for fast ETL jobs, an IDataReader implementation for S...