Feathr – A scalable, unified data and AI engineering platform for enterp...
data load tool (dlt) is an open source Python library that makes data lo...
Datart is a next generation Data Visualization Open Platform
Distributed DataFrame for Python designed for the cloud, powered by Rust
:bar_chart: :clipboard: Dashboards using YAML or JSON files
An Awesome List of Open-Source Data Engineering Projects
Implementing best practices for PySpark ETL jobs and applications.
Hamilton helps data scientists and engineers define testable, modular, s...
Few projects related to Data Engineering including Data Modeling, Infras...
MLRun is an open source MLOps platform for quickly building and managing...
Quilt is a data mesh for connecting people with actionable data
Source code accompanying book: Data Science on the Google Cloud Platform...
Clean APIs for data cleaning. Python implementation of R package Janitor
A comprehensive list of 180+ YouTube Channels for Data Science, Data En...
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Wareh...