Fancy stream processing made operationally mundane
SeaTunnel is a next-generation super high-performance, distributed, mass...
Build data pipelines, the easy way 🛠️
Implementing best practices for PySpark ETL jobs and applications.
Hamilton helps data scientists and engineers define testable, modular, s...
Few projects related to Data Engineering including Data Modeling, Infras...
Yet another cron alternative with a Web UI, but with much more capabilit...
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Wareh...
A scalable general purpose micro-framework for defining dataflows. THIS ...
A Clojure high performance data processing system
A simplified, lightweight ETL Framework based on Apache Spark
Extensible data integration Java framework for building XML and non-XML ...