Fancy stream processing made operationally mundane
SeaTunnel is a next-generation super high-performance, distributed, mass...
Build data pipelines, the easy way 🛠️
Implementing best practices for PySpark ETL jobs and applications.
Few projects related to Data Engineering including Data Modeling, Infras...
Hamilton helps data scientists and engineers define testable, modular, s...
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Wareh...
Yet another cron alternative with a Web UI, but with much more capabilit...
A scalable general purpose micro-framework for defining dataflows. THIS ...
A Clojure high performance data processing system
A simplified, lightweight ETL Framework based on Apache Spark
Extensible data integration Java framework for building XML and non-XML ...