The leading data integration platform for ETL / ELT data pipelines from ...
Kedro is a toolbox for production-ready data science. It uses software e...
SeaTunnel is a next-generation super high-performance, distributed, mass...
The enterprise-grade behavioral data engine (web, mobile, server-side, w...
Infinitely scalable, event-driven, language-agnostic orchestration and s...
Flink CDC is a streaming data integration tool
Privacy and Security focused Segment-alternative, in Golang and React
A list of useful resources to learn Data Engineering from scratch
An open-source data logging library for machine learning models and data...
task management & automation tool
A lightweight stream processing library for Go
BitSail is a distributed high-performance data integration engine which ...
Source code accompanying book: Data Science on the Google Cloud Platform...
Open-source data observability for analytics engineers.
Smarter data pipelines for audio.