Data pipelines from re-usable components
The Data Engineering Book - หนังสือวิศวกรรมข้อมูล ของคนไทย เพื่อคนไทย
A cross-platform tool for data pipelines.
python ETL framework
Serverless Data Pipeline powered by Kinesis Firehose, API Gateway, Lambd...
(project & tutorial) dag pipeline tests + ci/cd setup
Compose multimodal datasets 🎹
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
A curated list of open source tools used in analytical stacks and data e...
一个基于浏览器环境的数据采集SDK
⚡️ Next-generation data transformation framework for TypeScript that pu...
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pi...
Copy Pandas DataFrames and HDF5 files to PostgreSQL database
Learn the basics of Apache Kafka® from leaders in the Kafka community wi...