☄️ Python's nested data operator (and CLI), for all your declarative res...
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cu...
Logical Replication extension for PostgreSQL 15, 14, 13, 12, 11, 10, 9.6...
Scalable identity resolution, entity resolution, data mastering and dedu...
A block-based API for NSValueTransformer, with a growing collection of u...
Optimus is an easy-to-use, reliable, and performant workflow orchestrato...
Microsoft Program Synthesis using Examples SDK is a framework of technol...
:lipstick: Durable and asynchronous data imports for consuming data at s...
Advanced and Fast Data Transformation in R
Like awk but with SQL and table joins
Low-code Python library to safely use notebooks in production: schedule ...
📄 Concise selector to extract JSON from HTML.
An Extensible Suite of High-Performance and Low-Dependency Packages for ...
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
A simple Spark-powered ETL framework that just works 🍺