Database Subsetting and Relational Data Browsing Tool.
Open-source data observability for analytics engineers.
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Wareh...
Synmetrix – open source semantic layer / Boost your LLM precision
Free and open source schema versioning and database migration made nativ...
CLI tool for dbt users to simplify creation of staging models (yml and s...
A set of UDFs and Procedures to extend BigQuery, Snowflake, Redshift, Po...
AWS Data/MLServices sample code & notes for my LinkedIn Learning courses
A SQL port of python's scikit-learn preprocessing module, provided as cr...
Pytest Fixtures that let you actually test against external resource (Po...
Simple DDL Parser to parse SQL (HQL, TSQL, AWS Redshift, BigQuery, Snowf...
A full data warehouse infrastructure with ETL pipelines running inside d...
A color temperature setting library and CLI that operates in a similar w...
A Data Platform built for AWS, powered by Kubernetes.
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated ...