Apache Doris is an easy-to-use, high-performance and unified analytics d...
Official repository of Trino, the distributed SQL query engine for big d...
StarRocks, a Linux Foundation project, is a next-generation sub-second M...
An open-source storage framework that enables building a Lakehouse archi...
Create full-fledged APIs for slowly moving datasets without writing a si...
A native Rust library for Delta Lake, with bindings into Python
This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytic...
Amazon SageMaker Local Mode Examples
The Lakehouse Engine is a configuration driven Spark framework, written ...
The Internals of Delta Lake
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
Lakehouse storage system benchmark
DeltaOMS is a solution that helps build a centralized repository of Delta...
Books and Papers in Mathematics, Econometrics, Machine Learning, Financ...