Official repository of Trino, the distributed SQL query engine for big d...
The Metadata Platform for your Data Stack
The most widely used Python to C compiler
StarRocks, a Linux Foundation project, is a next-generation sub-second M...
A fast, scalable, high performance Gradient Boosting on Decision Trees l...
Apache Beam is a unified programming model for Batch and Streaming data ...
An open-source storage framework that enables building a Lakehouse archi...
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Pla...
Apache Storm
Web-based notebook that enables data-driven, interactive data analytics ...
Cloud-native search engine for observability. An open-source alternative...
Arkime is an open source, large scale, full packet capturing, indexing, ...
Data-Centric Pipelines and Data Versioning
Seamless multi-master syncing database with an intuitive HTTP/JSON API, ...
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, ...