Miller is like awk, sed, cut, join, and sort for name-indexed data such ...
A collection of handy Bash One-Liners and terminal tricks for data proce...
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLM...
List of libraries, tools and APIs for web scraping and data processing.
A GPU-accelerated library containing highly optimized building blocks an...
Select, put and delete data from JSON, TOML, YAML, XML and CSV files wit...
A light-weight, flexible, and expressive statistical data testing library
Toolkit for Machine Learning, Natural Language Processing, and Text Gene...
Large-scale pretraining for dialogue
Concurrent and multi-stage data ingestion and data processing with Elixir
Extract Transform Load for Python 3.5+
Source code accompanying book: Data Science on the Google Cloud Platform...
Python Stream Processing
Kubernetes-native platform to run massively parallel data/streaming jobs
Google Cloud Dataflow provides a simple, powerful model for building bot...