A powerful flow control component enabling reliability, resilience and m...
A curated list of Site Reliability and Production Engineering resources.
A curated collection of publicly available resources on how technology a...
Compilation of public failure/horror stories related to Kubernetes
Hands on labs and code to help you learn, measure, and build using archi...
Chaos Engineering Toolkit & Orchestration for Developers
It's just fascinating. How is modern software designed? 🤔 Some design-l...
A free book about developing secure and robust systems software.
A curated list of Site Reliability and Production Engineering Tools
Sample implementations for cloud design patterns found in the Azure Arch...
Production-grade retries for Python
A hosted disposable email telegram bot; Extremely privacy friendly; Prou...
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and R...
An always-on framework that performs end-to-end functional network testi...
A framework for rapid development of reliable asynchronous software.