A powerful flow control component enabling reliability, resilience and m...
A curated list of Site Reliability and Production Engineering resources.
A curated collection of publicly available resources on how technology a...
Compilation of public failure/horror stories related to Kubernetes
Hands on labs and code to help you learn, measure, and build using archi...
Chaos Engineering Toolkit & Orchestration for Developers
A free book about developing secure and robust systems software.
A curated list of Site Reliability and Production Engineering Tools
Sample implementations for cloud design patterns found in the Azure Arch...
Production-grade retries for Python
A hosted disposable email telegram bot; Extremely privacy friendly; Prou...
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and R...
A framework for rapid development of reliable asynchronous software.
An always-on framework that performs end-to-end functional network testi...
An Open-Source Collection of 230+ Flash Cards to Help You Succeed in You...