Chaos Engineering
Chaos engineering is the discipline of experimenting on a software system in production in order to build confidence in the systemβs capability to withstand turbulent and unexpected conditions. Chaos engineering is a disciplined approach to identifying failures before they become outages
- GitHub: https://github.com/topics/chaos-engineering
- Wikipedia: https://en.wikipedia.org/wiki/Chaos_engineering
- Related Topics: sre, testing,
- Last updated: 2026-06-27 00:04:44 UTC
- JSON Representation
https://github.com/chaostoolkit/chaostoolkit-bundler
Bundle the Chaos Toolkit CLI and all the drivers/plugins into one standalone binary for Linux, MacOSX and Windows
chaos-engineering chaostoolkit
Last synced: 27 Apr 2026
https://github.com/chaos-mesh/datasource
Grafana data source plugin for Chaos Mesh.
chaos-engineering chaos-mesh data-source grafana-datasource grafana-plugin
Last synced: 10 Apr 2025
https://github.com/chaostoolkit-incubator/chaostoolkit-gremlin
Gremlin attack support for the Chaos Toolkit
chaos-engineering chaostoolkit-extension chaostoolkit-gremlin gremlin
Last synced: 08 May 2025
https://github.com/steadybit/extension-kong
A Steadybit attack implementation to inject HTTP faults into Kong API gateway.
api-gateway chaos chaos-engineering chaos-testing http kong nginx rest
Last synced: 10 Mar 2026
https://github.com/chaostoolkit-incubator/chaostoolkit-saltstack
Chaos Toolkit driver extension for SaltStack
chaos-engineering chaostoolkit chaostoolkit-extension saltstack
Last synced: 08 May 2025
https://github.com/steadybit/extension-aws
A Steadybit discovery and action implementation to inject faults into various AWS services.
attack aws chaos chaos-engineering cloud lambda rds resilience
Last synced: 18 May 2026
https://github.com/nawafswe/mockchaos
Mock HTTP/gRPC servers with chaos simulation. Configure latency, status codes, and failures via JSON for tests and Kubernetes deployments.
chaos-engineering engineering-platforms golang grpc http mock-server
Last synced: 24 Dec 2025
https://github.com/chaostoolkit-incubator/chaostoolkit-toxiproxy
Chaos Toolkit Driver for Toxiproxy
chaos-engineering chaostoolkit chaostoolkit-extension toxiproxy
Last synced: 08 May 2025
https://github.com/steadybit/extension-kubernetes
A Steadybit extension to check the state of the Kubernetes cluster and inject faults.
chaos-engineering chaos-testing helm kubernetes reliability
Last synced: 24 Feb 2026
https://github.com/chaostoolkit-incubator/kubernetes-crd
Kubernetes CRD for the Chaos Toolkit
chaos-engineering chaostoolkit kubernetes kubernetes-controller
Last synced: 15 Jul 2025
https://github.com/steadybit/extension-datadog
A Steadybit check implementation for data exposed through Datadog.
chaos-engineering chaos-testing check datadog monitoring observability probe
Last synced: 21 Feb 2026
https://github.com/mikhailknyazev/kube-course
Main resources for Udemy course "Configuring Kubernetes for Reliability with LitmusChaos"
chaos-engineering eks helm kubernetes litmuschaos pipelines reliability terraform
Last synced: 15 Apr 2025
https://github.com/young-ook/terraform-aws-fis
Terraform Module: AWS FIS
aws chaos-engineering resilience terraform
Last synced: 12 Apr 2025
https://github.com/grafana/xk6-disruptor-demo
A complete demo of xk6-disruptor
Last synced: 19 Oct 2025
https://github.com/steadybit/extension-prometheus
A Steadybit check implementation to gather and verify the result PromQL queries.
chaos-engineering chaos-testing kubernetes monitoring prometheus promql
Last synced: 10 Mar 2026
https://github.com/steadybit/extension-postman
A Steadybit extension to execute Postman collections via Postman Cloud Api
chaos-engineering chaos-testing cloud postman
Last synced: 10 Mar 2026
https://github.com/gittorre/cloudbedlamlinuxn
CloudBedlam for Linux -- Native (C++) Impl: CloudBedlam is a simple, configurable, machine-local chaotic operation orchestrator for resiliency experimentation inside virtual and physical machines. This version is for Linux machines.
c chaos chaos-engineering cpp linux resiliency virtual-machine
Last synced: 19 Apr 2025
https://github.com/azure/platform-chaos-api
An API for introducing chaos into Azure PaaS offerings using configurable extensions π©
azure chaos-engineering nodejs
Last synced: 07 Oct 2025
https://github.com/denniszielke/resilient-cloud-apps
Demonstrating resilient application patterns in the Azure cloud
application-insights azure chaos-engineering kubernetes
Last synced: 06 Apr 2025
https://github.com/mlafeldt/sysrq
Go client to the Linux SysRq interface
chaos-engineering golang linux sysrq
Last synced: 02 Apr 2025
https://github.com/chaostoolkit-incubator/chaostoolkit-google-cloud-platform
Chaos Toolkit extension for Google Cloud Platform
chaos-engineering chaostoolkit chaostoolkit-extension gcp google-cloud-platform
Last synced: 08 May 2025
https://github.com/zeljkox/react-native-pseudo-localization
React Native Pseudo Localization is small package that enables pseudolocalization
chaos-engineering i10n i18n internationalization language pseudo pseudo-localization react-native
Last synced: 22 Jun 2025
https://github.com/qainsights/learn-chaos-engineering-series
Learn Chaos Engineering Series
chaos chaos-engineering chaos-experiments chaostoolkit litmuschaos
Last synced: 17 Feb 2026
https://github.com/chaostoolkit-incubator/chaostoolkit-slack
Chaos Toolkit Plugin for Slack
chaos-engineering chaostoolkit chaostoolkit-integration slack
Last synced: 08 May 2025
https://github.com/k-krew/omen
A lightweight, declarative chaos engineering operator for Kubernetes
chaos-engineering chaos-testing controller-runtime fault-injection golang helm-chart kubebuilder kubernetes kubernetes-operator reliability sre
Last synced: 26 Apr 2026
https://github.com/chaostoolkit-incubator/chaostoolkit-cloud-foundry
Cloud Foundry extension for the Chaos Toolkit
chaos-engineering chaostoolkit-extension cloudfoundry pcf
Last synced: 08 May 2025
https://github.com/snapp-incubator/chaos-operator
A kubernetes operator for chaos engineering
chaos chaos-engineering chaos-operator k8s kubernetes kubernetes-operator operator testing testing-tools toxiproxy
Last synced: 15 Jul 2025
https://github.com/clivern/apes
π¨ Chaos and Resiliency Testing Service.
apes chaos chaos-engineering developer-tools failure-detection microservices microservices-architecture reverse-proxy
Last synced: 19 Apr 2025
https://github.com/chaostoolkit-incubator/chaostoolkit-google-cloud
[DEPRECATED] Chaos Extension for the Google Cloud Engine platform
chaos-engineering chaostoolkit chaostoolkit-extension gce google-cloud
Last synced: 22 Apr 2025
https://github.com/kxk-4498/venafi-test-wizard
A testing tool for cert-manager in Kubernetes
certificate chaos chaos-engineering chaos-testing crd external-issuer fault-injection go golang kubernetes microservices pki tls x509certificates
Last synced: 14 Jan 2026
https://github.com/chaostoolkit-incubator/chaostoolkit-opentracing
Chaos Toolkit extension for Open Tracing and Open Telemetry
chaos-engineering chaostoolkit chaostoolkit-extension jaeger-tracer opentracing
Last synced: 08 May 2025
https://github.com/techprimers/defensive-demo
Defensive Demo to test Chaos Engineering using Chaos Toolkit and Chaos Monkey for Spring Boot
chaos-engineering chaos-monkey chaos-monkey-spring-boot chaos-toolkit defensive-demo spring-boot
Last synced: 12 Oct 2025
https://github.com/buraksenyurt/resistance
It is a library containing functions that create a chaotic environment for durability tests in web app environments.
chaos-engineering distributed-systems dotnet-core resiliency
Last synced: 10 Apr 2025
https://github.com/steadybit/reliability-hub-db
Database containing the content for Steadybit's Reliability Hub
chaos-engineering chaos-testing database reliability resilience steadybit
Last synced: 14 Apr 2025
https://github.com/chaostoolkit/chaostoolkit-extension-template
Template starting point for a new Python-based Chaos Toolkit extension
chaos-engineering chaostoolkit
Last synced: 01 Sep 2025
https://github.com/krokz/ansible-doom
Deploy Ansible configurations by killing enemies in DOOM!
ansible chaos-engineering doom
Last synced: 18 May 2026
https://github.com/upgundecha/chaos-testing-demo
This is an example project prepared for demonstrating Chaos Engineering experiment on a Spring boot application using Chaos Monkey and ChaosToolkit
chaos-engineering chaos-monkey chaos-test chaos-testing chaostoolkit spring-boot
Last synced: 30 Jul 2025
https://github.com/chaostoolkit-incubator/chaostoolkit-humio
Extension for integrating with Humio
chaos-engineering chaostoolkit chaostoolkit-extension chaostoolkit-integration humio
Last synced: 08 May 2025
https://github.com/chaostoolkit-incubator/chaostoolkit-wiremock
Chaos Toolkit driver for the WireMock service API
chaos-engineering chaostoolkit chaostoolkit-extension wiremock
Last synced: 08 May 2025
https://github.com/garugaru/flaw
CLI for injecting failures on api calls for local chaos engineering
chaos chaos-engineering failure-injection
Last synced: 27 Jun 2025
https://github.com/litmuschaos/backstage-plugin
Backstage plugin where you can find all the information you need to run chaos engineering on Litmus in one place
backstage backstage-plugin chaos-engineering litmus litmuschaos
Last synced: 03 Mar 2026
https://github.com/rickcau/azure-chaos-studio
azure-chaos-studio chaos-engineering
Last synced: 23 Jan 2026
https://github.com/dgzlopes/traefik-fault-injection
Traefik Middleware Plugin - Fault Injection via HTTP headers
chaos-engineering delay failures fault-injection traefik traefik-plugin
Last synced: 03 Apr 2025
https://github.com/localstack-samples/sample-chaos-serverless-multi-region-failover
Demonstrates chaos engineering in a multi-region serverless application using API Gateway, Lambda, DynamoDB, and Route53. Test resilience, automated failover, and data integrity with LocalStack's Chaos API.
chaos-engineering chaos-testing dynamodb failover lambda localstack-sample-app multi-region resiliency-engineering route53 serverless
Last synced: 08 Sep 2025
https://github.com/houseofcat/gremlins.demo
Demonstration of Gremlins in the code creating chaos!
chaos-engineering chaos-testing csharp failure-injection failure-injection-testing fault-injection fault-tolerant net netcore
Last synced: 15 Aug 2025
https://github.com/TrollScripts/cpu-troll
Part of the larger OpenChaos Suite, dedicated to raising CPU latency by the requested percentage and timespan.
chaos chaos-engineering cpu-load golang
Last synced: 09 Jul 2025
https://github.com/mattyopon/faultray
Zero-risk infrastructure chaos simulation β 5 engines, 2000+ scenarios, 3-Layer availability proof. No production fault injection.
availability chaos-engineering devops infrastructure python resilience simulation sre
Last synced: 02 Apr 2026
https://github.com/richardbolt/shrike
The Shrike is a Chaos HTTP/WebSocket proxy that impales victims on the Tree of Pain by selectively routing traffic with path based rules via Toxiproxy for TCP level mischief
chaos chaos-engineering go http-proxy
Last synced: 16 Aug 2025
https://github.com/iskorotkov/chaos-scheduler
Service for automatic generation and scheduling of the chaos test workflows
backend chaos-engineering docker frontend go kubernetes web
Last synced: 09 Apr 2026
https://github.com/ibra-kdbra/math-problems
Math problems I found nice enough to implement, beginning with easy problems, then I will add some intermediate, advanced problems.
ai art chaos-engineering chaos-theory lorenz-attractor mandelbrot-set math randomization sfml2
Last synced: 01 May 2025
https://github.com/copyleftdev/rustocache
π¦ The Ultimate High-Performance Caching Library for Rust - 100x faster than JavaScript alternatives with sub-microsecond latencies, memory safety, and chaos engineering
async benchmarking cache caching chaos-engineering high-performance lru memory-safety performance redis rust simd stampede-protection tokio zero-copy
Last synced: 20 Apr 2026
https://github.com/sotirisalfonsos/chaos-master
The chaos master is part of a chaos project and provides an api to send fault injections to the chaos bots
chaos chaos-engineering chaos-master
Last synced: 14 Jan 2026
https://github.com/steadybit/event-kit
The Steadybit EventKit enables extensions to consume Steadybit events (similar to web hooks).
chaos-engineering chaos-testing experiment extension steadybit tags webhook
Last synced: 25 Jun 2025
https://github.com/chaos-mesh/tidb-chaos
Recording the method, process, and results of chaos engineering testing of TiDB by using Chaos Mesh.
chaos-engineering chaos-mesh tidb
Last synced: 21 Jan 2026
https://github.com/josephwoodward/chaosproxy
A simple means of introducing chaos into your infrastructure for Windows, Mac and Linux.
Last synced: 20 Jul 2025
https://github.com/cemakan/pastaay
Cloud native chaos engineering toolkit for Go. Inject faults across HTTP, gRPC, Kafka, Redis, SQL, and MongoDB. YAML defined policies, multi model Oracle AI (DeepSeek, Gemini, Anthropic, OpenAI), built in blast radius guard, and a drag and drop web console with live telemetry.
aiops chaos-engineering deepseek fault-injection gitops golang kafka kubernetes-operator llm mongodb observability prometheus rabbitmq resilience telemetry yaml
Last synced: 29 May 2026
https://github.com/visible/cruel
chaos testing with zero mercy
chaos chaos-engineering chaos-testing fault-injection nodejs reliability resilience testing typescript
Last synced: 28 Jan 2026
https://github.com/kiko-g/internet-of-everything
FEUP DS | Large Scale Software Development | 2021/22
chaos-engineering digital-twin edge-computing failure-detection internet-of-everything iot
Last synced: 17 Mar 2025
https://github.com/steadybit/action-kit
The Steadybit ActionKit enables the extension of Steadybit with new action capabilities that you can use within experiments.
attack chaos-engineering experiment extension healthcheck load-testing steadybit
Last synced: 14 Apr 2025
https://github.com/waldekmastykarz/dev-proxy-aspire
Use Dev Proxy extensions for .NET Aspire to seamlessly integrate Dev Proxy into your distributed applications.
api-testing aspire chaos-engineering dev-proxy developer-tools development devtools http microsoft-365 mock-server openapi proxy resilience rest
Last synced: 15 May 2025
https://github.com/adamdecaf/badnet
Simulating bad networks without injecting net.Conn
chaos-engineering fault-injection network
Last synced: 08 Nov 2025
https://github.com/pixelfactoryio/crashlooper
A simple container that crash π₯ after a set amount of time β°
Last synced: 01 Mar 2026
https://github.com/iskorotkov/chaos-workflows
Service providing REST API for working with Argo workflows
chaos-engineering docker go kubernetes
Last synced: 10 Apr 2026
https://github.com/rberrelleza/intro-to-chaos-engineering
Learn about Chaos Engineering!
chaos-engineering cloudnative docker litmuschaos
Last synced: 02 May 2026
https://github.com/chaosync-org/awesome-ai-agent-testing
π€ A curated list of resources for testing AI agents - frameworks, methodologies, benchmarks, tools, and best practices for ensuring reliable, safe, and effective autonomous AI systems
agent-evaluation agentic-ai ai-agents ai-benchmark ai-safety artificial-intelligence awesome-list benchmark chaos chaos-engineering chaos-monkey evaluation llm llm-evaluation machine-learning qa quality-assurance testing testing-tools
Last synced: 29 Jun 2025
https://github.com/sysaulab/libseedy
Portable seedless random generator
chaos chaos-engineering chaos-theory prng rng
Last synced: 10 Mar 2026
https://github.com/arun0009/go-resilience-mock
Chaos engineering in a box. A high-performance mock server to test your API's resilience against latency, failures, and resource exhaustion
chaos-engineering cpu-stress fault-injection go golang http-mock mock-server observability prometheus resilience-testing sre
Last synced: 13 Jan 2026
https://github.com/chaostoolkit-incubator/chaostoolkit-instana
Extension for working with Instana (https://www.instana.com/) from the Chaos Toolkit
chaos-engineering chaostoolkit chaostoolkit-extension
Last synced: 01 May 2026
https://github.com/mattermost-community/mattermost-app-chaosengine
An integration with Mattermost to run Chaos GameDays
Last synced: 14 Apr 2025
https://github.com/k0rventen/macaque
a kubernetes chaos monkey implementation in go
chaos-engineering chaos-monkey golang kubernetes
Last synced: 17 May 2026
https://github.com/steadybit/extension-container
A Steadybit extension for container based actions (discovery / attacks)
attack chaos-engineering container fault helm kubernetes network process stress
Last synced: 13 May 2026
https://github.com/florextech/chaos-internet-simulator
Break your network before your users do. Local chaos proxy + dashboard + CLI for testing unstable internet and APIs.
chaos-engineering developer-tools docker fastify open-source proxy react typescript
Last synced: 02 May 2026
https://github.com/gunnargrosch/failure-cloudfunctions
Module for failure injection into Cloud Functions
chaos chaos-engineering cloudfunctions failure failure-injection gcp google
Last synced: 12 Jan 2026
https://github.com/githubfoam/chaos-k8s-sandbox
chaos engineering kubernetes pipeline
chaos-engineering chaos-mesh chaos-testing devsecops k3d kind kubernetes microk8s minikube observability site-reliability-engineering travis-ci
Last synced: 27 Apr 2026
https://github.com/mendrugory/pod-chaos-monkey
Pod chaos monkey is a PoC of a chaos engineering for Kubernetes which will help us to test the reliability of our system. It will killed pod, in a desired namespace in a schedule.
chaos chaos-engineering chaos-monkey kubernetes
Last synced: 27 Apr 2026
https://github.com/steadybit/extension-host
A Steadybit extension for host based actions (discovery / attacks)
attack chaos-engineering fault helm host kubernetes network process stress timetravel
Last synced: 12 Jun 2026
https://github.com/iskorotkov/chaos-framework
Chaos Framework is a platform for easy resilience testing in Kubernetes
argo chaos-engineering docker kubernetes litmus
Last synced: 24 Apr 2026
https://github.com/steadybit/extension-gcp
A Steadybit discovery and action implementation to inject faults into various Google Cloud services.
attack chaos chaos-engineering gcp google google-cloud google-cloud-platform resilience vm
Last synced: 20 Apr 2026
https://github.com/gsscoder/chaos-appservices
This repository is meant to share my chaos engineering experiments to achieve knowledge in designing resilient systems.
azure chaos-engineering csharp demo dotnet
Last synced: 17 Apr 2026
https://github.com/sauravbhattacharya001/ai
Contract-enforced sandbox for studying AI agent self-replication safety
agent-safety ai-agents ai-governance ai-safety anomaly-detection chaos-engineering containment docker forensics game-theory kill-switch prompt-injection python red-teaming replication-control research safety-research sandboxing self-replication threat-modeling
Last synced: 23 Apr 2026
https://github.com/briandenicola/eshoponaks
An AKS workshop based on the eShop application
aks architecture cert-manager chaos-engineering dotnet eshop istio openai opentelemetry prometheus
Last synced: 05 Mar 2026
https://github.com/sbalnojan/ai-chaos-awesome
Awesome list for AI chaos engineering: experiments, evaluations, guardrails & observability for LLM/RAG.
ai-chaos-engineering awesome awesome-list chaos-engineering evals llm mlops rag red-teaming reliability
Last synced: 07 Sep 2025
https://github.com/abhishek010397/elkwithchaos
chaos-engineering chaos-monkey elk-stack java testing video
Last synced: 04 Sep 2025
https://github.com/githubfoam/gremlin-travisci
gremlin chaos engineering
chaos-engineering devsecops gremlin observability pipeline site-reliability-engineering
Last synced: 07 Jan 2026
https://github.com/steadybit/steadybit-debug
steadybit-debug collects data from installed Steadybit platforms and agents to aid in customer support
chaos-engineering debugging steadybit
Last synced: 14 Apr 2025
https://github.com/kinfinity/distributed-resilience
Patterns for building Resilient Distributed Systems in Golang
chaos-engineering distributed-systems golang-module resilience
Last synced: 01 Feb 2026
https://github.com/konstruktoid/disruella
A very small digitalized primate responsible for randomly preventing something from continuing as usual or as expected.
chaos-engineering hacktoberfest high-availability python-black python3 resilience sre systemd test-automation
Last synced: 16 Feb 2026
https://github.com/geekxflood/amctl
amctl is a command line to interact with AlertManager API
alertmanager api chaos-engineering go golang observability prometheus testing testing-tools
Last synced: 11 Mar 2026
https://github.com/karimsa/frenzie
Collection of chaos hooks for your Node.js programs.
chaos-engineering hooks javascript library testing
Last synced: 24 Oct 2025
https://github.com/chaostoolkit-incubator/chaostoolkit-datadog
Chaos Toolkit extension for DataDog
chaos-engineering chaostoolkit chaostoolkit-extension datadog
Last synced: 22 Feb 2026
https://github.com/githubfoam/gremlin-githubactions
gremlin githubactions chaos engineering devsecops site-reliability-engineering observability
chaos-engineering devsecops githubactions gremlin observability site-reliability-engineering
Last synced: 23 Feb 2026
https://github.com/steadybit/extension-azure
A Steadybit discovery and action implementation to inject faults into various Azure services.
attack azure chaos chaos-engineering kubernetes-services resilience scalesets virtual-machine vm
Last synced: 21 Feb 2026
https://github.com/jsabo/aws-lambda-failure-flag-app
This project demonstrates how to integrate Gremlin Failure Flags into an AWS Lambda function, enabling you to simulate injected latency and exceptions while measuring processing performance. Itβs a serverless demo for testing resiliency and fault injection in real-world cloud environments.
aws chaos-engineering gremlin lambda performance-testing reliability-engineering
Last synced: 08 Jul 2025
https://github.com/rebound-how/rebound
The open source toolbox for resilient operations
agentic-ai ai chaos-engineering chaostoolkit devops mcp-server reliability-engineering reliability-tools resilience resilience-testing sre
Last synced: 08 Jul 2025
https://github.com/ngvuthdanhh/certificate-chaos-engineering-practitioner-gremlin
Notes, labs, research, and certificate for the Chaos Engineering Practitioner Gremlin program. Learn chaos principles, failure injection, observability, and resilience patterns.
chaos-engineering cloud-security distributed-systems fault-injection githublearning gremlin observability reliability resilience
Last synced: 25 Jan 2026
https://github.com/portefaix/portefaix-chaos
Portefaix Chaos
chaos-engineering chaos-experiments chaos-testing galactus kubernetes portefaix
Last synced: 11 Jul 2025
https://github.com/officialasishkumar/streamforge
Self-hosted real-time event analytics platform β Kafka, Postgres, S3 archive, full chaos and perf regression suite
analytics chaos-engineering distributed-systems event-driven event-streaming golang kafka kubernetes observability postgresql
Last synced: 17 May 2026
https://github.com/iskorotkov/chaos-muxy
Experiment with Muxy proxy for causing network failures in target apps
chaos-engineering docker golang muxy
Last synced: 17 May 2026
https://github.com/iskorotkov/chaos-client
Simple app used in chaos testing as a potential target. It can ping specified server at a fixed rate using /counter endpoint
chaos-engineering docker golang
Last synced: 18 May 2026