An open API service indexing awesome lists of open source software.

Chaos Engineering

Chaos engineering is the discipline of experimenting on a software system in production in order to build confidence in the system’s capability to withstand turbulent and unexpected conditions. Chaos engineering is a disciplined approach to identifying failures before they become outages

https://github.com/chaostoolkit/chaostoolkit-bundler

Bundle the Chaos Toolkit CLI and all the drivers/plugins into one standalone binary for Linux, MacOSX and Windows

chaos-engineering chaostoolkit

Last synced: 27 Apr 2026

https://github.com/chaos-mesh/datasource

Grafana data source plugin for Chaos Mesh.

chaos-engineering chaos-mesh data-source grafana-datasource grafana-plugin

Last synced: 10 Apr 2025

https://github.com/steadybit/extension-kong

A Steadybit attack implementation to inject HTTP faults into Kong API gateway.

api-gateway chaos chaos-engineering chaos-testing http kong nginx rest

Last synced: 10 Mar 2026

https://github.com/steadybit/extension-aws

A Steadybit discovery and action implementation to inject faults into various AWS services.

attack aws chaos chaos-engineering cloud lambda rds resilience

Last synced: 18 May 2026

https://github.com/nawafswe/mockchaos

Mock HTTP/gRPC servers with chaos simulation. Configure latency, status codes, and failures via JSON for tests and Kubernetes deployments.

chaos-engineering engineering-platforms golang grpc http mock-server

Last synced: 24 Dec 2025

https://github.com/steadybit/extension-kubernetes

A Steadybit extension to check the state of the Kubernetes cluster and inject faults.

chaos-engineering chaos-testing helm kubernetes reliability

Last synced: 24 Feb 2026

https://github.com/steadybit/extension-datadog

A Steadybit check implementation for data exposed through Datadog.

chaos-engineering chaos-testing check datadog monitoring observability probe

Last synced: 21 Feb 2026

https://github.com/mikhailknyazev/kube-course

Main resources for Udemy course "Configuring Kubernetes for Reliability with LitmusChaos"

chaos-engineering eks helm kubernetes litmuschaos pipelines reliability terraform

Last synced: 15 Apr 2025

https://github.com/grafana/xk6-disruptor-demo

A complete demo of xk6-disruptor

chaos-engineering k6 testing

Last synced: 19 Oct 2025

https://github.com/steadybit/extension-prometheus

A Steadybit check implementation to gather and verify the result PromQL queries.

chaos-engineering chaos-testing kubernetes monitoring prometheus promql

Last synced: 10 Mar 2026

https://github.com/steadybit/extension-postman

A Steadybit extension to execute Postman collections via Postman Cloud Api

chaos-engineering chaos-testing cloud postman

Last synced: 10 Mar 2026

https://github.com/gittorre/cloudbedlamlinuxn

CloudBedlam for Linux -- Native (C++) Impl: CloudBedlam is a simple, configurable, machine-local chaotic operation orchestrator for resiliency experimentation inside virtual and physical machines. This version is for Linux machines.

c chaos chaos-engineering cpp linux resiliency virtual-machine

Last synced: 19 Apr 2025

https://github.com/azure/platform-chaos-api

An API for introducing chaos into Azure PaaS offerings using configurable extensions 🌩

azure chaos-engineering nodejs

Last synced: 07 Oct 2025

https://github.com/denniszielke/resilient-cloud-apps

Demonstrating resilient application patterns in the Azure cloud

application-insights azure chaos-engineering kubernetes

Last synced: 06 Apr 2025

https://github.com/mlafeldt/sysrq

Go client to the Linux SysRq interface

chaos-engineering golang linux sysrq

Last synced: 02 Apr 2025

https://github.com/zeljkox/react-native-pseudo-localization

React Native Pseudo Localization is small package that enables pseudolocalization

chaos-engineering i10n i18n internationalization language pseudo pseudo-localization react-native

Last synced: 22 Jun 2025

https://github.com/chaostoolkit-incubator/chaostoolkit-google-cloud

[DEPRECATED] Chaos Extension for the Google Cloud Engine platform

chaos-engineering chaostoolkit chaostoolkit-extension gce google-cloud

Last synced: 22 Apr 2025

https://github.com/techprimers/defensive-demo

Defensive Demo to test Chaos Engineering using Chaos Toolkit and Chaos Monkey for Spring Boot

chaos-engineering chaos-monkey chaos-monkey-spring-boot chaos-toolkit defensive-demo spring-boot

Last synced: 12 Oct 2025

https://github.com/buraksenyurt/resistance

It is a library containing functions that create a chaotic environment for durability tests in web app environments.

chaos-engineering distributed-systems dotnet-core resiliency

Last synced: 10 Apr 2025

https://github.com/steadybit/reliability-hub-db

Database containing the content for Steadybit's Reliability Hub

chaos-engineering chaos-testing database reliability resilience steadybit

Last synced: 14 Apr 2025

https://github.com/chaostoolkit/chaostoolkit-extension-template

Template starting point for a new Python-based Chaos Toolkit extension

chaos-engineering chaostoolkit

Last synced: 01 Sep 2025

https://github.com/krokz/ansible-doom

Deploy Ansible configurations by killing enemies in DOOM!

ansible chaos-engineering doom

Last synced: 18 May 2026

https://github.com/upgundecha/chaos-testing-demo

This is an example project prepared for demonstrating Chaos Engineering experiment on a Spring boot application using Chaos Monkey and ChaosToolkit

chaos-engineering chaos-monkey chaos-test chaos-testing chaostoolkit spring-boot

Last synced: 30 Jul 2025

https://github.com/garugaru/flaw

CLI for injecting failures on api calls for local chaos engineering

chaos chaos-engineering failure-injection

Last synced: 27 Jun 2025

https://github.com/litmuschaos/backstage-plugin

Backstage plugin where you can find all the information you need to run chaos engineering on Litmus in one place

backstage backstage-plugin chaos-engineering litmus litmuschaos

Last synced: 03 Mar 2026

https://github.com/dgzlopes/traefik-fault-injection

Traefik Middleware Plugin - Fault Injection via HTTP headers

chaos-engineering delay failures fault-injection traefik traefik-plugin

Last synced: 03 Apr 2025

https://github.com/localstack-samples/sample-chaos-serverless-multi-region-failover

Demonstrates chaos engineering in a multi-region serverless application using API Gateway, Lambda, DynamoDB, and Route53. Test resilience, automated failover, and data integrity with LocalStack's Chaos API.

chaos-engineering chaos-testing dynamodb failover lambda localstack-sample-app multi-region resiliency-engineering route53 serverless

Last synced: 08 Sep 2025

https://github.com/TrollScripts/cpu-troll

Part of the larger OpenChaos Suite, dedicated to raising CPU latency by the requested percentage and timespan.

chaos chaos-engineering cpu-load golang

Last synced: 09 Jul 2025

https://github.com/mattyopon/faultray

Zero-risk infrastructure chaos simulation β€” 5 engines, 2000+ scenarios, 3-Layer availability proof. No production fault injection.

availability chaos-engineering devops infrastructure python resilience simulation sre

Last synced: 02 Apr 2026

https://github.com/richardbolt/shrike

The Shrike is a Chaos HTTP/WebSocket proxy that impales victims on the Tree of Pain by selectively routing traffic with path based rules via Toxiproxy for TCP level mischief

chaos chaos-engineering go http-proxy

Last synced: 16 Aug 2025

https://github.com/iskorotkov/chaos-scheduler

Service for automatic generation and scheduling of the chaos test workflows

backend chaos-engineering docker frontend go kubernetes web

Last synced: 09 Apr 2026

https://github.com/ibra-kdbra/math-problems

Math problems I found nice enough to implement, beginning with easy problems, then I will add some intermediate, advanced problems.

ai art chaos-engineering chaos-theory lorenz-attractor mandelbrot-set math randomization sfml2

Last synced: 01 May 2025

https://github.com/copyleftdev/rustocache

πŸ¦€ The Ultimate High-Performance Caching Library for Rust - 100x faster than JavaScript alternatives with sub-microsecond latencies, memory safety, and chaos engineering

async benchmarking cache caching chaos-engineering high-performance lru memory-safety performance redis rust simd stampede-protection tokio zero-copy

Last synced: 20 Apr 2026

https://github.com/sotirisalfonsos/chaos-master

The chaos master is part of a chaos project and provides an api to send fault injections to the chaos bots

chaos chaos-engineering chaos-master

Last synced: 14 Jan 2026

https://github.com/steadybit/event-kit

The Steadybit EventKit enables extensions to consume Steadybit events (similar to web hooks).

chaos-engineering chaos-testing experiment extension steadybit tags webhook

Last synced: 25 Jun 2025

https://github.com/chaos-mesh/tidb-chaos

Recording the method, process, and results of chaos engineering testing of TiDB by using Chaos Mesh.

chaos-engineering chaos-mesh tidb

Last synced: 21 Jan 2026

https://github.com/josephwoodward/chaosproxy

A simple means of introducing chaos into your infrastructure for Windows, Mac and Linux.

chaos-engineering go golang

Last synced: 20 Jul 2025

https://github.com/cemakan/pastaay

Cloud native chaos engineering toolkit for Go. Inject faults across HTTP, gRPC, Kafka, Redis, SQL, and MongoDB. YAML defined policies, multi model Oracle AI (DeepSeek, Gemini, Anthropic, OpenAI), built in blast radius guard, and a drag and drop web console with live telemetry.

aiops chaos-engineering deepseek fault-injection gitops golang kafka kubernetes-operator llm mongodb observability prometheus rabbitmq resilience telemetry yaml

Last synced: 29 May 2026

https://github.com/steadybit/action-kit

The Steadybit ActionKit enables the extension of Steadybit with new action capabilities that you can use within experiments.

attack chaos-engineering experiment extension healthcheck load-testing steadybit

Last synced: 14 Apr 2025

https://github.com/waldekmastykarz/dev-proxy-aspire

Use Dev Proxy extensions for .NET Aspire to seamlessly integrate Dev Proxy into your distributed applications.

api-testing aspire chaos-engineering dev-proxy developer-tools development devtools http microsoft-365 mock-server openapi proxy resilience rest

Last synced: 15 May 2025

https://github.com/adamdecaf/badnet

Simulating bad networks without injecting net.Conn

chaos-engineering fault-injection network

Last synced: 08 Nov 2025

https://github.com/pixelfactoryio/crashlooper

A simple container that crash πŸ’₯ after a set amount of time ⏰

chaos-engineering

Last synced: 01 Mar 2026

https://github.com/iskorotkov/chaos-workflows

Service providing REST API for working with Argo workflows

chaos-engineering docker go kubernetes

Last synced: 10 Apr 2026

https://github.com/chaosync-org/awesome-ai-agent-testing

πŸ€– A curated list of resources for testing AI agents - frameworks, methodologies, benchmarks, tools, and best practices for ensuring reliable, safe, and effective autonomous AI systems

agent-evaluation agentic-ai ai-agents ai-benchmark ai-safety artificial-intelligence awesome-list benchmark chaos chaos-engineering chaos-monkey evaluation llm llm-evaluation machine-learning qa quality-assurance testing testing-tools

Last synced: 29 Jun 2025

https://github.com/sysaulab/libseedy

Portable seedless random generator

chaos chaos-engineering chaos-theory prng rng

Last synced: 10 Mar 2026

https://github.com/arun0009/go-resilience-mock

Chaos engineering in a box. A high-performance mock server to test your API's resilience against latency, failures, and resource exhaustion

chaos-engineering cpu-stress fault-injection go golang http-mock mock-server observability prometheus resilience-testing sre

Last synced: 13 Jan 2026

https://github.com/chaostoolkit-incubator/chaostoolkit-instana

Extension for working with Instana (https://www.instana.com/) from the Chaos Toolkit

chaos-engineering chaostoolkit chaostoolkit-extension

Last synced: 01 May 2026

https://github.com/mattermost-community/mattermost-app-chaosengine

An integration with Mattermost to run Chaos GameDays

chaos-engineering

Last synced: 14 Apr 2025

https://github.com/k0rventen/macaque

a kubernetes chaos monkey implementation in go

chaos-engineering chaos-monkey golang kubernetes

Last synced: 17 May 2026

https://github.com/steadybit/extension-container

A Steadybit extension for container based actions (discovery / attacks)

attack chaos-engineering container fault helm kubernetes network process stress

Last synced: 13 May 2026

https://github.com/florextech/chaos-internet-simulator

Break your network before your users do. Local chaos proxy + dashboard + CLI for testing unstable internet and APIs.

chaos-engineering developer-tools docker fastify open-source proxy react typescript

Last synced: 02 May 2026

https://github.com/gunnargrosch/failure-cloudfunctions

Module for failure injection into Cloud Functions

chaos chaos-engineering cloudfunctions failure failure-injection gcp google

Last synced: 12 Jan 2026

https://github.com/mendrugory/pod-chaos-monkey

Pod chaos monkey is a PoC of a chaos engineering for Kubernetes which will help us to test the reliability of our system. It will killed pod, in a desired namespace in a schedule.

chaos chaos-engineering chaos-monkey kubernetes

Last synced: 27 Apr 2026

https://github.com/steadybit/extension-host

A Steadybit extension for host based actions (discovery / attacks)

attack chaos-engineering fault helm host kubernetes network process stress timetravel

Last synced: 12 Jun 2026

https://github.com/iskorotkov/chaos-framework

Chaos Framework is a platform for easy resilience testing in Kubernetes

argo chaos-engineering docker kubernetes litmus

Last synced: 24 Apr 2026

https://github.com/steadybit/extension-gcp

A Steadybit discovery and action implementation to inject faults into various Google Cloud services.

attack chaos chaos-engineering gcp google google-cloud google-cloud-platform resilience vm

Last synced: 20 Apr 2026

https://github.com/gsscoder/chaos-appservices

This repository is meant to share my chaos engineering experiments to achieve knowledge in designing resilient systems.

azure chaos-engineering csharp demo dotnet

Last synced: 17 Apr 2026

https://github.com/sbalnojan/ai-chaos-awesome

Awesome list for AI chaos engineering: experiments, evaluations, guardrails & observability for LLM/RAG.

ai-chaos-engineering awesome awesome-list chaos-engineering evals llm mlops rag red-teaming reliability

Last synced: 07 Sep 2025

https://github.com/at15/bop-bpf

Box of pain implementation using eBPF

chaos-engineering

Last synced: 11 Jun 2025

https://github.com/steadybit/steadybit-debug

steadybit-debug collects data from installed Steadybit platforms and agents to aid in customer support

chaos-engineering debugging steadybit

Last synced: 14 Apr 2025

https://github.com/kinfinity/distributed-resilience

Patterns for building Resilient Distributed Systems in Golang

chaos-engineering distributed-systems golang-module resilience

Last synced: 01 Feb 2026

https://github.com/konstruktoid/disruella

A very small digitalized primate responsible for randomly preventing something from continuing as usual or as expected.

chaos-engineering hacktoberfest high-availability python-black python3 resilience sre systemd test-automation

Last synced: 16 Feb 2026

https://github.com/geekxflood/amctl

amctl is a command line to interact with AlertManager API

alertmanager api chaos-engineering go golang observability prometheus testing testing-tools

Last synced: 11 Mar 2026

https://github.com/karimsa/frenzie

Collection of chaos hooks for your Node.js programs.

chaos-engineering hooks javascript library testing

Last synced: 24 Oct 2025

https://github.com/githubfoam/gremlin-githubactions

gremlin githubactions chaos engineering devsecops site-reliability-engineering observability

chaos-engineering devsecops githubactions gremlin observability site-reliability-engineering

Last synced: 23 Feb 2026

https://github.com/steadybit/extension-azure

A Steadybit discovery and action implementation to inject faults into various Azure services.

attack azure chaos chaos-engineering kubernetes-services resilience scalesets virtual-machine vm

Last synced: 21 Feb 2026

https://github.com/jsabo/aws-lambda-failure-flag-app

This project demonstrates how to integrate Gremlin Failure Flags into an AWS Lambda function, enabling you to simulate injected latency and exceptions while measuring processing performance. It’s a serverless demo for testing resiliency and fault injection in real-world cloud environments.

aws chaos-engineering gremlin lambda performance-testing reliability-engineering

Last synced: 08 Jul 2025

https://github.com/ngvuthdanhh/certificate-chaos-engineering-practitioner-gremlin

Notes, labs, research, and certificate for the Chaos Engineering Practitioner Gremlin program. Learn chaos principles, failure injection, observability, and resilience patterns.

chaos-engineering cloud-security distributed-systems fault-injection githublearning gremlin observability reliability resilience

Last synced: 25 Jan 2026

https://github.com/officialasishkumar/streamforge

Self-hosted real-time event analytics platform β€” Kafka, Postgres, S3 archive, full chaos and perf regression suite

analytics chaos-engineering distributed-systems event-driven event-streaming golang kafka kubernetes observability postgresql

Last synced: 17 May 2026

https://github.com/iskorotkov/chaos-muxy

Experiment with Muxy proxy for causing network failures in target apps

chaos-engineering docker golang muxy

Last synced: 17 May 2026

https://github.com/iskorotkov/chaos-client

Simple app used in chaos testing as a potential target. It can ping specified server at a fixed rate using /counter endpoint

chaos-engineering docker golang

Last synced: 18 May 2026