Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with fault-tolerance
A curated list of projects in awesome lists tagged with fault-tolerance .
https://github.com/InterviewReady/system-design-resources
These are the best resources for System Design on the Internet
cache fault-tolerance scalability system-design
Last synced: 26 Oct 2024
https://github.com/mesos/chronos
Fault tolerant job scheduler for Mesos which handles dependencies and ISO8601 based schedules
chronos chronos-scheduler cron cron-jobs crontab fault-tolerance job-scheduler mesos mesos-framework mesosphere scheduled-jobs
Last synced: 19 Dec 2024
https://github.com/distribworks/dkron
Dkron - Distributed, fault tolerant job scheduling system https://dkron.io
cron distributed-systems fault-tolerance scheduled-jobs
Last synced: 17 Dec 2024
https://github.com/shunfei/cronsun
A Distributed, Fault-Tolerant Cron-Style Job System.
cron crontab fault-tolerance go golang job-scheduler
Last synced: 05 Nov 2024
https://github.com/bastion-rs/bastion
Highly-available Distributed Fault-tolerant Runtime
actor asynchronous concurrency distributed distributed-systems fault-tolerance hacktoberfest numa-aware rust rust-language smp supervision
Last synced: 18 Dec 2024
https://github.com/heidihoward/distributed-consensus-reading-list
A list of papers about distributed consensus.
atomic-broadcast-protocol consensus-algorithms consistency distributed-computing distributed-systems fault-tolerance paxos quorum-systems replication state-machine-replication zookeeper
Last synced: 02 Dec 2024
https://heidihoward.github.io/distributed-consensus-reading-list/
A list of papers about distributed consensus.
atomic-broadcast-protocol consensus-algorithms consistency distributed-computing distributed-systems fault-tolerance paxos quorum-systems replication state-machine-replication zookeeper
Last synced: 17 Nov 2024
https://github.com/polarismesh/polaris
Service Discovery and Governance Platform for Microservice and Distributed Architecture
authenticate circuit-break fault-tolerance health-check load-balance microservice polaris polarismesh rate-limit service-discover service-governance service-register servicemesh traffic-control
Last synced: 02 Nov 2024
https://github.com/anthdm/hollywood
Blazingly fast and light-weight Actor engine written in Golang
actor-model distributed-systems fault-tolerance golang microservices
Last synced: 19 Dec 2024
https://github.com/kraken-php/framework
Asynchronous & Fault-tolerant PHP Framework for Distributed Applications.
async async-library asynchronous fault-tolerance microservices multi-process multi-thread php service-oriented stream supervision
Last synced: 05 Nov 2024
https://github.com/lizardfs/lizardfs
LizardFS is an Open Source Distributed File System licensed under GPLv3.
c-plus-plus distributed-computing distributed-systems erasure-coding fault-tolerance geo-replication gplv3 hierarchical-storage high-availability high-performance hsm linux lizardfs macosx nas posix qos replicas replication software-defined-storage
Last synced: 21 Dec 2024
https://github.com/bakwc/PySyncObj
A library for replicating your python class between multiple servers, based on raft protocol
distributed-systems fault-tolerance python raft raft-protocol replication
Last synced: 18 Nov 2024
https://github.com/Tencent/TSeer
A high available service discovery & registration & fault-tolerance framework
fault-tolerance microservice service-discovery service-registry
Last synced: 08 Nov 2024
https://github.com/tencent/tseer
A high available service discovery & registration & fault-tolerance framework
fault-tolerance microservice service-discovery service-registry
Last synced: 20 Dec 2024
https://github.com/riot-ml/riot
An actor-model multi-core scheduler for OCaml 5 🐫
actor-model elixir erlang fault-tolerance multicore multicore-ocaml ocaml otp
Last synced: 21 Dec 2024
https://github.com/ackintosh/ganesha
:elephant: A Circuit Breaker pattern implementation for PHP applications.
circuit-breaker circuit-breaker-pattern fault-tolerance guzzle guzzle-middleware microservice microservices php
Last synced: 15 Dec 2024
https://github.com/Polly-Contrib/Simmy
Simmy is a chaos-engineering and fault-injection tool, integrating with the Polly resilience project for .NET
chaos-engineering fault-based-testing fault-injection fault-tolerance fault-tolerant polly-resilience resilience resilience-testing resiliency resilient
Last synced: 09 Nov 2024
https://github.com/golemcloud/golem
Golem is an open source durable computing platform that makes it easy to build and deploy highly reliable distributed systems.
component-model distributed-systems durable-computing durable-execution fault-tolerance high-reliability serverless wasi wasm wasm-component
Last synced: 19 Dec 2024
https://github.com/platformlab/ramcloud
**No Longer Maintained** Official RAMCloud repo
dram fault-tolerance key-value low-latency storage
Last synced: 15 Dec 2024
https://github.com/PlatformLab/RAMCloud
**No Longer Maintained** Official RAMCloud repo
dram fault-tolerance key-value low-latency storage
Last synced: 17 Nov 2024
https://github.com/ChrisWhealy/DistributedSystemNotes
Notes on Lindsey Kuper's lectures on Distributed Systems
consensus-protocol distributed-systems fault-tolerance lamport-clock paxos-protocol vector-clock
Last synced: 08 Nov 2024
https://github.com/CloudI/CloudI
A Cloud at the lowest level!
actor-model cloud cloud-computing cloudi concurrency distributed-actors distributed-systems fault-tolerance functional-reactive-programming microservices reactive scalability soa
Last synced: 20 Nov 2024
https://github.com/cloudi/cloudi
A Cloud at the lowest level!
actor-model cloud cloud-computing cloudi concurrency distributed-actors distributed-systems fault-tolerance functional-reactive-programming microservices reactive scalability soa
Last synced: 21 Dec 2024
https://github.com/infinit/infinit
The Infinit policy-based software-defined storage platform.
block-device decentralized distributed fault-tolerance filesystem fuse infinit iscsi key-value ldap nbd nfs object-storage peer-to-peer platform policies resilience s3 scalability storage
Last synced: 02 Dec 2024
https://github.com/awolden/brakes
Hystrix compliant Node.js Circuit Breaker Library
circuit-breaker circuit-breaker-library fault-tolerance health-check hystrix hystrix-dashboard
Last synced: 16 Dec 2024
https://github.com/artilleryio/chaos-lambda
Serverless chaos monkey for AWS (runs on AWS Lambda) ☁️ 💥
aws chaos-monkey fault-tolerance reliability-engineering
Last synced: 29 Nov 2024
https://github.com/haraldng/omnipaxos
OmniPaxos is a distributed log implemented as a Rust library.
consensus distributed-systems fault-tolerance paxos raft replication rust
Last synced: 30 Oct 2024
https://github.com/hhblaze/raft.net
Implementation of RAFT distributed consensus algorithm among TCP Peers on .NET / .NETStandard / .NETCore / dotnet
c-sharp consensus-algorithm core csharp dbreeze distributed dotnet-core dotnet-standard dotnetcore fault-tolerance high-availability net netcore netstandard peer raft raft-consensus raft-server redundancy tcp
Last synced: 27 Oct 2024
https://github.com/svroonland/rezilience
ZIO-native utilities for making resilient distributed systems
bulkhead circuit-breaker fault-tolerance scala zio
Last synced: 18 Nov 2024
https://github.com/Polly-Contrib/Polly.Contrib.WaitAndRetry
Polly.Contrib.WaitAndRetry is an extension library for Polly containing helper methods for a variety of wait-and-retry strategies.
dotnet fault-handler fault-tolerance jitter-formula jitter-recommendation polly resilience resiliency resiliency-patterns retry-intervals retry-pattern retry-policies retry-strategies transient-fault-handling
Last synced: 14 Nov 2024
https://github.com/irrustible/async-backplane
Simple, Erlang-inspired fault-tolerance framework for Rust Futures.
async async-await asynchronous erlang-otp fault-detection fault-tolerance recovery reliability reliability-backplane rust
Last synced: 18 Dec 2024
https://github.com/alejandro-du/vaadin-microservices-demo
A microservices example developed with Spring Cloud and Vaadin
demo eureka fault-tolerance high-availability java load-balancing microservice netflix-ribbon spring-boot vaadin zuul
Last synced: 16 Nov 2024
https://github.com/eBay/Gringofts
Gringofts makes it easy to build a replicated, fault-tolerant, high throughput and distributed event-sourced system.
cpp distributed event-sourcing fault-tolerance raft-consensus
Last synced: 08 Nov 2024
https://github.com/bastion-rs/artillery
Fire-forged cluster management & Distributed data protocol
bastion distributed-systems fault-tolerance protocol
Last synced: 19 Dec 2024
https://github.com/k8snetworkplumbingwg/bond-cni
Bond-cni is for fail-over and high availability of networking in cloudnative orchestration
active-backup alb bonding-cni cni failover fault-tolerance high-availability interface-bonding link-aggregator load-balancing networking tlb
Last synced: 21 Nov 2024
https://github.com/vinelab/http
A smart, simple and fault-tolerant HTTP client for sending and receiving JSON and XML
fault-tolerance http http-client laravel php
Last synced: 17 Dec 2024
https://github.com/theodesp/stable-systems-checklist
An opinionated list of attributes and policies that need to be met in order to establish a stable software system.
architecture continuous-delivery continuous-integration fault-tolerance reliability-engineering security
Last synced: 08 Dec 2024
https://github.com/bastion-rs/fort
Proc macro attributes for Bastion runtime.
bastion fault-tolerance fort proc-macro proc-macro-attributes rust rust-lang
Last synced: 13 Nov 2024
https://github.com/reugn/kotlin-backoff
An exponential backoff library for Kotlin
backoff exponential-backoff fault-tolerance kotlin kotlin-library retry retry-library retry-strategies
Last synced: 16 Nov 2024
https://github.com/hyperion-cs/dhaf
Distributed high availability failover, written in cross-platform C# .NET (Linux, Windows and macOS supported).
cloudflare dhaf distributed dns fault-tolerance google-cloud health-check high-availability notifications web
Last synced: 06 Dec 2024
https://github.com/leondavi/NErlNet
Nerlnet is a distributed machine learning platform for experiments and IoT deployment.
ai artificial-intelligence-projects cowboy distributed-machine-learning distributed-ml distributed-systems erlang fault-tolerance federated federated-learning federated-learning-framework iot machine-learning ml nerlnet neural-network python
Last synced: 08 Nov 2024
https://github.com/myntra/cortex
A fault-tolerant events/alerts correlation engine
alerting alerts alerts-correlation-engine boltdb cloudevents correlation event-gateway events events-correlation-engine faas fault-tolerance incident-classification incidents metrics raft rule-engine rules rules-engine
Last synced: 28 Nov 2024
https://github.com/ldmtam/raft-auto-increment
Distributed, fault-tolerant, persistent, auto-increment ID generation service with Raft consensus
autoincrement badger bitcask distributed-systems fault-tolerance golang raft-consensus
Last synced: 29 Nov 2024
https://github.com/andyobtiva/abstract_feature_branch
abstract_feature_branch is a Ruby gem that provides a variation on the Branch by Abstraction Pattern by Paul Hammant and the Feature Toggles Pattern by Martin Fowler (aka Feature Flags) to enable Continuous Integration and Trunk-Based Development.
branch-by-abstraction continuous-integration fault-tolerance feature-flags feature-flipper feature-flipping feature-rollout feature-toggles productivity rails release-management ruby trunk-based-development
Last synced: 17 Dec 2024
https://github.com/medavox/mutime
NTP time syncing library for Android
android-library fault-tolerance ntp ntp-client sntp time truetime
Last synced: 31 Oct 2024
https://github.com/gdanezis/pybft
Experiments with pBFT
byzantine consensus-algorithm cryptography distributed-systems fault-tolerance security
Last synced: 13 Oct 2024
https://github.com/andrewlee302/MIT-6.824
Implementation of MIT 6.824: Distributed Systems
course-project distributed-systems fault-tolerance golang kvstore lab paxos shard-databases
Last synced: 11 Nov 2024
https://github.com/clivern/cluster
Golang Package for System Clustering.
clivern clustering fault-tolerance hashicorp high-availability leader-election memberlist pubsub
Last synced: 16 Nov 2024
https://github.com/Clivern/Cluster
Golang Package for System Clustering.
clivern clustering fault-tolerance hashicorp high-availability leader-election memberlist pubsub
Last synced: 06 Nov 2024
https://github.com/yaroslaff/okerr-dev
Okerr hybrid (host/network) monitoring system with remote network checks, email/Telegram alerts and DynDNS fault-tolerance feature.
fault-tolerance monitoring okerr okerrupdate python telegram uptime uptime-monitor
Last synced: 07 Nov 2024
https://github.com/ks-amit/distributed-database
A travel agency app with a distributed database implemented from scratch!
bulma bulma-css distributed-database distributed-file-system distributed-storage distributed-systems django django-rest-framework fault-tolerance gfs heartbeat-service postgresql python travel-agency
Last synced: 14 Oct 2024
https://github.com/jabolina/go-mcast
Golang based implementation of the Generic Multicast protocol.
atomic-multicast fault-tolerance generic-multicast multicast state-machine-replication
Last synced: 03 Dec 2024
https://github.com/pscheidl/fortee
Jakarta EE / Java EE fault-tolerance guard leveraging the Optional pattern. Its power lies in its simplicity.
annotations cdi-extension fault-tolerance jakartaee java javaee simplicity
Last synced: 19 Nov 2024
https://github.com/sger/elixir_dropbox
Simple Dropbox v2 client for Elixir
api api-client cloud dropbox dropbox-v2 elixir elixir-dropbox elixir-lang elixir-programming-language erlang fault-tolerance filesharing
Last synced: 11 Oct 2024
https://github.com/svladykin/replicamap
Key-value Kafka Database
apache-kafka concurrent-map database fault-tolerance in-memory in-memory-database java kafka key-value multimaster replication
Last synced: 03 Dec 2024
https://github.com/cloudi/cloudi_core
CloudI internal service runtime
actor-model cloud cloud-computing cloudi concurrency distributed-actors distributed-systems elixir erlang fault-tolerance functional-reactive-programming microservices reactive scalability soa
Last synced: 08 Nov 2024
https://github.com/lenisha/torchelastic_lab
Lab to demo how to run Fault tolerant Elastic Training using Torch Elastic
aks azure-spot-vms fault-tolerance kubernetes pytorch pytorch-elastic
Last synced: 18 Oct 2024
https://github.com/hadielmougy/shield
Fault tolerance library for java
circuit-breaker fault-tolerance java rate-limiting resiliency retry throttling timeout
Last synced: 09 Nov 2024
https://github.com/samueleresca/phi-accrual-failure-detector
Phi φ Accrual Failure Detector implementation in Python, see: https://samueleresca.net/detecting-node-failures-and-the-phi-accrual-failure-detector/
accrual-failure-detector distributed-systems fault-detection fault-tolerance fault-tolerant
Last synced: 11 Nov 2024
https://github.com/rads/rsdp
🐢 [Work in Progress] Implementations of algorithms from Introduction to Reliable and Secure Distributed Programming
async clojure distributed-systems fault-tolerance networking rsdp
Last synced: 14 Dec 2024
https://github.com/txix-open/gds
An embedded distributed task scheduler for Go
cron distributed fault-tolerance go golang job-scheduler jobscheduler quartz raft scheduler
Last synced: 13 Nov 2024
https://github.com/joshnuss/httpx
A tiny concurrent & fault-tolerant HTTP server that uses pattern matching for routing
concurrency elixir fault-tolerance http pattern-matching routing
Last synced: 13 Oct 2024
https://github.com/ksaveras/circuit-breaker
Circuit Breaker pattern implementation in PHP
circuit-breaker circuit-breaker-pattern error-handling fault-tolerance microservices package php
Last synced: 17 Nov 2024
https://github.com/clivern/bull
📦Microservices Playground with Symfony 4.
elk-stack fault-tolerance microservices queue rabbitmq rest symfony
Last synced: 16 Nov 2024
https://github.com/dayvsonlima/try.js
😎 Sexy JavaScript fault-tolerant
fault-tolerance javascript ruby
Last synced: 05 Nov 2024
https://github.com/smaato/switchgear
Switchgear is a Java library providing resilience and fault tolerance error-prone calls but not compromising the performance part.
circuit-breaker fault-tolerance
Last synced: 24 Nov 2024
https://github.com/koevskinikola/byzantinerandomizedconsensus
An implementation of a textbook Byzantine Randomized Consensus protocol, using a Byzantine Reliable Broadcast protocol
asynchronous-system broadcast byzantine consensus fault-tolerance randomized-algorithm reliable
Last synced: 22 Nov 2024
https://github.com/xudong963/distributed-systems
MapReduce, Raft, KV Raft, Shared KV
fault-tolerance golang kvs-server learn-golang mapreduce papers raft
Last synced: 07 Nov 2024
https://github.com/jhegarty14/resilience4ts
resilience4ts is a functional, distributed-first fault tolerance library for TypeScript inspired by resilience4j and Polly
bulkhead cache circuit-breaker concurrency distributed-locking distributed-systems fallback fault-tolerance hedge nestjs rate-limiter resilience retry timeout
Last synced: 02 Nov 2024
https://github.com/pytorch-labs/torchft
PyTorch per step fault tolerance (actively under development)
fault-tolerance ml pytorch torch training
Last synced: 22 Nov 2024
https://github.com/kresil/kresil
Kotlin Multiplatform fault-tolerance library with Ktor integration
circuit-breaker fault-tolerance kotlin-multiplatform kotlin-multiplatform-library ktor resilience resiliency-patterns retry-strategies
Last synced: 27 Nov 2024
https://github.com/cloudi/cloudi_service_queue
Persistent Queue Service
cloud fault-tolerance microservices
Last synced: 08 Nov 2024
https://github.com/alexkch/key-value-db
Key-Value Database with fault tolerance
c c89 fault-tolerance key-value-database
Last synced: 19 Nov 2024
https://github.com/cloudi/cloudi_api_python
Python CloudI API
actor-model cloud cloud-computing cloudi-api concurrency distributed-actors distributed-systems fault-tolerance functional-reactive-programming microservices python python2 python3 reactive scalability soa
Last synced: 08 Nov 2024
https://github.com/cloudi/website
https://cloudi.org
actor-model cloud cloud-computing cloudi concurrency distributed-actors distributed-systems fault-tolerance functional-reactive-programming microservices reactive scalability soa website
Last synced: 08 Nov 2024
https://github.com/peterkrauz/pistis
Transparent, easily-configurable, strongly-consistent distributed state-machine replicas.
distributed-systems elixir erlang fault-tolerance raft state-machine-replication
Last synced: 29 Oct 2024
https://github.com/cloudi/cloudi_api_ocaml
OCaml CloudI API
actor-model cloud cloud-computing cloudi-api concurrency distributed-actors distributed-systems fault-tolerance functional-reactive-programming microservices ocaml reactive scalability soa
Last synced: 08 Nov 2024
https://github.com/arthurkushman/faultol
Fault Tolerance component consumer to save static html files on disk from RabbitMQ
consumer fault-tolerance go golang rabbitmq rabbitmq-consumer static-site static-site-generator
Last synced: 05 Nov 2024
https://github.com/melchisedech333/verbum-language
🟣 A programming language focused on the development of complex systems. It supports the creation of systems involving the concepts of distributed computing, parallel computing, concurrent computing, meta-programming, hot code reload, high fault tolerance, and scalability.
complex-networks complex-systems concurrent-programming distributed-programming distributed-systems fault-tolerance free-and-open-source-software horizontal-scalable hot-code-reload imperative metaprogramming multiplatform multiprocessing open-source parellel-programming procedural programming-language scalable structured systems-engineering
Last synced: 08 Dec 2024
https://github.com/shifuml/shifu-tensorflow
Distributed Tensorflow on Shifu Pipeline
fault-tolerance machine-learning pipeline tensorflow yarn
Last synced: 19 Nov 2024
https://github.com/ashenblade/taskflux
Отказоустойчивый распределенный менеджер очередей сообщений с приоритетами
csharp dotnet fault-tolerance queue-manager raft raft-consensus-algorithm
Last synced: 19 Nov 2024
https://github.com/alphahydrae/elixir-brownbag
Playing with Elixir: functional, concurrent, distributed & fault-tolerant programming
brownbag concurrency elixir erlang fault-tolerance
Last synced: 19 Nov 2024
https://github.com/mwufi/decent-dungeon
Adventure peer-to-peer dungeon game Adventure
artificial-intelligence clockchain decentralized deep-learning distributed dockchain dungeon dungeon-crawler-game dungeons-and-dragons enterprise-software fault-tolerance fochchain game game-engine incremental interactive-fiction machine-learning proof-of-concept robustness terminal-based
Last synced: 07 Nov 2024
https://github.com/mattisonchao/celestial
Distributed k-v/streaming storage system
fault-tolerance kv-store streaming
Last synced: 16 Oct 2024
https://github.com/mattisonchao/efs
Easy, fast storage
distributed-systems fault-tolerance key-value streaming
Last synced: 16 Oct 2024
https://github.com/kumuluz/kumuluzee-fault-tolerance
KumuluzEE Fault Tolerance extension for implementing fault tolerance mechanisms, including circuit breakers, fallbacks, retries, timeouts and other mechanisms for decoupling microservices.
circuit-breaker cloud-native fault-tolerance java javaee kumuluzee microprofile microservices
Last synced: 05 Nov 2024
https://github.com/hmmhmmhm/stably
🌳 Implement fault tolerance in node js. (Automatic restart, migration, fault notification)
fault-tolerance javascript migration nodejs notification reboot restart
Last synced: 08 Nov 2024
https://github.com/aveek-saha/orca-strator
A container orchestration system for scalable APIs
api autosca container-orchestration docker fault-tolerance load-balancer microservices node-js scalability
Last synced: 05 Nov 2024
https://github.com/manifoldfinance/tla-spec
tla++ proofs and verifications
distributed fault-tolerance pcal tla tlaplus
Last synced: 23 Nov 2024
https://github.com/yaroslaff/sensor
sensor for all remote checks for okerr monitoring
check fault-tolerance monitoring network remote sensor
Last synced: 07 Nov 2024
https://github.com/cloudi/cloudi_service_db_mysql
MySQL CloudI Service
cloud fault-tolerance microservices
Last synced: 08 Nov 2024
https://github.com/cloudi/cloudi_service_router
CloudI Router Service
cloud fault-tolerance microservices
Last synced: 08 Nov 2024
https://github.com/cloudi/cloudi_service_http_cowboy
cowboy 2.x HTTP/HTTPS CloudI Service
cloud fault-tolerance microservices
Last synced: 08 Nov 2024
https://github.com/cloudi/cloudi_service_api_requests
CloudI Service API requests (JSON-RPC/Erlang-term support)
cloud fault-tolerance microservices
Last synced: 08 Nov 2024
https://github.com/cloudi/cloudi_service_http_cowboy1
cowboy 1.x HTTP/HTTPS CloudI Service
cloud fault-tolerance microservices
Last synced: 08 Nov 2024
https://github.com/cloudi/cloudi_api_haskell
Haskell CloudI API
actor-model cloud cloud-computing cloudi-api concurrency distributed-actors distributed-systems fault-tolerance functional-reactive-programming haskell microservices reactive scalability soa
Last synced: 08 Nov 2024
https://github.com/cloudi/cloudi_service_health_check
Health Check CloudI Service
cloud fault-tolerance microservice
Last synced: 08 Nov 2024
https://github.com/cloudi/cloudi_service_filesystem
Filesystem CloudI Service
cloud fault-tolerance microservices
Last synced: 08 Nov 2024