Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
awesome-distributed-system-projects
🚀 List of distributed system projects for inspiration and learning to build distributed services from real world examples
https://github.com/roma-glushko/awesome-distributed-system-projects
Last synced: about 4 hours ago
JSON representation
-
Databases
-
- Jocko - a Kafka/distributed commit log service in Go. [Serf + Raft
- oklog - a distributed and coordination-free log management system for big ol' clusters [Archived
- elasticell - a distributed HA Redis-compatible NoSQL database with strong consistency and reliability
- Apache HBase - a Hadoop database, a distributed, scalable, big data store
- Tair - a high-performance and high-availability distributed fast-access memory (MDB)/persistent (LDB) storage service
- immudb - an immutable database based on zero trust, Key/Value & SQL, tamperproof, data change history
- CouchDB - a highly available, partition tolerant, eventually consistent document database - master setups with automatic conflict detection.
- toydb - distributed SQL database in Rust, written as a learning project
- DB3 Network - a decentralized firebase firestore alternative
- ZODB - an ACID transactional object-oriented database
- requiemdb - a permanent storage for OTEL data
-
Key-Value Databases
- memcached - a high performance multithreaded event-based key/value cache store intended to be used in a distributed system
- redis - an in-memory database with various value types that persists on disk
- TiKV - a distributed transactional key-value database, originally created to complement TiDB
- leveldb - a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values
- goleveldb - a LevelDB implemented in Golang
- summitdb - an in-memory, NoSQL key/value database. It persists to disk, uses the Raft consensus algorithm, is ACID compliant, and built on a transactional and strongly-consistent model - defined JS scripting
- pupdb - a simple file-based key-value database
- pickledb - an open source key-value store using Python's json module
- KeyDB - a faster drop-in multithreaded alternative to Redis
- Dragonfly - an in-memory data store fully compatible with Redis and Memcache and designed using modern algorithms
- BadgerDB - an embeddable, persistent and fast key-value (KV) database written in pure Go
- BuntDB - a low-level, in-memory, key/value store in pure Go.
- ConstDB - a redis-like cache store that implements CRDTs and active-active replications.
- GhostDB - a distributed, in-memory, general purpose key-value data store that delivers microsecond performance at any scale
- Hive - a lightweight and blazing fast key-value database written in pure Dart. Inspired by Bitcask
- PumpkinDB - an immutable Ordered Key-Value Database Engine
- FlashDB - a simple, in-memory, key/value store in pure Go.
- Lazer - a PHP flat file database based on JSON files
- Scribble - a tiny JSON database in Golang
- FlyDB - a high-performance KV storage engine based on bitcask paper supports redis protocol and the corresponding data structure
- Engula - a distributed key-value store, used as a cache, database, and storage engine
- Dice - an extremely simple Golang-based in-memory KV store that speaks Redis dialect
-
Relational, SQL, NewSQL Databases
- CockroachDB - a distributed fault-tolerant SQL database built on a transactional and strongly-consistent key-value store
- YugabyteDB - a cloud native distributed SQL database for mission-critical applications
- RQLite - a lightweight, distributed relational database, which uses SQLite as its storage engine
- Kingbus - a distributed MySQL binlog store based on raft [Raft
- YDB is an open-source Distributed SQL Database that combines high availability and scalability with strict consistency and ACID transactions
- RadonDB - an open source, Cloud-native MySQL database for unlimited scalability and performance
-
NoSQL, Document Databases
- MongoDB - document database designed for ease of development and scaling
- FerretDB - an proxy, converting the MongoDB 6.0+ wire protocol queries to SQL - using PostgreSQL as a database engine
- LiteDB - NoSQL Document Store in a single data file
- tinydb - a lightweight document oriented database written in pure Python
- SleekDB - a simple flat file NoSQL like database implemented in PHP without any third-party dependencies that store data in plain JSON files
- BonsaiDB - an ACID, transactional KV or document dev-friendly database with configurable delayed on-disk data storing
- CloverDB - a lightweight document-oriented NoSQL database written in pure Golang
- SleekDB - a simple flat file NoSQL like database implemented in PHP without any third-party dependencies that store data in plain JSON files
-
Graph Databases
-
Time Series
- VictoriaMetrics - fast, cost-effective monitoring solution and time series database
- influxdb - scalable datastore for metrics, events, and real-time analytics
- trino - fast distributed SQL query engine for big data analytics
- Apache Doris - an easy-to-use, high performance and unified analytics database
- tstorage is a lightweight local on-disk storage engine for time-series data with a straightforward API
- CnosDB is a high-performance, high-compression, and easy-to-use open-source distributed time-series database. Used in fields such as IoT, industrial internet, connected cars, and IT operations
- LinDB - a scalable, high performance, high availability distributed time series database
- FiloDB - a distributed, prometheus-compatible, real-time, in-memory, massively scalable, multi-schema time series / event / operational database
-
Column Databases
-
Permission Databases
-
Analytical Databases
-
Vectors
- Qdrant - a vector similarity search engine and vector database
- milvus - an open-source vector database built to power embedding similarity search and AI applications
- Weaviate - an open source vector database that stores both objects and vectors
- tobias-mayer/vector-db - a simple vector database that can be used to search for similar vectors in logarithmic time
- DANNY - a decentralized vector database for building vector search applications
-
-
Gateways
-
Vectors
- Glide - an open reliable fast LLM/model gateway for rapid development of GenAI apps
- Traefik - a cloud-native app proxy
- Kong - a cloud-native feature-rich API gatewat
- Skipper - an HTTP router and reverse proxy for service composition
- janus - a lightweight API gateway and management platform
- Lura - ultra performance API gateway with middlewares
- MLFLow Gateway - an LLM proxy
-
-
Locking
-
Streaming
-
Vectors
- Kafka - a distributed, highly scalable, elastic, fault-tolerant, and secure event streaming platform
- faust - a distributed stream processing library that ports the ideas from Kafka Streams to Python
- Liftbridge - a lightweight, fault-tolerant message streams by implementing a durable stream augmentation for the NATS messaging system
- RisingWave - a distributed SQL database for stream processing, designed to reduce the complexity and cost of building real-time applications
-
-
Schedulers
-
Queues
-
Vectors
- nsq - realtime fault tolerant distributed messaging platform designed to operate at scale, handling billions of messages per day [Raft + gRPC
- Sandglass - distributed, horizontally scalable, persistent, time ordered message queue
- dnpipes - distributed version of Unix named pipes comparable to AWS SQS
- GatewayWorker - distributed realtime messaging framework based on workerman
- ZeroMQ - abstraction of asynchronous message queues, multiple messaging patterns, message filtering (subscriptions), seamless access to multiple transport protocols and more
- Apache Pulsar - distributed pub-sub messaging platform with a very flexible messaging model and an intuitive client API
- Apache ActiveMQ - high performance Apache 2.0 licensed Message Broker
-
-
Search Engines
-
Vectors
- ElasticSearch - distributed, RESTful search and analytics engine
- Apache Lucene - a high-performance, full featured text search engine library
- MeiliSearch - Lightning Fast, Ultra Relevant, and Typo-Tolerant Search Engine
- FlexSearch - memory-flexible full-text search library
- RiotSearch - distributed, Simple and efficient full text search engine
- Typesense - fast, typo tolerant, fuzzy search engine
- Sonic - fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM
-
-
File Systems
-
Vectors
- JuiceFS - Hadoop-compatible AWS S3-compatible high-performance POSIX file system
- GlusterFS - distributed storage that can scale to several petabytes
- GlusterFS - highly reliable, scalable and efficient distributed file system
- SeaweedFS - a simple Hadoop-compatible AWS S3-compatible distributed highly scalable distributed file system
-
-
Service Discovery
-
Data Processing
-
TerminusDB
-
OS
-
Frameworks
-
Vectors
- gosiris - an actor framework for Golang
- cotyledon - a framework for defining long-running services - notify, rate limiting for worker spawning, and more.
- atomix - fully featured framework for building fault-tolerant distributed systems [REST + Raft
- orbit - virtual actor framework for building distributed systems
- hemera - A Node.js microservices toolkit for the NATS messaging system [RPC
- Tooz - centralizing the most common distributed primitives like group membership protocol, lock service and leader election by providing a coordination API helping developers to build distributed applications
- Nebula - powerful framework for building highly concurrent, distributed, and resilient message-driven applications
- Service Weaver - A framework that allows to write applications as modular binary and deploy it as a set of microservices
- Dapr - portable, serverless, event-driven runtime that works as a sidecar and makes it easy for developers to build resilient, stateless and stateful microservices
-
-
Components
-
Vectors
- Dragonboat - a high performance multi-group Raft consensus library in pure Go
- Golimit - Uber ringpop based distributed and decentralized rate limiter
- Tenacity - general-purpose retrying library
- ex_hash_ring - pure Elixir consistent hash ring implementation based on the excellent C hash-ring lib
- raft - Raft consensus implementation
- NuRaft - Raft implementation derived from the cornerstone project
- Hyx - Lightweight fault tolerance primitives for your resilient and modern Python microservices
- Migdalor - a Kubernetes native peer discovery for Python asyncio nodes
- skiplist - a Golang implementation of the skiplist data structure
- Waltz - a quorum-based distributed write-ahead log for replicating transactions
- raft - Raft consensus implementation
-
-
Other Resources
-
Vectors
- awesome-scalability - Reading list for illustrating the patterns of scalable, reliable, and performant large-scale systems
- awesome-distributed-systems - curated list on awesome material on distributed systems
- awesome-database-learning - a list of learning materials to understand databases internals
- Writing a sqlite clone from scratch in C
- Build Your Own Redis with C/C++
- Berkley CS186: Intro into Database Systems
- MIT 6.830: Database Systems
-
Programming Languages
Categories
Sub Categories
Keywords
database
63
distributed-systems
36
go
28
golang
25
raft
14
key-value
14
nosql
13
sql
13
distributed-database
13
rust
11
python
10
kubernetes
10
search-engine
9
json
9
cloud-native
9
java
9
distributed
8
redis
8
consensus
8
kafka
7
search
7
vector-database
6
big-data
6
messaging
6
cpp
6
storage
6
vector-search
6
microservice
5
in-memory
5
high-performance
5
pubsub
5
mlops
4
time-series
4
metrics
4
full-text-search
4
cache
4
etcd
4
graph
4
nearest-neighbor-search
4
graph-database
4
streaming
3
real-time
3
high-availability
3
serverless
3
ai
3
microservices
3
replication
3
hive
3
grpc
3
similarity-search
3