Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with sre

A curated list of projects in awesome lists tagged with sre.

https://github.com/bregman-arie/devops-exercises

Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions

ansible aws azure coding containers devops docker git interview interview-questions kubernetes linux openstack production-engineer prometheus python sql sre terraform

Last synced: 29 Sep 2024

https://github.com/n1trux/awesome-sysadmin

A curated list of amazingly awesome open-source sysadmin resources.

awesome awesome-list devops list ops self-hosted software sre sysadmin

Last synced: 30 Jul 2024

https://github.com/upgundecha/howtheysre

A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)

alerting chaos-engineering dev-ops devops hacktoberfest hacktoberfest-accepted incident-management incident-response infrastructure ml-ops monitoring observability on-call post-mortem reliability security site-reliability-engineering software-engineering sre sre-culture

Last synced: 30 Sep 2024

https://github.com/linkedin/school-of-sre

At LinkedIn, we use this curriculum to onboard our entry-level talent into the SRE role.

git hadoop linux mysql networking nosql python security sre system-design

Last synced: 30 Sep 2024

https://github.com/runatlantis/atlantis

Terraform Pull Request Automation

atlantis automation devops go golang hacktoberfest sre tacos terraform

Last synced: 29 Sep 2024

https://github.com/mxssl/sre-interview-prep-guide

Site Reliability Engineer Interview Preparation Guide

interview-preparation preparation site-reliability-engineer sre sre-interview study

Last synced: 01 Oct 2024

https://github.com/hjacobs/kubernetes-failure-stories

Compilation of public failure/horror stories related to Kubernetes

failures incidents kubernetes post-mortem postmortem production-engineering reliability sre

Last synced: 25 Sep 2024

https://github.com/isno/theByteBook

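⭐ [Open-source book] An in-depth explanation of kernel networking, Kubernetes, service mesh, containers, and other cloud-native technologies. A practice-tested guide to DevOps and SRE. If you find an error, please open an issue.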

cloud-native container devops distributed-systems finops kubernetes networking paas paxos raft service-mesh sre

Last synced: 01 Aug 2024

https://github.com/isno/thebytebook

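⭐ [Open-source book] An in-depth explanation of kernel networking, Kubernetes, service mesh, containers, and other cloud-native technologies. A practice-tested guide to DevOps and SRE. If you find an error, please open an issue.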

cloud-native container devops distributed-systems finops kubernetes networking paas paxos raft service-mesh sre

Last synced: 30 Sep 2024

https://github.com/StackStorm/st2

StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes rules engine, workflow, 160 integration packs with 6000+ actions (see https://exchange.stackstorm.org) and ChatOps. Installer at https://docs.stackstorm.com/install/index.html

auto-remediation automation chatops cicd deployment devops ifttt python sre st2 stackstorm workflows

Last synced: 31 Jul 2024

https://github.com/stackstorm/st2

StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes rules engine, workflow, 160 integration packs with 6000+ actions (see https://exchange.stackstorm.org) and ChatOps. Installer at https://docs.stackstorm.com/install/index.html

auto-remediation automation chatops cicd deployment devops ifttt python sre st2 stackstorm workflows

Last synced: 29 Sep 2024

https://github.com/rundeck/rundeck

Enable Self-Service Operations: Give specific users access to your existing tools, services, and scripts

ansible audit automation deployment devops devops-team devops-tools hacktoberfest java operations ops orchestration runbook rundeck scheduler sre

Last synced: 01 Oct 2024

https://github.com/k8sgpt-ai/k8sgpt

Giving Kubernetes Superpowers to everyone

ai devops kubernetes llama openai sre tooling

Last synced: 27 Sep 2024

https://github.com/jonmosco/kube-ps1

Kubernetes prompt info for bash and zsh

bash containers kubectl kubernetes kubernetes-helper prompts sre zsh

Last synced: 28 Sep 2024

https://github.com/leandromoreira/cdn-up-and-running

CDN Up and Running - Building a CDN from Scratch to Learn about CDN, Nginx, Lua, Prometheus, Grafana, Load balancing, and Containers.

cdn docker-compose grafana load-balancer lua luajit nginx openresty prometheus sre tutorial wrk

Last synced: 27 Sep 2024

https://github.com/bregman-arie/sre-checklist

A checklist for anyone practicing Site Reliability Engineering

automation checklist gitops kubernetes reliability-engineering sre terraform

Last synced: 29 Sep 2024

https://github.com/alibaba/sreworks

Cloud Native DataOps & AIOps Platform | Cloud-native data-intelligence operations platform

aiops application cloudnative dataops devops engineering flink k8s kubernetes maintenance oam operation ops saas sre

Last synced: 28 Sep 2024

https://github.com/alibaba/SREWorks

Cloud Native DataOps & AIOps Platform | Cloud-native data-intelligence operations platform

aiops application cloudnative dataops devops engineering flink k8s kubernetes maintenance oam operation ops saas sre

Last synced: 31 Jul 2024

https://github.com/google/cloudprober

[Moved to cloudprober/cloudprober] An active monitoring software to detect failures before your customers do.

blackbox cloud cloudprober devops distributed-monitoring gcp golang google grafana k8s kubernetes monitoring observability ping probe prober prometheus sre stackdriver

Last synced: 28 Sep 2024

https://github.com/chame1eon/jnitrace

A Frida based tool that traces usage of the JNI API in Android apps.

android frida jni jni-api reverse-engineering sre tracer

Last synced: 30 Sep 2024

https://github.com/ergomake/layerform

Layerform helps engineers create reusable environment stacks using plain .tf files. Ideal for multiple "staging" environments.

dev-environment developer-tools devops platform-engineering sre terraform

Last synced: 01 Aug 2024

https://github.com/briefercloud/layerform

Layerform helps engineers create reusable environment stacks using plain .tf files. Ideal for multiple "staging" environments.

dev-environment developer-tools devops platform-engineering sre terraform

Last synced: 26 Sep 2024

https://github.com/liquanzhou/ops_doc

A concise, practical operations handbook

ops python shell sre

Last synced: 30 Sep 2024

https://github.com/idoavrah/terraform-tui

Terraform textual UI

devops iac productivity sre terraform tui

Last synced: 29 Sep 2024

https://github.com/unixorn/git-extra-commands

A collection of git utilities, useful extra git scripts, tutorials and other useful articles.

antigen bash collection devops devops-tools git hacktoberfest oh-my-zsh oh-my-zsh-plugin prezto shell-script shell-scripts sre zgenom zsh-plugin zsh-plugins

Last synced: 02 Oct 2024

https://github.com/azure/caf-terraform-landingzones

This solution, offered by the Open-Source community, will no longer receive contributions from Microsoft. Customers are encouraged to transition to Microsoft Azure Verified Modules for continued support and updates from Microsoft. Please note, this repository is scheduled for decommissioning and will be removed on July 1, 2025.

azure azure-resource-manager devops enterprise platform platform-engineering sre terraform

Last synced: 30 Sep 2024

https://github.com/Azure/caf-terraform-landingzones

This solution, offered by the Open-Source community, will no longer receive contributions from Microsoft. Customers are encouraged to transition to Microsoft Azure Verified Modules for continued support and updates from Microsoft. Please note, this repository is scheduled for decommissioning and will be removed on July 1, 2025.

azure azure-resource-manager devops enterprise platform platform-engineering sre terraform

Last synced: 02 Aug 2024

https://github.com/upgundecha/howtheyaws

A curated collection of publicly available resources on how technology and tech-savvy organizations around the world use Amazon Web Services (AWS)

amazon-web-services automation aws cloud cloud-computing cloud-native devops hacktoberfest hacktoberfest-accepted hacktoberfest2021 infrastructure-as-code sre

Last synced: 30 Jul 2024

https://github.com/jetstack/version-checker

Kubernetes utility for exposing image versions in use, compared to latest available upstream, as metrics.

docker gcr go grafana grafana-dashboard image kubernetes prometheus quay sre utility version

Last synced: 27 Sep 2024

https://github.com/kaytu-io/kaytu

The Kaytu CLI improves the efficiency of cloud workloads by analyzing historical usage and providing tailored recommendations, such as changing instance sizes. This ensures you only pay for the resources you actually need without compromising stability.

cloud-optimization cloud-spend rightsizing sre workload-optimization

Last synced: 01 Aug 2024

https://github.com/squzy/squzy

Squzy is a high-performance open-source monitoring, incident, and alerting system written in Golang with Bazel and love. Welcome to free SRE.

bazel docker golang grpc monitoring opensource opensource-monitoring prometheus sitemap sre zabbix

Last synced: 31 Jul 2024

https://github.com/dingguodong/linuxbashshellscriptforops

Linux Bash Shell Script and Python Script For Ops and Devops

automation bash-script devops linux ops python-script repository-python repository-shell sre

Last synced: 01 Oct 2024

https://github.com/GoogleCloudPlatform/cloud-ops-sandbox

Cloud Operations Sandbox is an open source collection of tools that helps practitioners learn observability (o11y) and reliability (r9y) practices from Google and apply them using the Cloud Operations suite of tools.

cloud cloud-native cloud-operations cloudops devops google-cloud opencensus opentelemetry operations ops-management profiler samples sre stackdriver stackdriver-logs stackdriver-monitoring stackdriver-sandbox stackdriver-trace

Last synced: 01 Aug 2024

https://github.com/ryan4yin/knowledge

(Chinese only) Everything I know: DevOps & CloudNative, Linux, Embedded, Homelab, Music, Blockchain, AI, etc.

container devops devops-notes embedded kubernetes linux music sre

Last synced: 01 Aug 2024

https://github.com/teivah/sre-roadmap

An Opinionated Roadmap to Become an SRE (Concepts > Tools)

roadmap sre

Last synced: 01 Aug 2024

https://github.com/mehrdadrad/tcpprobe

Modern TCP tool and service for network performance observability.

docker http http2 https k8s kubernetes monitoring observability probe socket sre tcp

Last synced: 01 Aug 2024

https://github.com/waltenne/guiadevopsbrasil

Repository for sharing free content about DevOps

devops devops-tools docker documentation iac linux python sre terraform windows

Last synced: 31 Jul 2024

https://github.com/ozontech/file.d

A blazing fast tool for building data pipelines: read, process and output events. Our community: https://t.me/file_d_community

actions clickhouse elasticsearch events file gelf go http input json kafka logs observability output pipeline processing reading sre throttle tracing

Last synced: 01 Aug 2024

https://github.com/lwindolf/lzone-cheat-sheets

A collection of SRE / DevOps / system architecture cheat sheets hosted on https://lzone.de

architecture automation cheatsheet cloud devops devsecops kubernetes linux security sre

Last synced: 04 Aug 2024

https://github.com/actionjack/so-you-want-to-onboard-a-devops-engineer

Guidance on how to make your environment easier to onboard into for Web Ops Engineers, SREs, and DevOps Practitioners

culture devops devops-practitioners mentoring onboard ops-engineers sre starters

Last synced: 03 Aug 2024

https://github.com/chame1eon/jnitrace-engine

Engine used by jnitrace to intercept JNI API calls.

android frida jni jni-api reverse-engineering sre tracer

Last synced: 31 Jul 2024

https://github.com/k8sgpt-ai/k8sgpt-operator

Automatic SRE Superpowers within your Kubernetes cluster

devops kubernetes openai sre tooling

Last synced: 02 Aug 2024

https://github.com/merlinn-co/merlinn

Open source AI on-call developer 🧙‍♂️ Get relevant context & root cause analysis in seconds about production incidents and make on-call engineers 10x better 🏎️

aiops alerts chatops-ai devtools in incident incident-response incident-response-tooling llm llm-agent metrics monitoring observability oncall oncall-engineers site-reliability-engineering sre traces

Last synced: 01 Aug 2024

https://github.com/kgoralski/microservice-production-readiness-checklist

Principles that help you deploy safely to the production environment.

aws checklist cloud kubernetes microservices resiliency sre

Last synced: 01 Aug 2024

https://github.com/seznam/slo-exporter

Slo-exporter computes standardized SLI and SLO metrics based on events coming from various data sources.

alerting exporter grafana monitoring prometheus service-level-indicator service-level-objective sli slo slo-exporter sre sre-workbook

Last synced: 01 Aug 2024

https://github.com/last9/slo-computer

SLOs, error windows, and alerts are complicated. Here is an attempt to make them easy.

metrics observability service-level-indicator service-level-objective sla sli slo sre sre-team

Last synced: 31 Jul 2024

https://github.com/apiaryio/s3-streaming-upload

s3-streaming-upload is a Node.js library that listens to your stream and uploads its data to Amazon S3 using the ManagedUpload API.

sre

Last synced: 01 Aug 2024

https://github.com/adhorn/aws-chaos-scripts

DEPRECATED: Collection of Python scripts to run failure injection on AWS infrastructure

amazon-web-services aws chaos-engineering chaos-monkey deprecated software-engineering sre

Last synced: 02 Aug 2024

https://github.com/getstrake/developer-cost-guide

SQL code for developers to understand AWS cloud costs. Reduce time spent on billing, get back to engineering. Created and maintained by the team at Macroscope.

aws cloud cost-estimation finops sre

Last synced: 06 Aug 2024

https://github.com/dentrax/falco-gpt

AI-generated remediations for Falco audit events

audit-log chatgpt devops falco golang kubernetes openai sre sysdig threat-hunting tooling

Last synced: 27 Sep 2024

https://github.com/Dentrax/falco-gpt

AI-generated remediations for Falco audit events

audit-log chatgpt devops falco golang kubernetes openai sre sysdig threat-hunting tooling

Last synced: 01 Aug 2024

https://github.com/adhorn/aws-fis-templates-cdk

Collection of AWS Fault Injection Simulator (FIS) experiment templates deployable via the AWS CDK

amazon-web-services automation aws aws-fis cdk-examples cdk-library chaos-engineering chaos-testing devops-tools sre testing

Last synced: 04 Aug 2024

https://github.com/microsoft/SQLCallStackResolver

A sample tool for users of Microsoft SQL Server to aid in troubleshooting otherwise difficult to diagnose issues. Provided AS-IS - see SUPPORT.md.

azuresql azuresqldb azuresqlmanagedinstance callstack debugging debugging-symbol msdia140 pdb pdb-files sqlserver sqlserver-2017 sqlserver-2019 sqlserver-2022 sre symbols tool xevent xevents

Last synced: 01 Aug 2024

https://github.com/ory/jobs

Want to build the next generation identity stack? You've come to the right place!

go hiring jobs kubernetes open-source opensource ory react sre

Last synced: 31 Jul 2024

https://github.com/sitectl/cuttle

Blue Box SRE Operations Platform

ansible bastion bluebox elk operations sensu sre

Last synced: 01 Aug 2024

https://github.com/Excoriate/go-terradagger

TerraDagger is a Go package for managing your infrastructure-as-code through containers.

cli devops ecs example sre tooling

Last synced: 02 Aug 2024

https://github.com/fkie-cad/logprep

Log data preprocessing in Python

elasticsearch etl kafka logdata opensearch preprocessing python soar sre

Last synced: 30 Sep 2024

https://github.com/immobiliare/collectd-haproxy-plugin

Collectd plugin to pull metrics from HAProxy instances

collectd collectd-plugin grafana haproxy metrics monitoring sre

Last synced: 01 Aug 2024

https://github.com/grafana/xk6-chaos

xk6 extension for running chaos experiments with k6 💣

chaos chaos-engineering k6-extension reliability sre testing xk6

Last synced: 27 Sep 2024

https://github.com/better-sre/config

Config files, Dockerfiles, and Taskfiles for developers.

docker dotfiles flutter go-task golang python rust sre taskfile

Last synced: 03 Aug 2024

https://github.com/operate-first/operations

The sig-operations repository.

site-reliability-engineering sre

Last synced: 07 Aug 2024

https://github.com/butuzov/todayilearned

Because I Can't Trust My Memory

bash go jupyter linux python sre

Last synced: 02 Aug 2024

https://github.com/todd-dsm/mac-ops

Quick-and-dirty (QnD) automation to build a MacBook Pro for DevOps

customizable devops devops-tools macbook-configuration macbook-setup macos sre

Last synced: 01 Aug 2024

https://github.com/tedilabs/terraform-aws-vpc-connectivity

🌳 A sustainable Terraform Package which creates VPC Connectivity resources (Private Link, Client VPN, Site-to-Site VPN, DX, VPC Lattice) on AWS

aws aws-client-vpn aws-direct-connect aws-dx aws-site-to-site-vpn aws-vpc aws-vpc-lattice aws-vpc-private-link aws-vpn devops hacktoberfest hcl2 iac lang-hcl sre tedilabs terraform terraform-aws terraform-module terraform-modules

Last synced: 26 Sep 2024

https://github.com/johndeere/work-tracker

Observe and protect your Java web application.

elasticsearch java java11 java8 mdc metadata observability spring spring-boot sre

Last synced: 28 Sep 2024

https://github.com/googlecloudplatform/reliable-app-platforms

An MVP of a platform for delivering reliable applications on Google Cloud

gke google-cloud kubernetes reliability slos sre terraform

Last synced: 28 Sep 2024

https://github.com/tedilabs/terraform-aws-lambda

🌳 A sustainable Terraform Package which creates Lambda & Step Functions resources on AWS

aws aws-lambda aws-sfn aws-step-functions devops hacktoberfest hcl2 iac lang-hcl sre tedilabs terraform terraform-aws terraform-module terraform-modules

Last synced: 26 Sep 2024

https://github.com/tedilabs/terraform-aws-ipam

🌳 A sustainable Terraform Package which creates IPAM resources (IPAM, Elastic IP, Prefix List) on AWS

aws aws-eip aws-elastic-ip aws-ipam aws-prefix-list aws-vpc devops hacktoberfest hcl2 iac lang-hcl sre tedilabs terraform terraform-aws terraform-module terraform-modules

Last synced: 26 Sep 2024

https://github.com/jayvdb/sre-tools

Helpers for sre_parse, transforming regexes

python3 regex regular-expressions sre sre-parse

Last synced: 02 Oct 2024

https://github.com/johndeere/outstanding

A Java concurrent collection for in-progress work

java java-collections java8 sre

Last synced: 28 Sep 2024

https://github.com/amaurybsouza/terraform-aws-ec2-ssh

Amazon Elastic Compute Cloud (EC2) is a web service that provides resizable compute capacity in the cloud. It is one of the core services offered by Amazon Web Services (AWS) and provides a wide range of features and capabilities.

aws aws-ec2 devops devops-tools github github-actions infrastructure-as-code infrstructure sre terraform terraform-aws terraform-managed terraform-modules terraform-provider

Last synced: 26 Sep 2024

https://github.com/tedilabs/terraform-okta-modules

🌳 A sustainable Terraform Package that manages everything on Okta

devops hacktoberfest hcl2 iac lang-hcl okta sre tedilabs terraform terraform-module terraform-modules terraform-okta

Last synced: 26 Sep 2024

https://github.com/tiagotartari/observability-dotnet-opentelemetry-first-steps

This project demonstrates how to implement observability in .NET applications using OpenTelemetry.

dotnet dotnet8 logs metrics observability opentelemetry opentelemetry-collector opentelemetry-dotnet sre traces

Last synced: 26 Sep 2024

https://github.com/tedilabs/terraform-tfe-modules

🌳 A sustainable Terraform Package to manage everything on Terraform Enterprise (Terraform Cloud)

devops hacktoberfest hcl2 iac lang-hcl sre tedilabs terraform terraform-cloud terraform-enterprise terraform-module terraform-modules tfe

Last synced: 26 Sep 2024

https://github.com/toolsascode/gomodeler

Go Modeler is a small CLI and Library that brings the powerful features of the golang template into a simplified form.

ci cloud devops github-actions infra it pipeline platform sre

Last synced: 30 Sep 2024

https://github.com/ranching-farm/kubectl-addon

Kubectl addon for connecting Kubernetes clusters to ranching.farm - an AI-powered Kubernetes management platform. Simplify cluster operations and get intelligent assistance for common tasks.

ai-assistant ai-assisted cluster-management devops helm k8s krew krew-plugin kubectl kubectl-commands kubectl-plugin kubectl-plugins kubernetes kustomize ranching-farm sre

Last synced: 26 Sep 2024