awesome-devops-platform
🚀 A curated list of awesome Platform Engineering tools, practices, and resources for building modern Internal Developer Platforms (IDPs) and cloud-native infrastructure
https://github.com/tysoncung/awesome-devops-platform
-
AI & Automation in DevOps
-
AIOps Tools
- Moogsoft - AI-powered observability.
- BigPanda - Event correlation and automation.
- PagerDuty AIOps - Intelligent incident management.
-
AI-Powered Operations
- Dynatrace Davis - AI engine for automatic problem detection.
- New Relic AI - Applied intelligence for DevOps.
- Datadog AI - AI-powered monitoring and analytics.
- Splunk AI - Machine learning for IT operations.
-
Automation Platforms
- Terraform Cloud - Infrastructure automation as a service.
- Zapier for DevOps - Workflow automation.
- n8n - Workflow automation platform.
-
-
Cloud Development Kits
-
Kubernetes CDKs
- CDK8s - Define Kubernetes applications using code.
- Kubernetes Operators - Application-specific controllers.
- Kubebuilder - SDK for building Kubernetes APIs.
-
Multi-Cloud CDKs
- AWS CDK - Cloud Development Kit for AWS.
- Azure Bicep - DSL for deploying Azure resources.
- Pulumi CDK - Multi-cloud infrastructure as code.
- CDKTF - CDK for Terraform.
- Google Cloud Deployment Manager - Infrastructure deployment service.
-
-
Communities
-
Conferences
- PlatformCon - Platform engineering conference.
- KubeCon + CloudNativeCon - Cloud native conference.
- SREcon - Site reliability engineering conference.
-
Forums & Discussion
- r/platformengineering - Platform engineering subreddit.
- r/devops - DevOps subreddit.
- Stack Overflow DevOps - Q&A for DevOps.
- DevOps.com - DevOps news and articles.
-
Organizations
- Platform Engineering Community - Global platform engineering community.
- CNCF (Cloud Native Computing Foundation) - Cloud native ecosystem.
- DevOps Community Hub - DevOps community and education.
- SRE Community - Site Reliability Engineering community.
-
Slack Communities
- Kubernetes Slack - Kubernetes community.
- DevOps Chat - DevOps professionals community.
- SRE Community Slack - SRE discussions.
- Platform Engineering Slack - Join the platform engineering Slack community.
-
-
Cost Management
-
Cost Optimization
- CAST AI - Kubernetes cost optimization platform.
- CloudHealth - Cloud management platform.
-
FinOps Tools
- Cloud Custodian FinOps - Cloud governance and cost optimization.
- Vantage - Cloud cost transparency platform.
- OpenCost - Open source Kubernetes cost monitoring.
-
-
Developer Experience Tools
-
CLI Tools
- GitHub CLI - GitHub from the command line.
- kubectl - Kubernetes command-line tool.
- k9s - Terminal UI for Kubernetes.
- Lens - Kubernetes IDE.
- Teleport - Secure access to infrastructure.
-
Development Environments
- Eclipse Che - Kubernetes-native IDE.
-
Local Development
-
Remote Development
- GitHub Codespaces - Cloud development environments.
- Gitpod - Cloud development environment.
- Coder - Self-hosted remote development.
- Code-Server - VS Code in the browser.
-
Testing & Quality
- LocalStack - Local AWS cloud stack.
- k6 - Load testing for engineering teams.
- Cypress - End-to-end testing framework.
-
-
Developer Portals
-
Documentation Platforms
- Stoplight - API design and documentation platform.
-
Service Catalogues
- Backstage Software Catalog - Centralized system model.
- ServiceNow Service Catalog - Enterprise service management.
- Port Service Catalog - Comprehensive service registry.
-
-
Disaster Recovery & Backup
-
Backup Solutions
- Velero - Kubernetes backup and migration.
- Veeam - Backup, recovery, and data management.
- Restic - Fast, secure backup program.
- Kasten K10 - Kubernetes data management.
-
Data Replication
-
Disaster Recovery
- Zerto - Cloud data management and protection.
- VMware Site Recovery Manager - Disaster recovery automation.
- AWS Backup - Centralized backup for AWS.
- Azure Site Recovery - DR as a service.
-
-
Environment Management
-
Environment Provisioning
- Terraform Cloud - Collaborative infrastructure provisioning.
- Pulumi Service - Infrastructure provisioning and management.
- AWS Control Tower - Multi-account AWS environment setup.
-
Ephemeral Environments
-
Preview Environments
-
-
GitOps Tools
-
GitOps Frameworks
- Kustomize - Kubernetes native configuration management.
- Helm - Package manager for Kubernetes.
- Config Sync - Google's GitOps for Kubernetes.
-
GitOps Operators
- ArgoCD - Declarative GitOps CD for Kubernetes.
- Flux - GitOps toolkit for Kubernetes.
- Fleet - GitOps at scale for Kubernetes.
- Weave GitOps - Enterprise GitOps platform.
-
-
Infrastructure as Code
-
Provisioning Tools
- OpenTofu - Open-source Terraform fork.
- Terragrunt - Terraform wrapper for DRY configurations.
- Terramate - Terraform orchestration and code generation.
-
-
Internal Developer Platforms (IDPs)
-
Commercial Platforms
- Configure8 - Developer portal and service catalogue.
-
Open Source Platforms
- Humanitec Platform Orchestrator - Dynamic Internal Developer Platform orchestrator.
- Otomi - Self-hosted PaaS for Kubernetes.
- KubeVela - Application delivery platform based on OAM.
- Gimlet - Application deployment platform with GitOps.
- Qovery - Platform to deploy applications on AWS, GCP, Azure.
- mogenius - Virtual DevOps platform.
-
-
Learning Resources
-
Courses & Certifications
- Platform Engineering Academy - Comprehensive platform engineering courses.
- CNCF Certifications - Cloud native certifications.
- AWS Certifications - AWS cloud certifications.
- DevOps Institute Training - DevOps certifications.
-
Podcasts
- Platform Engineering Podcast - Insights from platform engineering leaders.
- Kubernetes Podcast - Weekly news and interviews.
- DevOps Paradox - DevOps discussions.
- The POPCAST - Platform engineering stories.
-
Tutorials & Labs
- KillerCoda - Hands-on cloud native scenarios.
- Play with Kubernetes - Kubernetes playground on KillerCoda.
- Instruqt - Hands-on virtual IT labs.
-
YouTube Channels
- DevOps Toolkit - Viktor Farcic's DevOps content.
- CNCF - Cloud Native Computing Foundation.
-
-
Multi-Cloud & Hybrid
-
Cloud Abstraction
- Apache Libcloud - Python library for cloud providers.
- Fog - Ruby cloud services library.
- Pkgcloud - Node.js cloud abstraction library.
- Apache jclouds - Java multi-cloud toolkit.
-
Multi-Cloud Management
- AWS Outposts - AWS infrastructure on-premises.
- Rancher - Multi-cluster Kubernetes management.
- Azure Arc - Multi-cloud and edge management.
- VMware Tanzu - Multi-cloud application platform.
-
-
Multi-Cloud Management
-
Cloud Cost Management
- Spot by NetApp - Cloud infrastructure optimization.
- CloudZero - Cloud cost intelligence.
- Apptio Cloudability - Financial management for cloud.
- Harness Cloud Cost Management - Automated cost optimization.
-
Multi-Cloud Networking
- Alkira - Cloud networking as a service.
- Tigera Calico - Cloud-native networking and security.
-
Multi-Cloud Platforms
- Terraform - Multi-cloud infrastructure automation.
- Pulumi - Universal infrastructure as code.
- Crossplane - Universal control plane for multi-cloud.
- CloudBolt - Hybrid cloud management platform.
-
-
Observability Platforms
-
Distributed Tracing
- Jaeger - Distributed tracing platform.
- Zipkin - Distributed tracing system.
- Tempo - High-volume distributed tracing backend.
- AWS X-Ray - Distributed tracing for AWS.
- Google Cloud Trace - Distributed tracing for GCP.
-
Logging
- Loki - Log aggregation system by Grafana.
- Fluentd - Unified logging layer.
- Vector - High-performance observability data pipeline.
- Fluent Bit - Fast and lightweight log processor.
- Elastic Stack (ELK) - Search, analyze, and visualize logs.
-
Metrics & Monitoring
-
-
Platform Engineering Fundamentals
-
Articles & Papers
- What is Platform Engineering? - Definition and principles.
- Platform Engineering Blog - Latest articles and industry insights.
- Platform Tooling Landscape - Guide to choosing the right tools for your IDP.
-
Books & Guides
- Team Topologies - Organising business and technology teams for fast flow.
- Platform Engineering Fundamentals - Certification program and comprehensive guide.
- Internal Developer Platforms Guide - Comprehensive resource on building IDPs.
- Platform Engineering - Building robust cloud platforms.
-
-
Platform Metrics & Analytics
-
Cost Analytics
- Infracost - Cloud cost estimates for IaC.
-
Developer Experience Metrics
- DORA Metrics - Four key DevOps metrics.
- Sleuth - DORA metrics tracking.
- LinearB - Engineering effectiveness metrics.
- Haystack - Engineering productivity analytics.
-
Platform Observability
- Prometheus - Metrics collection and alerting.
- Grafana - Analytics and monitoring.
- New Relic - Application performance monitoring.
-
-
Platform Orchestration
-
Resource Orchestration
- Kubernetes - Container orchestration platform.
- Nomad - Simple and flexible workload orchestrator.
- Docker Swarm - Container orchestration built into Docker.
- Apache Mesos - Distributed systems kernel.
-
Workflow Orchestration
- Temporal - Workflow orchestration platform.
- Argo Workflows - Container-native workflow engine.
- Apache Airflow - Platform to programmatically author workflows.
- Prefect - Modern workflow orchestration.
- Dagster - Data orchestrator for machine learning.
-
-
Platform Templates & Blueprints
-
Platform Examples
- Kubernetes Examples - Kubernetes application examples.
- AWS CDK Examples - CDK code examples.
- Terraform Examples - Terraform patterns and practices.
- IDP Reference Architectures - Reference architectures and examples.
-
Reference Architectures
- AWS Well-Architected Framework - Best practices for cloud architectures.
- Azure Architecture Center - Azure reference architectures.
- Google Cloud Architecture Framework - GCP best practices.
- CNCF Cloud Native Trail Map - Path to cloud native adoption.
-
Starter Kits
- Projen - Project generator for modern applications.
- Yeoman - Scaffolding tool for modern web apps.
- Backstage Software Templates - Scaffolding for services.
- Cookiecutter - Project templates for any language.
-
-
Policy as Code
-
Compliance Tools
- CloudQuery - Cloud asset inventory and compliance.
- Steampipe - Query cloud infrastructure with SQL.
- Prowler - AWS/Azure/GCP security assessments.
- ScoutSuite - Multi-cloud security auditing.
-
Policy Engines
- Open Policy Agent (OPA) - General-purpose policy engine.
- Kyverno - Kubernetes-native policy management.
- Checkov - Static code analysis for IaC.
- jsPolicy - JavaScript-based Kubernetes admission control.
-
Security Policies
-