Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/techiescamp/kubernetes-learning-path
A roadmap to learn Kubernetes from scratch (Beginner to Advanced level)
https://github.com/techiescamp/kubernetes-learning-path
k8s kubernetes kubernetes-cluster kubernetes-deployment kubernetes-learning kubernetes-learning-apth kubernetes-security kubernetes-setup
Last synced: 5 days ago
JSON representation
A roadmap to learn Kubernetes from scratch (Beginner to Advanced level)
- Host: GitHub
- URL: https://github.com/techiescamp/kubernetes-learning-path
- Owner: techiescamp
- Created: 2022-11-16T05:02:58.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-10-24T15:51:22.000Z (3 months ago)
- Last Synced: 2024-10-29T09:54:32.585Z (2 months ago)
- Topics: k8s, kubernetes, kubernetes-cluster, kubernetes-deployment, kubernetes-learning, kubernetes-learning-apth, kubernetes-security, kubernetes-setup
- Homepage: https://devopscube.com/learn-kubernetes-complete-roadmap/
- Size: 1.01 MB
- Stars: 6,810
- Watchers: 149
- Forks: 861
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- backend-cheats - **Kubernetes Learning Roadmap** – GitHub
- awesome-tools - kubernetes-learning-path - A roadmap to learn Kubernetes from scratch (Beginner to Advanced level) (Uncategorized / Uncategorized)
README
## Hit the Star! :star:
If you are planning to use this repo for reference, please hit the star. Thanks!
## Kubernetes Learning Roadmap
The Kubernetes Learning Roadmap is constantly updated with new content, so you can be sure that you're getting the latest and most up-to-date information available.
## Kubernetes Certification Voucher (UpTo 35% OFF) 🎉
>**Important Note:** Kubernetes certification prices are increasing this month. So make use of this offer to lockin the savings.
As part of our commitment to helping the DevOps community save money on Kubernetes Certifications, we continuously update the latest voucher codes from the Linux Foundation
🚀 CKA, CKAD, CKS, or KCNA exam aspirants can **save 30%** today using code **DCUBE30** at https://kube.promo/devops. It is a limited-time offer from the Linux Foundation.
The following are the best bundles to **save upto 35%** with code **DCUBE30**
- KCNA + KCSA + CKA + CKAD + CKS (35% Savings): [kube.promo/kubestronaut](https://kube.promo/kubestronaut)
- CKA + CKAD + CKS Exam bundle (35% Savings): [kube.promo/k8s-bundle](https://kube.promo/k8s-bundle)
- CKA + CKS Bundle (35% Savings) [kube.promo/bundle](https://kube.promo/bundle)
- KCNA + CKA (35% Savings) [kube.promo/kcka-bundle](https://kube.promo/kcna-cka)
- KCSA + CKS Exam Bundle (35% Savings) [kube.promo/kcsa-cks](https://kube.promo/kcsa-cks)
- KCNA + KCSA Exam Bundle (35% Savings) [kube.promo/kcna-kcsa](https://kube.promo/kcna-kcsa)>Note: You have one year of validity to appear for the certification exam after registration
## Kubernetes Learning Prerequisites
If you want to learn Kubernetes, it's important to start with the basics. That means brushing up on your IT fundamentals first because Kubernetes builds on those. Once you have a good grasp of the basics, learning Kubernetes can be fun and easy. So don't skip the fundamentals – take some time to study them before diving into Kubernetes!
- [Learn Container concepts & Container Management Tool- Docker/Podman](https://techiescamp.com/p/container-fundamentals-course)Free Course
- [Understand Distributed system](https://www.freecodecamp.org/news/a-thorough-introduction-to-distributed-systems-3b91562c9b3c) Blog
- [Understand Authentication & Authorization](https://www.okta.com/identity-101/authentication-vs-authorization/) Blog
- [Learn Basics of Key Value Store](https://redis.com/nosql/key-value-databases/)Blog
- [Learn the basics of REST API](https://blog.postman.com/intro-to-apis-what-is-an-api/)Blog
- [Learn YAML](https://www.educative.io/blog/yaml-tutorial?aff=KNLz)Blog
- [Understand Service Discovery](https://devopscube.com/service-discovery-explained/) Blog
- Learn Networking Basics
- [L4 & L7 Layers (OSI Layers)](https://www.cloudflare.com/en-gb/learning/ddos/glossary/open-systems-interconnection-model-osi/)Blog
- [SSL/TLS](https://www.cloudflare.com/en-gb/learning/ssl/how-does-ssl-work/)Blog
- [Network Proxy Basics](https://stackoverflow.com/questions/224664/whats-the-difference-between-a-proxy-server-and-a-reverse-proxy-server)Blog
- [DNS](https://www.cloudflare.com/en-gb/learning/dns/what-is-dns/)Blog
- [IPTables](https://www.youtube.com/watch?v=6Ra17Qpj68c)Video & [NFTables](https://www.linode.com/docs/guides/how-to-use-nftables/)Video
- [Software Defined Networking (SDN)](https://www.vmware.com/topics/glossary/content/software-defined-networking.html)Blog## Kubernetes Architecture
The following image shows the high-level kubernetes architecture and how external services connect to the cluster.
![02-k8s-architecture-github](https://github.com/user-attachments/assets/6b514c13-56ba-4474-83c8-469942fc43f6)
Refer to the following documents to learn about every kubernetes component in detail.
- [Kubernetes Architecture Explained](https://devopscube.com/kubernetes-architecture-explained/)Blog
## $1000+ Free Cloud Credits to Launch Clusters
Launching large clusters in the cloud can be costly. So utilize the available cloud credits to practice deploying clusters as if you work on a real project. All cloud platforms offer managed Kubernetes services.
- [GKE -Google Cloud $300 free credits](https://cloud.google.com/kubernetes-engine)Cloud Platform
- [EKS - AWS $300 free POC credits](https://pages.awscloud.com/GLOBAL_NCA_LN_ARRC-program-A300-2023.html)Cloud Platform
- [DO Kubernetes - Digital Ocean – $200 free credits](https://devopscube.com/recommends/digital-ocean-sidebar/)Cloud Platform
- [Linode Kubernetes Engine - Linode Cloud – $100 Free credits](https://devopscube.com/recommends/linode-credits/)Cloud Platform
- [Vultr Kubernetes Engine - Vultr Cloud - $250 Free Credits](https://devopscube.com/recommends/vultr-credits/)Cloud Platform
- [AKS - Azure Cloud Hosting - $200 Free Credits](https://azure.microsoft.com/en-us/free/)Cloud Platform
## Kubernetes Cluster Setup & AdministrationAs DevOps engineers, gaining a thorough understanding of each component and cluster configuration is crucial to work in production environments. Though there are various methods for deploying a Kubernetes cluster, it is advisable to learn how to set up multi-node clusters from scratch. This allows you to gain knowledge on concepts such as High Availability, Scaling, and Networking and simulates a real-world project.
Additionally, mastering the configuration of multi-node clusters can be beneficial for interviews and building confidence in your abilities. The following are recommended ways to establish a Kubernetes cluster.
- [Kubernetes the Hard Way](https://github.com/kelseyhightower/kubernetes-the-hard-way)Github
- [Kubeadm Cluster Setup](https://devopscube.com/setup-kubernetes-cluster-kubeadm/)Blog
- [Minikube Development Cluster ](https://devopscube.com/kubernetes-minikube-tutorial/)Blog
- [Kind Development Cluster](https://kind.sigs.k8s.io/)Official DocumentationFollowing are some of the important cluster administrative tasks
- [Deploy Kubernetes Dashboard](https://kubernetes.io/docs/tasks/access-application-cluster/web-ui-dashboard/)Official Doc
- [Important Kubernetes Cluster Configurations](https://devopscube.com/kubernetes-cluster-configurations/)Blog
- [Kubeadm Cluster Upgrade](https://devopscube.com/upgrade-kubernetes-cluster-kubeadm/)Blog
- [etcd backup using etcdctl](https://devopscube.com/backup-etcd-restore-kubernetes/)Blog## Understand KubeConfig File
As a DevOps engineer, it is important to become familiar with the Kubeconfig file. It is crucial for tasks such as setting up cluster authentication for CI/CD systems, providing cluster access to developers, and more.
A Kubeconfig file is a YAML file that stores information and credentials for connecting to a Kubernetes cluster. It is used by command-line tools such as kubectl and other client libraries to authenticate with the cluster and interact with its resources.
The Kubeconfig file can be used to store information for multiple clusters and users, allowing users to switch between different clusters and contexts easily. It is an important tool for managing access to and interacting with Kubernetes clusters.
Refer to the following document to learn about the Kubeconfig File in detail.
- [Kubeconfig File Explained With Practical Examples](https://devopscube.com/kubernetes-kubeconfig-file/) Blog
## Understand Kubernetes Objects And Resources
In Kubernetes, an object is a persisted entity in the cluster that represents a desired state of the system. It is created and managed by the Kubernetes API server and is stored in the etcd key-value store. Examples of Kubernetes objects include pods, services, and deployments.
Here is an example of a Pod Object
apiVersion: v1
kind: Pod
metadata:
name: nginx
spec:
containers:
- name: nginx
image: nginx:1.14.2
ports:
- containerPort: 80A resource is a representation of a Kubernetes object that is exposed by the Kubernetes API. It is a way for clients to interact with and manipulate objects in the cluster.
A resource refers to a specific API URL used to access an object. Resources are typically accessed through the Kubernetes API using HTTP verbs such as GET, POST, and DELETE. For instance, the
/api/v1/pods
resource can be used to retrieve a list of v1 Pod objects. Additionally, an individual v1 Pod object can be obtained from the/api/v1/namespaces/**namespace-name**/pods/**pod-name**
resource.**Detailed Blog:** [Kubernetes Objects & Resources Explained](https://devopscube.com/kubernetes-objects-resources/)
## Learn About the Object YAML Structure
Every object in Kubernetes is represented/created using a YAML file.
Kubernetes has many native objects (20+), however, every object YAML follows a hierarchical structure as shown below.
```
apiVersion:
kind:
metadata:
name:
spec:
>
```
Here is what each section means.- **apiVersion:** Specifies the Kubernetes API version used for the object.
- **kind:** Defines the type of Kubernetes object being created or modified.
- **metadata:** Contains information about the object.
- **spec:** Defines the desired state of the object, including its configuration and behavior. Under spec, there could be many subfields depending on the object type.The structure remains the same for all native Kubernetes objects. While learning about each object, you can check the hierarchy, and you will be able to relate.
## Learn All Pod Concepts & Features
All the essential concepts in Kubernetes center around the Pod. Understanding Pods in detail, along with their supported features, is crucial for anyone working with Kubernetes because many other objects in Kubernetes are built around them. Below are comprehensive guides that delve into various aspects of the Pod with real-world practical examples.
- [Kubernetes Pod Explained](https://devopscube.com/kubernetes-pod/) Blog
- [multi-container pods](https://www.mirantis.com/blog/multi-container-pods-and-container-communication-in-kubernetes/)Blog
- [Init Containers Explained](https://devopscube.com/kubernetes-init-containers/) Blog
- [Pod Lifecycle Phases](https://devopscube.com/kubernetes-pod-lifecycle/)Blog
- [Pod Priority, PriorityClass, and Preemption](https://devopscube.com/pod-priorityclass-preemption/)Blog
- [Pod Quality or Service - QoS](https://blog.techiescamp.com/kubernetes-pod-qos/)Blog
- [Troubleshoot Pod](https://devopscube.com/troubleshoot-kubernetes-pods/)Blog
- [Container Lifecyle Hooks](https://kubernetes.io/docs/concepts/containers/container-lifecycle-hooks/)Official Doc
- [Pod Disruption Budget](https://cast.ai/blog/pod-disruption-budgets-in-your-deployment/)Blog
- [Pod Affinity/Anti-Affinity](https://www.densify.com/kubernetes-autoscaling/kubernetes-affinity/)Blog
- [Pod Labels & Selectors](https://www.split.io/blog/kubernetes-labels-best-practices/)BlogIn the topics above, we've covered all the core concepts of Pods that are used in production-level implementations. You should practice these concepts hands-on. Once you have a solid practical understanding of Pods, you can move on to learning about objects that depend on Pods.
## Learn About Pod Dependent Objects
Running applications on a single pod can be a single point of failure. That's why Kubernetes provides various objects that use pods to make applications highly available.
### 1. ReplicaSet
Makes sure a specific number of pod replicas are running at all times. If a pod crashes, the ReplicaSet starts a new one.**Use Case:** Good for stateless applications where you need multiple identical pods.
**Detailed Blog:** [Replicaset Guide](https://sysdig.com/learn-cloud-native/kubernetes-101/kubernetes-replicasets-overview/)
---
### 2. Deployment
Manages ReplicaSets and allows for easy updates and rollbacks. It also scales the number of pod replicas.
**Use Case:** Useful for stateless applications and when you need to update or rollback easily.
**Detailed Blog:** [Deployment Explained](https://codefresh.io/learn/kubernetes-deployment/)
---
### 3. StatefulSet
Like a Deployment, but for stateful applications. It gives each pod a unique identity.
**Use Case:** Good for databases and other stateful applications.
**Detailed Blog:** [Statefulset Explained](https://loft.sh/blog/kubernetes-statefulset-examples-and-best-practices/)
---
### 4. DaemonSet
Ensures that each node in the cluster runs a copy of a pod.
**Use Case:** Useful for node monitoring or logging agents.
**Detailed Blog:** [Daemonset Explained](https://devopscube.com/kubernetes-daemonset/)
---
### 5. Job
Creates one or more pods and ensures that a specified number of them are completed successfully.
**Use Case:** Good for batch processing tasks.
**Detailed Blog:** [Kubernetes Jobs Practical Guide](https://devopscube.com/create-kubernetes-jobs-cron-jobs/)
---
### 6. CronJob
Like a Job, but runs at specific times or intervals.
**Use Case:** Useful for scheduled tasks like backups.
**Detailed Blog:** [Kubernetes CronJobs Practical Guide](https://devopscube.com/create-kubernetes-jobs-cron-jobs/)
## Learn About Services
Applications deployed on Pods using deployments may need to be accessed either internally within the cluster by other services or externally from outside the cluster.
The feature that facilitates this access to Pods is known as Services in Kubernetes.
Services provide a stable IP address and DNS name, enabling seamless communication and load balancing among Pods, regardless of their lifecycle changes. This ensures that the network connectivity to applications remains consistent and reliable.
**Detailed Blog:** [Kubernetes Service Explained Visually](https://medium.com/swlh/kubernetes-services-simply-visually-explained-2d84e58d70e5)
## Learn About Ingress & Ingress Controller
Applications running inside Kubernetes require more than just a single endpoint. For instance, if you are running an e-commerce microservice application, you may need to route incoming requests to multiple backend services based on the request path. This is where the Ingress object comes into play.
Ingress Acts like a "front door" to manage incoming traffic to multiple Kubernetes backend services. It Works at Layer 7 (Application Layer) of the OSI model, meaning it can understand HTTP and HTTPS.
### Ingress
Using an Ingress object you can define a set of rules for how traffic should be routed. For example, you can set up rules to send traffic for "www.example.com/shop" to a "shop" service and "www.example.com/blog" to a "blog" service. The only work of ingress is to maintain the routing rules.
**Detailed Blog:** [Kubernetes Ingress Explained](https://devopscube.com/kubernetes-ingress-tutorial/)
### Ingress Controller
An Ingress Controller is the part that actually makes the Ingress work. It is software that runs inside your cluster and listens for changes to DNS/routing rules from the Ingress object. The software could be Nginx, HAproxy, evvoy, etc.
**Detailed Blog:** [Ingress Controller Explained With Practical Example](https://devopscube.com/setup-ingress-kubernetes-nginx-controller/)
**Ingress Controller Comparison** [Comparison of Kubernetes Ingress controllers](https://docs.google.com/spreadsheets/d/191WWNpjJ2za6-nbG4ZoUMXMpUK8KlCIosvQB0f-oq3k/edit#gid=907731238)
### Gateway API
Gateway API is like an upgraded version of the Ingress system. It lets you define how traffic should be handled in a more detailed way. For example, you can specify different kinds of load balancing, or set up more complex routing rules.
**Official Documentation:** [Gateway API Documentation](https://gateway-api.sigs.k8s.io/)
## Learn to Implement Network Policy
Kubernetes Network Policy is like a set of rules for how pods can talk to each other.
- **Detailed Doc:** [Kubernetes Network Policy](https://kubernetes.io/docs/concepts/services-networking/network-policies/)
- **Useful Tool:** [Network Policy Editor](https://editor.networkpolicy.io/)
- **Examples**: [Network Policy Recipes](https://github.com/ahmetb/kubernetes-network-policy-recipes)## Learn Kubernetes Logging & Monitoring
Logging is an important aspect of Kubernetes.
**Detailed Logging Guide:** [Kubernetes Logging](https://devopscube.com/kubernetes-logging-tutorial/)
## Learn About Securing Kubernetes Cluster
Securing a Kubernetes cluster is not just a good practice; it's a necessity.
- The first reason is data protection. You store a lot of sensitive information in your cluster, and the last thing you want is unauthorized access to it.
- Second, there's the integrity of your system. If someone gains unauthorized access, they can disrupt your operations, leading to downtime and potentially significant financial loss.
- Compliance is another big factor, especially for businesses in regulated industries like healthcare or finance. You have to meet certain security standards, and a secure cluster is a step in that direction.
- Lastly, there's the cost factor. Dealing with a security breach can be far more expensive than investing in security measures upfront.- **Kubernetes CIS Benchmarking**: [CIS Benchmarking using Kube-bench](https://devopscube.com/kube-bench-guide/)Blog
- **Runtime Security** [Getting Started With Falco](https://sysdig.com/blog/intro-runtime-security-falco/)Blog
- **Policy Enforcement**: [Open Policy Agent Guide](https://spacelift.io/blog/what-is-open-policy-agent-and-how-it-works)Blog## Kubernetes Advanced Concepts
Now that you have a basic understanding and practical knowledge of Kubernetes, you can begin exploring advanced Kubernetes concepts to further increase your knowledge.
[**Admission Controllers:**](https://kubernetes.io/blog/2019/03/21/a-guide-to-kubernetes-admission-controllers/)Official Blog These are plugins that intercept requests to the Kubernetes API server before the persistence of the object, but after the request is authenticated and authorized.
[**Dynamic Admission Controller:**](https://kubernetes.io/docs/reference/access-authn-authz/extensible-admission-controllers/)Official Doc These are HTTP callbacks that receive admission requests and let you implement custom admission logic. There are two types of admission webhooks:
- **Validating Admission Webhooks:** They can be used to perform validations on Kubernetes objects and reject requests that do not meet certain criteria.
- [**Mutating Admission Webhooks:**](https://medium.com/ibm-cloud/diving-into-kubernetes-mutatingadmissionwebhook-6ef3c5695f74)Blog They can be used to modify Kubernetes objects before they are stored (for example, to inject sidecar containers into pods).[**Custom Resource Definitions (CRDs):**](https://kubernetes.io/docs/concepts/extend-kubernetes/api-extension/custom-resources/)Official Doc Extend Kubernetes API to create custom resources.
[**Custom Resource & Controllers:**](https://kubernetes.io/docs/concepts/extend-kubernetes/api-extension/custom-resources/)Official Doc Custom Resources in Kubernetes are like adding your own new types of objects. Just like you have built-in objects like Pods, Deployments, and Services, you can create your own. This is useful when you want to introduce new concepts or configurations that Kubernetes doesn't already know about.
Custom Controllers are like the rules or instructions for what to do with your new custom resources. They watch for any changes to your custom resources and then make things happen in response.
[**Custom Schedulers:**](https://banzaicloud.com/blog/k8s-custom-scheduler/)Blog By default, Kubernetes uses a default scheduler to assign pods to nodes. However, you might have specific scheduling requirements that aren't addressed by the default scheduler. In such cases, you can create a custom scheduler that can coexist with the default scheduler. You can then specify which scheduler to use for each pod. Custom schedulers allow you to implement complex, custom scheduling logic that might be unique to your application's needs.
## Extending Kubernetes
The Kubernetes Operator pattern is a way to extend the capabilities of a Kubernetes cluster. It allows you to automate tasks that you would usually do manually.
An Operator is basically a set of custom k8s resources and a custom controller. The custom resources define what you want to happen, like the settings or features you want for an application. The custom controller watches these resources and makes sure the actual state matches what you've defined.
For example, let's say you have a database. Normally, you'd have to manually set it up, scale it, and handle backups. With an Operator, you can automate all these tasks. You just tell the Operator what you want by setting some custom resources, and it takes care of the rest.
One real-world example is the Prometheus Operator. If you want to set up Prometheus for monitoring in your cluster, you'd usually have to do a lot of manual work. The Prometheus Operator automates this. You just set your desired settings in a custom resource, and the Operator sets up and manages Prometheus for you.
**Kubernetes Operators:** [Kubernetes Operators Explained With Examples](https://blog.sparkfabrik.com/en/what-are-kubernetes-operators)
**Operator Framework:** [Operator Framework](https://operatorframework.io/)
**Python Operator Framework** [Kopf](https://github.com/nolar/kopf)
**CRD Framework** [Kubebuilder](https://github.com/kubernetes-sigs/kubebuilder)
**Custom Admission Webhooks** [Simple Kubernetes Admission Webhook](https://slack.engineering/simple-kubernetes-webhook/)
## Learn Kubernetes Templating Tools
Helm and Kustomize are both tools that are used to manage Kubernetes manifests. They are similar in many ways but have some key differences.
Helm is a package manager for Kubernetes that allows users to easily install, manage, and upgrade applications on a Kubernetes cluster. It uses a concept called "charts" which are pre-configured sets of Kubernetes resources that can be easily deployed, upgraded, and rolled back.
Kustomize, on the other hand, is a tool that allows users to customize and configure existing Kubernetes manifests. It uses a concept called "patches" which can be applied to existing manifests to customize them for different environments and use cases. Unlike Helm, Kustomize does not include built-in support for versioning and rollback, and does not have a concept of "packages" or "repositories".
- [Learn to Create Helm Chart From Scratch](https://devopscube.com/create-helm-chart/)Hands-On Blog
- [Kuztomize Crash Course](https://techiescamp.com/p/kubernetes-kustomize-crash-course)Free Course
- [Making the most out of Helm templates](https://blog.palark.com/advanced-helm-templating/)Blog## Kubernetes Deployment Tools (GitOps Based)
GitOps is a technical practice that uses Git as a single source of truth for declarative infrastructure and application code.
- [Guide to GitOps](https://www.weave.works/technologies/gitops/)Official Doc
Some popular GitOps-based tools for deploying applications to Kubernetes clusters are:
- [Argo CD](https://argo-cd.readthedocs.io/en/stable/)Official Doc
- [Argo Rollouts](https://argo-rollouts.readthedocs.io/en/stable/)Official Doc
- [FluxCD](https://fluxcd.io/)Official Doc
- [JenkinsX](https://jenkins-x.io/)Official Doc## Learn Kubernetes Production Best Practices
- [Learn About 12 Factor Apps](https://12factor.net/) Official Guide
- [Production Readiness Checklist](https://learnk8s.io/production-best-practices)Blog
- [Recycling Kubernetes Nodes - Yelp](https://engineeringblog.yelp.com/2023/01/recycling-kubernetes-nodes.html)Blog## Understand Capacity Planning
Capacity planning is a key aspect of Kubernetes implementation. It's essential for cost savings, performance, resource allocation, scalability, and optimization, among other things.
- [Kubernetes Node Capacity](https://www.densify.com/kubernetes-autoscaling/kubernetes-node-capacity/) Blog
- [Rightsize the requests](https://sysdig.com/blog/kubernetes-capacity-planning/)Blog
- [Kubernetes Instance Calculator](https://learnk8s.io/kubernetes-instance-calculator)Tool & Doc
- [Saving Millions of Dollars by Bin-Packing ClickHouse Pods in AWS EKS](https://clickhouse.com/blog/packing-kubernetes-pods-more-efficiently-saving-money)Blog## Real-World Kubernetes Case Studies
If you do not have real-world Kubernetes experience, it is better to read case studies of other companies using kubernetes.
- [List of Kubernetes User Case Studies](https://kubernetes.io/case-studies/)Official Case Studies
- [Scheduling 300,000 Kubernetes Pods in Production Daily](https://www.youtube.com/watch?v=wjy35HfIP_k) Video
- [How OpenAI Scaled Kubernetes to 7,500 Nodes](https://openai.com/blog/scaling-kubernetes-to-7500-nodes/)Blog
- [Testing 500 Pods Per Node](https://cloud.redhat.com/blog/500_pods_per_node)Blog
- [Dynamic Kubernetes Cluster Scaling at Airbnb](https://medium.com/airbnb-engineering/dynamic-kubernetes-cluster-scaling-at-airbnb-d79ae3afa132)Blog
- [Scaling 100 to 10,000 pods on Amazon EKS](https://aws.amazon.com/blogs/containers/scale-from-100-to-10000-pods-on-amazon-eks)Blog
- [Kubernetes Infrastructure At Medium](https://medium.engineering/kubernetes-infrastructure-at-medium-d9e2444932ef)Blog
- [EKS architecture to improve resiliency](https://aws.amazon.com/blogs/containers/life360s-journey-to-a-multi-cluster-amazon-eks-architecture-to-improve-resiliency/)Blog
- [Scaling Amazon EKS to thousands of nodes](https://aws.amazon.com/blogs/containers/mobileyes-journey-towards-scaling-amazon-eks-to-thousands-of-nodes/)Blog## Kubernetes Failures/Learnings
- [Learn From Kubernetes Failure Stories](https://k8s.af/) List of Blogs
- [Reddit: The Pi-Day Outage](https://www.reddit.com/r/devops/comments/11zvig0/you_broke_reddit_the_piday_outage/)Blog
- [How a Production Outage Was Caused Using Kubernetes Pod Priorities](https://grafana.com/blog/2019/07/24/how-a-production-outage-was-caused-using-kubernetes-pod-priorities/)Blog## AWS EKS Resources
- [EKS Workshop](https://www.eksworkshop.com/)
- [EKS Best Practices](https://aws.github.io/aws-eks-best-practices/)
- [EKS Hardening](https://github.com/aws-samples/hardeneks)
- [EKS Helm Charts](https://github.com/aws/eks-charts)
- [EKS Blueprints](https://aws-quickstart.github.io/cdk-eks-blueprints/)