https://github.com/netdata/awesome-troubleshooting
Great examples of real world troubleshooting case studies, tools and resources.
https://github.com/netdata/awesome-troubleshooting
List: awesome-troubleshooting
Last synced: 14 days ago
JSON representation
Great examples of real world troubleshooting case studies, tools and resources.
- Host: GitHub
- URL: https://github.com/netdata/awesome-troubleshooting
- Owner: netdata
- Created: 2022-11-17T13:55:08.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2022-11-17T14:14:25.000Z (over 3 years ago)
- Last Synced: 2025-02-14T08:02:05.024Z (about 1 year ago)
- Size: 1.95 KB
- Stars: 1
- Watchers: 8
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# awesome-troubleshooting
[](https://github.com/sindresorhus/awesome)
Great examples of real world troubleshooting case studies, tools and resources.
--------------------
## Table of contents
- [Videos](#videos)
- [Blog Posts](#blog-posts)
## Videos
**[`^ back to top ^`](#awesome-troubleshooting)**
- [Mentorship Session: Stress-ng: How to Stress Test Your Computer (and Possibly Break It)](https://www.youtube.com/watch?v=gD3Hn02VSHA): The stress-ng tool is designed to stress test kernels and hardware. The presentation describes the various ways stress-ng can be used to exercise a system as well as the testing concepts that shape the project.
- [Kubernetes throttling? It doesn't have to suck!](https://youtu.be/KCFFZ_qfKXk): Kubernetes has a bad habit of throttling CPU resources—leading to severely degraded performance or spiralling costs for extra, unnecessary infrastructure.
We put a K8s cluster in the lab to monitor how and when CPU throttling happens.
## Blog Posts
**[`^ back to top ^`](#awesome-troubleshooting)**
- [Debugging Latency](https://exa.net.uk/latency-debugging/): For the past couple of weeks, our active monitoring detected several occasions where the latency of traffic going through our SurfProtect service increased significantly enough to be worthy of investigation.
- [Debugging Latency – Redux](https://exa.net.uk/debugging-latency-redux/): Not so long ago, we posted an [article](https://exa.net.uk/latency-debugging/) that detailed an issue impacting SurfProtect service stability. We described our investigation into the cause of the problem and celebrated its resolution.