An open API service indexing awesome lists of open source software.

https://github.com/netdata/awesome-troubleshooting

Great examples of real world troubleshooting case studies, tools and resources.
https://github.com/netdata/awesome-troubleshooting

List: awesome-troubleshooting

Last synced: 14 days ago
JSON representation

Great examples of real world troubleshooting case studies, tools and resources.

Awesome Lists containing this project

README

          

# awesome-troubleshooting

[![Awesome](https://cdn.jsdelivr.net/gh/sindresorhus/awesome@d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome)

Great examples of real world troubleshooting case studies, tools and resources.

--------------------

## Table of contents
- [Videos](#videos)
- [Blog Posts](#blog-posts)

## Videos

**[`^ back to top ^`](#awesome-troubleshooting)**

- [Mentorship Session: Stress-ng: How to Stress Test Your Computer (and Possibly Break It)](https://www.youtube.com/watch?v=gD3Hn02VSHA): The stress-ng tool is designed to stress test kernels and hardware. The presentation describes the various ways stress-ng can be used to exercise a system as well as the testing concepts that shape the project.
- [Kubernetes throttling? It doesn't have to suck!](https://youtu.be/KCFFZ_qfKXk): Kubernetes has a bad habit of throttling CPU resources—leading to severely degraded performance or spiralling costs for extra, unnecessary infrastructure.
We put a K8s cluster in the lab to monitor how and when CPU throttling happens.

## Blog Posts

**[`^ back to top ^`](#awesome-troubleshooting)**

- [Debugging Latency](https://exa.net.uk/latency-debugging/): For the past couple of weeks, our active monitoring detected several occasions where the latency of traffic going through our SurfProtect service increased significantly enough to be worthy of investigation.
- [Debugging Latency – Redux](https://exa.net.uk/debugging-latency-redux/): Not so long ago, we posted an [article](https://exa.net.uk/latency-debugging/) that detailed an issue impacting SurfProtect service stability. We described our investigation into the cause of the problem and celebrated its resolution.