TAG
Debugging
2026-02-04
How We Broke the Kubernetes Job Controller with 7,500 Crashlooping Pods
Read about Anton's experience breaking the Kubernetes Job Controller with 7,500 crashlooping pods.
2023-11-08
From crashdump to clarity: A kernel debugging adventure
Mistakes in operating system kernel code often leave no way to recover apart from a full system reboot. Unlike a userspace process crash, a kernel crash leaves nobody to pick up the pieces.

