TAG

Debugging

Kubernetes Job Controller - Drawn image of a postman drowning in post

2026-02-04

How We Broke the Kubernetes Job Controller with 7,500 Crashlooping Pods 

Read about Anton's experience on how they broke Kubernetes Job Controller with 7500 Crashlooping Pods.
Author: Konstantin Lanzet License: CC BY-SA 3.0 DEED

2023-11-08

From crashdump to clarity: A kernel debugging adventure

When making mistakes in operating system kernel code there is often no recovery apart from a full system reboot. Unlike when userspace processes crash there is nobody to pick up the pieces after a kernel crash so a full system reboot is the only option.