Deploying on Kubernetes doesn't equal disaster recovery. My MSc research showed this clearly—comparing AWS EKS and GKE recovery scenarios with Velero backups taught invaluable, practical lessons.
Early in 2023, challenged by rising Kubernetes issues in production environments, I crafted an audit methodology to diagnose clusters, identify misconfigurations, and establish best practices. Delivered by summer, it enabled clients to transform reliability and performance.
Sometimes technical challenges just come out of nowhere—and ruin your day (or months). Let me tell you the epic tale of how a sudden DockerHub limitation threw my team into a mad scramble, some detective work and Bash scripting sessions.
In summer 2015, I stumbled upon TR-069 technology, unaware it would define my career. Today, GenieACS—an open-source ACS managing CPE devices—is central to my journey. I've contributed Docker-based solutions and simplified deployment with Docker Compose and Kubernetes.
A colleague spotted a strange sawtooth pattern in Grafana. Digging deeper into Kubernetes nodes pointed me toward suspiciously high disk I/O wait times linked to EFS. How was it solved?