Container/Kubernetes Management
When Your Cluster Won’t Sit Still: The Hidden Cost of Kubernetes Autonomy During Incidents
I’ve spent the better part of the last few years on the receiving end of Kubernetes pages, both as an operator and as someone building tooling for platform teams. The pattern I’ve ...
Pod Disruption Budgets: A Field Guide to What Actually Works
PodDisruptionBudgets are simple to write, easy to misuse, and cause more “why won’t this node drain?” confusion than any other Kubernetes primitive. After tracing too many node lifecycle automation problems ...
DevZero Launches Automation Platform to Dynamically Rightsize Kubernetes Clusters
DevZero today launched an autonomous infrastructure optimization platform for Kubernetes clusters based on a profiler that continuously monitors clusters, nodes, and individual workloads to build statistical models of demand for resources. Company ...
Stop Wasting GPU Budget: Autoscaling AI Inference on Kubernetes with KEDA
The rush to deploy Large Language Models (LLMs) and generative AI has created a massive infrastructure bottleneck. Platform engineering teams are spinning up expensive GPU node pools on Kubernetes, but they are ...
Ten Years of the Operator Pattern: What We Got Right, What We’d Change
CoreOS introduced the operator pattern in November 2016, and nearly a decade later operators are everywhere. Almost every CNCF graduated project ships one, every database vendor offers one, and every platform team ...
Red Hat Expands OpenShift Application Development Environment
Red Hat this week added a bevy of capabilities to its OpenShift platform, including support for live migration of virtual machines and the general availability of a set of hardened container ...
The Questions Every Team Asks About Docker Sandboxes
Docker Sandboxes launched in March 2026. Since then, I’ve heard the same questions at meetups, on Slack, and during Docker Captain briefings. Instead of writing another overview piece, I want to answer ...
Kubernetes in Production: Where Platform Decisions Break Down
Kubernetes is often described as “free,” but that assumption falls apart in production. What looks like a complete platform is only a foundation. Everything required to run real workloads reliably sits outside ...
Solo.io Extends kagent Runtime to NemoClaw Governance Framework for AI Agents
Solo.io this week added support for the open source NemoClaw framework for safely deploying artificial intelligence (AI) agents in a kagent runtime environment on Kubernetes that is being advanced under the auspices ...
Trilio Extends Disaster Recovery Reach to Red Hat OpenShift Virtualization
Trilio is making available a technology preview of an instance of its disaster recovery (DR) platform that supports Red Hat OpenShift Virtualization, which enables IT teams to encapsulate virtual machines in a ...

