Kubernetes AI workloads
Open Source KServe AI Inference Platform Becomes CNCF Project
The CNCF adopts KServe to strengthen cloud-native AI inference on Kubernetes as platforms like Red Hat OpenShift AI expand model-as-a-service capabilities ...
Google Extends Kubernetes Service to Safely Run Agentic AI Workloads
At KubeCon + CloudNativeCon North America 2025, Google unveiled major GKE upgrades — including an AI sandbox, inference gateway, pod snapshots, and 130,000-node clusters — to optimize and secure agentic AI workloads ...
GPU Resource Management for Kubernetes Workloads: From Monolithic Allocation to Intelligent Sharing
AI and ML workloads in Kubernetes are evolving fast—but traditional GPU allocation leads to massive waste and inefficiency. Learn how intelligent GPU allocation, leveraging technologies like MIG, MPS, and time-slicing, enables smarter, ...
Ashfaq Munshi | | AI infrastructure optimization, AI workload orchestration, AI/ML GPU efficiency, GPU cost efficiency, GPU efficiency in AI workloads, GPU overprovisioning, GPU partitioning technologies, GPU resource allocation strategies, GPU resource management, GPU sharing in Kubernetes, GPU time-slicing, GPU utilization optimization, GPU workload rightsizing, intelligent GPU allocation, Kubernetes AI workloads, Kubernetes GPU performance, Kubernetes GPU scheduling, multi-instance GPU, multi-process service, NVIDIA MIG, NVIDIA MPS
Fitting Square Kubernetes Into the Round AI-Native Apps
Kubernetes tamed cloud-native workloads, but AI-native apps push its limits. Can it evolve for GPU-first, data-intensive AI — or is it time for new control planes? ...
Alan Shimel | | AI control plane, AI infrastructure, AI pipelines Kubernetes, AI-native applications, cloud-native vs AI-native, container orchestration AI, distributed training orchestration, GPU scheduling, inference at scale, internal developer platforms, Kubeflow, KubeRay, kubernetes, Kubernetes AI workloads, Kubernetes future, Kubernetes limitations, Kubernetes vs AI, platform engineering, Ray on Kubernetes, Volcano scheduler

