GKE

Evolving Kubernetes and GKE for Gen AI Inference
The combination of foundational improvements in open-source Kubernetes and powerful, managed solutions on GKE represents a significant leap forward for any organization working with generative AI ...
Akshay Ram | | AI aware load balancing, AI aware routing, benchmark database, cloud-native applications, community driven effort, container orchestration, data driven decisions, developer velocity, Evolving Kubernetes, Gen AI inference, GKE, GKE features, GKE Inference Quickstart, GPUs, Inference Gateway, inference perf project, intelligent scheduling, Kubernetes primitives, KV cache utilization, large models, latency vs throughput curves, microservices, model replica routing, open source Kubernetes, request response patterns, scaling, seamless portability, specialized hardware, standardized benchmarking, tail latency reduction, throughput increase, total cost of ownership, TPU serving stack, TPUs, user experience, vLLM library

The AppDev Tech Field Day Report: GCP Cloud Run Demo and a GKE Comparison
Cloud Run and Google Kubernetes Engine are powerful tools for deploying containerized applications. But why exactly does GCP need two products? ...

Google Unfurls Managed Kubernetes Service for the Enterprise
Google this week launched an enterprise edition of its managed Google Kubernetes Engine (GKE) service through which it will manage fleets of clusters in addition to applying customer configurations and policy guardrails ...

Google Adds Additional Storage Service for GKE
As part of a broader expansion of its cloud storage services, Google is extending its Filestore Enterprise for accessing NFS-based storage to be accessible by Google Kubernetes Engine (GKE) clusters running on ...

Despite Google’s ‘Autopilot,’ Kubernetes is Still Hard
We are, obviously, a very long way from being able to put Kubernetes cluster management on “autopilot,” but Google’s new platform ostensibly moves us closer to achieving that goal. Google Kubernetes Engine ...

Google Brings Confidential Computing to GKE Service
Google announced today that it is making available confidential computing nodes that encrypt data while it is processing in beta on the managed Google Kubernetes Engine (GKE) service. At the same time, ...

Final Tech Previews Offer an Almost-Complete Glimpse of Rancher 2.0
DevOps teams or anyone who manages containers can now get their hands dirty testing the final tech previews of container management platform Rancher 2.0 shortly before its launch in a few weeks ...

What Kubernetes on Azure Means for Users and Docker
Valentine’s Day has come and gone, but Microsoft is continuing to shower the Docker community with love. The company recently announced official support for Kubernetes on Azure Container Service, a move that ...

Platform9 Adds Kubernetes SaaS Solution. How Novel Is It?
Do we need more Kubernetes as a service? Platform9 thinks so. This week, the company introduced what it calls the first implementation of Kubernetes as a managed service. Kubernetes, of course, is ...