KEDA

Stop Wasting GPU Budget: Autoscaling AI Inference on Kubernetes with KEDA

The rush to deploy Large Language Models (LLMs) and generative AI has created a massive infrastructure bottleneck. Platform engineering teams are spinning up expensive GPU node pools on Kubernetes, but they are ...

Pavan Madduri | June 8, 2026 | AI Inference, autoscaling, GPU Scaling, KEDA, kubernetes

CNCF, cloud native, NVIDIA, AI, Peritus microservices

Deploying Docker AI Agents on OCI and OKE

This guide details the architectural transition of AI agents from experimental scripts to "first-class production workloads" using Oracle Cloud Infrastructure (OCI) and Oracle Kubernetes Engine (OKE). It emphasizes a zero-trust, scalable approach ...

Pavan Madduri | May 13, 2026 | Agentic Architecture, AI, AI agents, containerization, Data Minimization, docker, Event-Driven Autoscaling, GitOps, infrastructure as code, kagent, KEDA, Kubernetes CRD, Kyverno, LLM Inference, MCP server, Model Context Protocol, oci, OCI Generative AI, OCI Vault, OCIR, OKE, OpenTelemetry, Oracle Kubernetes Engine, Production Workloads., Terraform, Virtual Nodes, Zero-Trust Security

Techstrong TV

Click full-screen to enable volume control

Watch latest episodes and shows

Tech Field Day Events

UPCOMING WEBINARS

CloudNativeNow.com
DevOps.com
Error

Modernizing Manufacturing: How to Move from Legacy Infrastructure to Cloud-Ready Operations

18 August 2026

Migrating Apache Solr Workloads to Amazon OpenSearch Service

28 July 2026

From Pilot to Production: AI that Delivers Business Outcomes

27 July 2026

DevOps in the Age of AI Native

17 August 2026

How the Biggest Banks Stay Fast, Nimble, and Multi-Vendor

5 August 2026

The Emergence of AI in Performance Engineering

30 July 2026

RSS Error: A feed could not be found at `https://securityboulevard.com/webinars/feed/`; the status code is `403` and content-type is `text/html; charset=UTF-8`