Kubernetes News

Kubernetes

NVIDIA Dynamo Snapshot Tackles Kubernetes AI Cold-Start Problem

NVIDIA's Dynamo Snapshot reduces Kubernetes AI inference cold-start times, leveraging CRIU and GPU Memory Service for sub-5-second deployment speed.

by Timothy Morano
May 28, 2026

Kubernetes

NVIDIA Open-Sources Slinky to Run Slurm GPU Workloads on Kubernetes

NVIDIA's Slinky project enables running Slurm clusters on Kubernetes, already deployed on 8,000+ GPU systems for large-scale AI training infrastructure.

by Felix Pinkston
Apr 10, 2026

Kubernetes

NVIDIA MIG Boosts AI Infrastructure ROI by 33% Over Time-Slicing

New NVIDIA benchmarks show Multi-Instance GPU partitioning achieves 1.00 req/s per GPU versus 0.76 for time-slicing in production AI workloads.

by Jessie A Ellis
Mar 26, 2026

Kubernetes

NVIDIA Donates GPU Resource Driver to Kubernetes Open Source Project

NVIDIA transfers critical GPU allocation software to CNCF at KubeCon Europe, marking major shift toward community-governed AI infrastructure.

by Ted Hisokawa
Mar 24, 2026

Kubernetes

NVIDIA Advances AI Infrastructure With Disaggregated LLM Inference on Kubernetes

NVIDIA details new Kubernetes deployment patterns for disaggregated LLM inference using Dynamo and Grove, promising better GPU utilization for AI workloads.

by Terrill Dicki
Mar 23, 2026

Kubernetes

NVIDIA Launches AI Cluster Runtime to Standardize GPU Kubernetes Deployments

NVIDIA's new open-source AI Cluster Runtime project delivers validated, reproducible Kubernetes configurations for GPU clusters, targeting H100 and Blackwell accelerators.

by Ted Hisokawa
Mar 13, 2026

Kubernetes

NVIDIA Run:ai v2.24 Tackles GPU Scheduling Fairness for AI Workloads

NVIDIA's new time-based fairshare scheduling prevents GPU resource hogging in Kubernetes clusters, addressing critical bottleneck for enterprise AI deployments.

by Caroline Bishop
Jan 29, 2026

Kubernetes

Enhancing Kubernetes AI Cluster Stability with NVSentinel

NVIDIA introduces NVSentinel, an open-source tool designed to automate health monitoring and issue remediation in Kubernetes AI clusters, ensuring GPU reliability and minimizing downtime.

by Alvin Lang
Dec 09, 2025

Kubernetes

NVIDIA Grove Simplifies AI Inference on Kubernetes

NVIDIA introduces Grove, a Kubernetes API that streamlines complex AI inference workloads, enhancing scalability and orchestration of multi-component systems.

by Caroline Bishop
Nov 10, 2025

Kubernetes

Kubernetes Embraces Multi-Node NVLink for Enhanced AI Workloads

NVIDIA's GB200 NVL72 introduces ComputeDomains for efficient AI workload management on Kubernetes, facilitating secure, high-bandwidth GPU connectivity across nodes.

by Timothy Morano
Nov 10, 2025

Kubernetes

NVIDIA Enhances AI Inference with Dynamo and Kubernetes Integration

NVIDIA's Dynamo platform now integrates with Kubernetes to streamline AI inference management, offering improved performance and reduced costs for data centers, according to NVIDIA's latest updates.

by James Ding
Nov 10, 2025

Kubernetes

Ray Enhances Scheduling with New Label Selectors

Ray introduces label selectors, enhancing scheduling capabilities for developers, allowing more precise workload placement on nodes. The feature is a collaboration with Google Kubernetes Engine.

by Terrill Dicki
Nov 01, 2025

Kubernetes

NVIDIA Enhances AI Scalability with NIM Operator 3.0.0 Release

NVIDIA's NIM Operator 3.0.0 introduces advanced features for scalable AI inference, enhancing Kubernetes deployments with multi-LLM and multi-node capabilities, and efficient GPU utilization.

by Darius Baruo
Sep 11, 2025

Kubernetes

NVIDIA Boosts AI Factories With DPU-Enhanced Kubernetes Service Proxy

NVIDIA advances AI applications with DPU-accelerated service proxies for Kubernetes, enhancing performance, efficiency, and security for AI clouds according to NVIDIA.

by Caroline Bishop
Jul 07, 2025

Kubernetes

GitHub Rolls Out Actions Runner Controller 0.12.0 with Key Enhancements

GitHub's Actions Runner Controller 0.12.0 introduces support for OpenShift, vault-based secrets, and DinD improvements, enhancing security and reliability for developers.

by Felix Pinkston
Jun 14, 2025

KUBERNETES