GPU
NVIDIA (NVDA) Teams Up With AI Clouds to Scale Token Factories
NVIDIA partners with AI clouds to deploy large-scale AI factories, leveraging a new revenue-sharing model to meet soaring compute demand.
NVIDIA GQE Sets Benchmark for GPU-Accelerated Query Engines
NVIDIA's GPU Query Engine (GQE) leverages Grace Blackwell hardware to deliver 7.5x faster database performance compared to CPUs.
NVIDIA Adds Vulkan Descriptor Heaps Support, Boosting GPU Performance
NVIDIA integrates VK_EXT_descriptor_heap in drivers and tools, enhancing Vulkan's resource binding efficiency for gaming and ray tracing.
AI Data Processing Shifts to GPUs: Key Trends and Impacts
AI pipelines are increasingly GPU-driven as inference-heavy workloads handle unstructured data, reshaping data processing and infrastructure demands.
NVIDIA Dynamo Snapshot Tackles Kubernetes AI Cold-Start Problem
NVIDIA's Dynamo Snapshot reduces Kubernetes AI inference cold-start times, leveraging CRIU and GPU Memory Service for sub-5-second deployments.
Bitcoin Miner HIVE Plans $3.5B AI Data Center in Canada
HIVE Digital Technologies announces a 320-MW AI-focused data center near Toronto, expanding its high-performance computing footprint.
NVIDIA and IREN Partner on 5GW AI Infrastructure Expansion
NVIDIA (NVDA) and IREN (IREN) announce a strategic alliance to deploy up to 5GW of AI infrastructure, including a potential $2.1B stock investment.
NVIDIA NVbandwidth Tool Gets Multi-Node Support for AI Infrastructure Testing
NVIDIA's NVbandwidth benchmarking tool now supports multi-node GPU clusters, enabling developers to measure bandwidth across NVLink connections at 397+ GB/s.
NVIDIA Vera Rubin Platform Hits Full Production With Seven New AI Chips
NVIDIA launches Vera Rubin platform with seven chips in full production, promising 10x inference cost reduction versus Blackwell. Partners shipping H2 2026.
NVIDIA Run:ai GPU Fractioning Delivers 77% Throughput at Half Allocation
NVIDIA and Nebius benchmarks show GPU fractioning achieves 86% user capacity on 0.5 GPU allocation, enabling 3x more concurrent users for mixed AI workloads.
NVIDIA Secures Massive Meta AI Deal for Millions of Blackwell and Rubin GPUs
Meta commits to a multiyear NVIDIA partnership to deploy millions of GPUs, Grace CPUs, and Spectrum-X networking across hyperscale AI data centers.
NVIDIA GPUs Slash Scientific Computing Times From 9 Months to 4 Hours
NVIDIA accelerated computing enables real-time experiment steering at major research facilities, reducing data analysis from months to hours using GPU-powered workflows.
IBM Achieves 100x Speedup With Quantum-GPU Hybrid Computing
IBM and partners demonstrate a quantum-centric supercomputing breakthrough, combining QPUs with AMD and NVIDIA GPUs for 100x performance gains in chemistry simulations.
NVIDIA Run:ai v2.24 Tackles GPU Scheduling Fairness for AI Workloads
NVIDIA's new time-based fairshare scheduling prevents GPU resource hogging in Kubernetes clusters, addressing a critical bottleneck for enterprise AI deployments.
NVIDIA ALCHEMI Toolkit-Ops Enhances AI-Driven Chemistry Simulations
NVIDIA unveils its ALCHEMI Toolkit-Ops, a GPU-accelerated solution aimed at transforming AI-driven simulations in chemistry and materials science.