GPU COMPUTING
HIVE Stock Drops 11% After Announcing $75M Raise for AI Data Centers
HIVE Digital plans a zero-interest notes offering to fund GPU expansion as Bitcoin miners accelerate their pivot toward AI infrastructure.
NVIDIA Launches ALCHEMI Toolkit for GPU-Accelerated Chemistry Simulations
NVIDIA releases ALCHEMI Toolkit enabling researchers to build custom atomistic simulation workflows with up to 33x speedups for batched molecular dynamics on GPUs.
NVIDIA nvCOMP Cuts AI Training Checkpoint Costs by $56K Monthly
New GPU compression library reduces LLM training checkpoint sizes by 25-40%, saving teams up to $222K monthly on large-scale model training infrastructure.
NVIDIA Open-Sources Slinky to Run Slurm GPU Workloads on Kubernetes
NVIDIA's Slinky project enables running Slurm clusters on Kubernetes, already deployed on 8,000+ GPU systems for large-scale AI training infrastructure.
NVIDIA Scales AlphaFold-Multimer for Proteome-Wide Protein Complex Prediction
NVIDIA researchers detail GPU-accelerated pipeline extending AlphaFold Database with large-scale protein complex predictions using H100 clusters and optimized workflows.
NVIDIA Unveils Mission Control Software for Blackwell AI Supercomputers
NVIDIA's Mission Control bridges rack-scale GPU hardware with AI workload schedulers, enabling topology-aware job placement on GB200 and GB300 NVL72 systems.
NVIDIA Nsight Tools Slash Vision AI Decode Times by 85% in New VC-6 Batch Mode
NVIDIA's optimized VC-6 batch mode achieves submillisecond 4K image decoding, delivering up to 85% faster per-image processing for AI training pipelines.
NVIDIA GH200 Hits 4.6 Microsecond Latency in Trading Benchmark
NVIDIA's Grace Hopper Superchip achieves record single-digit microsecond inference times in STAC-ML benchmark, challenging FPGA dominance in algorithmic trading.
NVIDIA CUDA 13.2 Update: Latest CUDA News Today (Ampere & Ada GPUs)
CUDA 13.2 extends tile-based GPU programming to older architectures, adds Python profiling tools, and delivers up to 5x speedups with new Top-K algorithms.
NVIDIA Donates GPU Resource Driver to Kubernetes Open Source Project
NVIDIA transfers critical GPU allocation software to CNCF at KubeCon Europe, marking major shift toward community-governed AI infrastructure.
NVIDIA CCCL 3.1 Adds Floating-Point Determinism Controls for GPU Computing
NVIDIA's CCCL 3.1 introduces three determinism levels for parallel reductions, letting developers trade performance for reproducibility in GPU computations.
NVIDIA cuda.compute Brings C++ GPU Performance to Python Developers
NVIDIA's new cuda.compute library topped GPU MODE benchmarks, delivering CUDA C++ performance through pure Python with 2-4x speedups over custom kernels.
NVIDIA Launches GPU-Accelerated Endpoints for Moonshot AI's Kimi K2.5 Model
NVIDIA now offers developers free GPU-accelerated API access to Kimi K2.5, a 1T-parameter multimodal AI model with 384 experts and a 262K context length.
NVIDIA Megatron Core Gets Dynamic-CP Update With 48% Training Speedups
NVIDIA releases Dynamic Context Parallelism for Megatron Core, achieving up to 1.48x faster LLM training and 35% gains in industrial deployments.
FlashAttention-4 Hits 1,605 TFLOPS on NVIDIA Blackwell GPUs
NVIDIA's FlashAttention-4 achieves 71% hardware efficiency on Blackwell chips, delivering 3.6x speedup over FA2 for AI training workloads.