Machine Learning Infrastructure News | Blockchain.News

MACHINE LEARNING INFRASTRUCTURE

NVIDIA cuda.compute Brings C++ GPU Performance to Python Developers
Machine Learning Infrastructure

NVIDIA cuda.compute Brings C++ GPU Performance to Python Developers

NVIDIA's new cuda.compute library topped GPU MODE benchmarks, delivering CUDA C++ performance through pure Python with 2-4x speedups over custom kernels.

NVIDIA Achieves 36% Training Speedup for 256K Token AI Models
Machine Learning Infrastructure

NVIDIA Achieves 36% Training Speedup for 256K Token AI Models

NVIDIA's NVSHMEM integration with XLA compiler delivers up to 36% faster training for long-context LLMs, enabling efficient 256K token sequence processing on JAX.