GPU
Render Network Expands with RNP-021 for Enhanced AI and GPU Capabilities
Render Network introduces RNP-021 to support enterprise-grade GPUs, enhancing AI and video rendering capabilities. The initiative aims to meet the growing demand in the GPU market.
Exploring NVIDIA's CDMM Mode for Enhanced Memory Management
NVIDIA introduces Coherent Driver-based Memory Management (CDMM) to improve GPU memory control on hardware-coherent platforms, addressing issues faced by developers and cluster administrators.
Revolutionizing Data Analytics: GPU-Native Velox and NVIDIA cuDF Integration
NVIDIA and IBM collaborate to integrate GPU-native Velox with NVIDIA cuDF, enhancing data analytics performance on platforms like Presto and Apache Spark.
NVIDIA RAPIDS 25.08 Enhances Data Science with New Profiling Tools and Algorithm Support
NVIDIA's RAPIDS 25.08 release introduces new profiling tools for cuML, updates to the Polars GPU engine, and additional algorithm support, enhancing data science accessibility and scalability.
NVIDIA Launches PyNvVideoCodec 2.0 for Enhanced Python Video Processing
NVIDIA's PyNvVideoCodec 2.0 introduces significant enhancements for GPU-accelerated video processing in Python, offering new features for AI, multimedia, and streaming applications.
Enhancing LLM Inference with CPU-GPU Memory Sharing
NVIDIA introduces a unified memory architecture to optimize large language model inference, addressing memory constraints and improving performance.
NVIDIA Introduces GPU Memory Swap to Optimize AI Model Deployment Costs
NVIDIA's GPU memory swap technology aims to reduce costs and improve performance for deploying large language models by optimizing GPU utilization and minimizing latency.
NVIDIA Unveils CUDA Toolkit 13.0 Enhancements for Jetson Thor
NVIDIA announces CUDA Toolkit 13.0 for Jetson Thor, featuring a unified Arm ecosystem, enhanced virtual memory, and improved GPU sharing, streamlining development for edge computing.
NVIDIA Unveils Blackwell Ultra GPU: A Leap in AI Factory Technology
NVIDIA introduces the Blackwell Ultra GPU, advancing AI capabilities with increased performance, memory, and efficiency, setting a new standard for AI factories.
GitHub to Deprecate GPU Machine Type in Codespaces by August 2025
GitHub announces the deprecation of GPU machine types in Codespaces by August 29, 2025, in line with the retirement of Azure's NCv3-series VMs. Developers are advised to prepare for this transition.
NVIDIA Enhances Vector Search with GPU-Accelerated cuVS for Real-Time Data Retrieval
NVIDIA's cuVS introduces GPU-accelerated vector search, optimizing indexing and retrieval for AI applications. The latest release enhances performance with new algorithms and integrations.
NVIDIA's CUTLASS 4.0: Advancing GPU Performance with New Python Interface
NVIDIA unveils CUTLASS 4.0, introducing a Python interface to enhance GPU performance for deep learning and high-performance computing, utilizing CUDA Tensors and Spatial Microkernels.
NVIDIA's CUTLASS 3.x Enhances GEMM Kernel Design with Modular Abstractions
NVIDIA's CUTLASS 3.x introduces a modular, hierarchical system for GEMM kernel design, improving code readability and extending support to newer architectures like Hopper and Blackwell.
NVIDIA Run:ai Enhances AI Model Orchestration on AWS
NVIDIA Run:ai on AWS Marketplace offers a streamlined approach to GPU infrastructure management for AI workloads, integrating with key AWS services to optimize performance.
NVIDIA Unveils NCCL 2.27: Enhancing AI Training and Inference Efficiency
NVIDIA launches NCCL 2.27 to improve AI workloads with faster GPU communication, lower latency, and enhanced resilience, addressing the demands of modern AI infrastructure.