Search Results for "gpu"
Efficient AI Pipelines: NVIDIA's NeMo Retriever Extraction on a Single GPU
NVIDIA's NeMo Retriever offers a streamlined solution for multimodal document extraction using a single GPU, enhancing AI pipelines' efficiency and reducing operational costs.
NVIDIA Enhances Multi-GPU Communication with NCCL 2.26 Release
NVIDIA's NCCL 2.26 introduces performance enhancements, improved monitoring, and quality of service features, optimizing multi-GPU and multinode communications for AI and HPC applications.
RAPIDS Introduces GPU Polars Streaming and Unified GNN API Enhancements
NVIDIA's RAPIDS suite version 25.06 unveils new features including GPU Polars streaming, a unified GNN API, and zero-code ML speedups, enhancing Python data science capabilities.
NVIDIA Unveils NCCL 2.27: Enhancing AI Training and Inference Efficiency
NVIDIA launches NCCL 2.27 to improve AI workloads with faster GPU communication, lower latency, and enhanced resilience, addressing the demands of modern AI infrastructures.
NVIDIA's CUTLASS 3.x Enhances GEMM Kernel Design with Modular Abstractions
NVIDIA's CUTLASS 3.x introduces a modular, hierarchical system for GEMM kernel design, improving code readability and extending support to newer architectures like Hopper and Blackwell.
NVIDIA Run:ai Enhances AI Model Orchestration on AWS
NVIDIA Run:ai on AWS Marketplace offers a streamlined approach to GPU infrastructure management for AI workloads, integrating with key AWS services to optimize performance.
NVIDIA's CUTLASS 4.0: Advancing GPU Performance with New Python Interface
NVIDIA unveils CUTLASS 4.0, introducing a Python interface to enhance GPU performance for deep learning and high-performance computing, utilizing CUDA Tensors and Spatial Microkernels.
NVIDIA Enhances Vector Search with GPU-Accelerated cuVS for Real-Time Data Retrieval
NVIDIA's cuVS introduces GPU-accelerated vector search, optimizing indexing and retrieval for AI applications. The latest release enhances performance with new algorithms and integrations.
GitHub to Deprecate GPU Machine Type in Codespaces by August 2025
GitHub announces the deprecation of GPU machine types in Codespaces by August 29, 2025, in line with Azure's NCv3-series VMs retirement. Developers are advised to prepare for this transition.
NVIDIA Unveils Blackwell Ultra GPU: A Leap in AI Factory Technology
NVIDIA introduces the Blackwell Ultra GPU, advancing AI capabilities with increased performance, memory, and efficiency, setting a new standard for AI factories.
NVIDIA Unveils CUDA Toolkit 13.0 Enhancements for Jetson Thor
NVIDIA announces CUDA Toolkit 13.0 for Jetson Thor, featuring a unified Arm ecosystem, enhanced virtual memory, and improved GPU sharing, streamlining development for edge computing.
NVIDIA Introduces GPU Memory Swap to Optimize AI Model Deployment Costs
NVIDIA's GPU memory swap technology aims to reduce costs and improve performance for deploying large language models by optimizing GPU utilization and minimizing latency.