GPU News

Gpu

NVIDIA Enhances Vector Search with GPU-Accelerated cuVS for Real-Time Data Retrieval

NVIDIA's cuVS introduces GPU-accelerated vector search, optimizing indexing and retrieval for AI applications. The latest release enhances performance with new algorithms and integrations.

by Ted Hisokawa
Jul 25, 2025

NVIDIA's CUTLASS 4.0: Advancing GPU Performance with New Python Interface

NVIDIA unveils CUTLASS 4.0, introducing a Python interface to enhance GPU performance for deep learning and high-performance computing, utilizing CUDA Tensors and Spatial Microkernels.

by Ted Hisokawa
Jul 18, 2025

NVIDIA's CUTLASS 3.x Enhances GEMM Kernel Design with Modular Abstractions

NVIDIA's CUTLASS 3.x introduces a modular, hierarchical system for GEMM kernel design, improving code readability and extending support to newer architectures like Hopper and Blackwell.

by Caroline Bishop
Jul 17, 2025

NVIDIA Run:ai Enhances AI Model Orchestration on AWS

NVIDIA Run:ai on AWS Marketplace offers a streamlined approach to GPU infrastructure management for AI workloads, integrating with key AWS services to optimize performance.

by Darius Baruo
Jul 16, 2025

NVIDIA Unveils NCCL 2.27: Enhancing AI Training and Inference Efficiency

NVIDIA launches NCCL 2.27 to improve AI workloads with faster GPU communication, lower latency, and enhanced resilience, addressing the demands of modern AI infrastructure.

by Lawrence Jengar
Jul 15, 2025

RAPIDS Introduces GPU Polars Streaming and Unified GNN API Enhancements

Version 25.06 of NVIDIA's RAPIDS suite adds GPU Polars streaming, a unified GNN API, and zero-code-change ML speedups, enhancing Python data science capabilities.

by Tony Kim
Jul 05, 2025

Efficient AI Pipelines: NVIDIA's NeMo Retriever Extraction on a Single GPU

NVIDIA's NeMo Retriever offers a streamlined solution for multimodal document extraction on a single GPU, making AI pipelines more efficient and reducing operational costs.

by Lawrence Jengar
Jun 19, 2025

NVIDIA Enhances Multi-GPU Communication with NCCL 2.26 Release

NVIDIA's NCCL 2.26 introduces performance enhancements, improved monitoring, and quality-of-service features, optimizing multi-GPU and multi-node communication for AI and HPC applications.

by Darius Baruo
Jun 19, 2025

Aethir and Bitfinex Host Insightful AMA on Decentralized GPU Infrastructure

Aethir and Bitfinex held an AMA session exploring decentralized GPU infrastructure, its impact on AI and gaming, and future plans involving the $ATH token.

by Felix Pinkston
Jun 06, 2025

Aethir's Decentralized Infrastructure Gains Spotlight in Bitfinex AMA

Aethir's decentralized infrastructure for GPU computing was discussed in a recent AMA hosted by Bitfinex and BitFreedomGus, highlighting its impact on the AI and gaming sectors.

by Timothy Morano
Jun 06, 2025

Enhancing Molecular Dynamics with NVIDIA's Multi-Process Service

NVIDIA's Multi-Process Service optimizes GPU usage in molecular dynamics simulations, boosting throughput by running concurrent processes on a single GPU.

by Alvin Lang
Jun 04, 2025

Kaggle Competition Winner Reveals Stacking Strategy with cuML

Kaggle Grandmaster Chris Deotte shares insights on winning the April 2025 Kaggle competition using stacking with cuML, leveraging GPU acceleration for fast and efficient modeling.

by Rongchai Wang
May 22, 2025

Harnessing AI's Potential with Decentralized Compute Networks

Explore how decentralized compute networks address the rising demand for AI applications, offering scalable solutions through consumer-grade GPUs. Learn about real-world use cases and industry partnerships.

by Jessie A Ellis
May 18, 2025

NVIDIA's cuEmbed Boosts GPU Performance for Embedding Lookups

NVIDIA unveils cuEmbed, a CUDA library that significantly enhances embedding lookups on GPUs, promising improved performance for recommendation systems and other applications.

by Caroline Bishop
May 16, 2025

Enhancing Polars GPU Parquet Reader Performance with Chunked Reading and UVM

Explore how the Polars GPU Parquet reader boosts performance using chunked reading and Unified Virtual Memory (UVM), enhancing data processing capabilities for large datasets.

by Ted Hisokawa
Apr 11, 2025