NVIDIA
NVIDIA Run:ai Enhances AI Model Orchestration on AWS
NVIDIA Run:ai on AWS Marketplace offers a streamlined approach to GPU infrastructure management for AI workloads, integrating with key AWS services to optimize performance.
NVIDIA Dynamo Expands AWS Support for Enhanced AI Inference Efficiency
NVIDIA Dynamo now supports AWS services, offering developers enhanced efficiency for large-scale AI inference. The integration promises performance improvements and cost savings.
NVIDIA Extends Deadline for Project G-Assist Plug-In Hackathon
NVIDIA extends the deadline for its Project G-Assist Plug-In Hackathon, offering participants a chance to win top-tier GeForce RTX hardware and more.
NVIDIA Riva TTS Enhances Multilingual Speech and Voice Cloning
NVIDIA introduces Riva TTS models enhancing multilingual speech synthesis and voice cloning, with applications in AI agents, digital humans, and more, featuring advanced architecture and preference alignment.
NVIDIA CEO Jensen Huang Advocates for AI Advancement in U.S. and China
NVIDIA's CEO Jensen Huang emphasizes AI's global benefits during meetings in Washington, D.C. and Beijing, highlighting initiatives to boost AI infrastructure and innovation.
Enhancing AI Training: NVIDIA's NCCL Advances Cross-Data Center Communication
NVIDIA's NCCL introduces enhanced cross-data center communication features, optimizing AI training by leveraging network topology awareness and supporting multiple data centers with minimal modifications.
NVIDIA Unveils NCCL 2.27: Enhancing AI Training and Inference Efficiency
NVIDIA launches NCCL 2.27 to improve AI workloads with faster GPU communication, lower latency, and enhanced resilience, addressing the demands of modern AI infrastructures.
AI-Powered Climate Models: Revolutionizing Climate Forecasting with ClimSim-Online
NVIDIA introduces ClimSim-Online, an AI-powered framework revolutionizing climate modeling by integrating machine learning with traditional climate simulators, enhancing speed and accuracy in climate forecasts.
NVIDIA NeMo-RL Utilizes GRPO for Advanced Reinforcement Learning
NVIDIA introduces NeMo-RL, an open-source library for reinforcement learning, enabling scalable training with GRPO and integration with Hugging Face models.
NVIDIA Expands Python Capabilities with CUDA Kernel Fusion Tools
NVIDIA introduces cuda.cccl, bridging the gap for Python developers by providing essential building blocks for CUDA kernel fusion, enhancing performance across GPU architectures.
NVIDIA's Helix Parallelism Revolutionizes AI with Multi-Million Token Inference
NVIDIA introduces Helix Parallelism, a breakthrough in AI, enabling faster real-time inference with multi-million-token contexts, enhancing performance and user experience.
NVIDIA Boosts AI Factories With DPU-Enhanced Kubernetes Service Proxy
NVIDIA advances AI applications with DPU-accelerated service proxies for Kubernetes, enhancing performance, efficiency, and security for AI clouds according to NVIDIA.
NVIDIA Enhances cuQuantum with Dynamic Gradients and DMRG Primitives
NVIDIA's cuQuantum SDK introduces dynamic gradients, DMRG primitives, and performance improvements, enhancing quantum computing emulations on Tensor Core GPUs.
RAPIDS Introduces GPU Polars Streaming and Unified GNN API Enhancements
NVIDIA's RAPIDS suite version 25.06 unveils new features including GPU Polars streaming, a unified GNN API, and zero-code ML speedups, enhancing Python data science capabilities.
NVIDIA Unveils Data Flywheel Blueprint to Optimize AI Agents
NVIDIA introduces the Data Flywheel Blueprint, a workflow aimed at enhancing AI agents by reducing costs and improving efficiency using automated experimentation and self-improving loops.