NVIDIA
Nexa.ai's Hyperlink Enhances AI Search on NVIDIA RTX PCs
Nexa.ai's Hyperlink app now boosts productivity on NVIDIA RTX PCs, offering faster AI-driven search and contextual insights for efficient content creation and research.
NVIDIA NCCL 2.28 Revolutionizes GPU Communication with New Device API
NVIDIA's latest NCCL 2.28 release introduces a device API, enhancing communication and computation fusion for GPU networks, boosting performance and efficiency.
NVIDIA's Gen AI Super-resolution Enhances Weather Predictions with Efficient Models
NVIDIA's Earth-2 platform leverages Gen AI to optimize weather prediction models, offering scalable solutions through CorrDiff, significantly improving efficiency and reducing computational costs.
NVIDIA's Breakthrough: 4x Faster Inference in Math Problem Solving with Advanced Techniques
NVIDIA achieves a 4x faster inference in solving complex math problems using NeMo-Skills, TensorRT-LLM, and ReDrafter, optimizing large language models for efficient scaling.
NVIDIA Grove Simplifies AI Inference on Kubernetes
NVIDIA introduces Grove, a Kubernetes API that streamlines complex AI inference workloads, enhancing scalability and orchestration of multi-component systems.
Kubernetes Embraces Multi-Node NVLink for Enhanced AI Workloads
NVIDIA's GB200 NVL72 introduces ComputeDomains for efficient AI workload management on Kubernetes, facilitating secure, high-bandwidth GPU connectivity across nodes.
NVIDIA Enhances AI Inference with Dynamo and Kubernetes Integration
NVIDIA's Dynamo platform now integrates with Kubernetes to streamline AI inference management, offering improved performance and reduced costs for data centers, according to NVIDIA's latest updates.
NVIDIA Introduces Interactive AI Agent for Enhanced Machine Learning Efficiency
NVIDIA unveils an AI agent that accelerates machine learning tasks using GPU technology, simplifying workflows and boosting efficiency through modular design and language model integration.
NVIDIA's ComputeEval 2025.2 Challenges LLMs with Advanced CUDA Tasks
NVIDIA expands ComputeEval with 232 new CUDA challenges, testing LLMs' capabilities in complex programming tasks. Discover the impact on AI-assisted coding.
NVIDIA Leaders Honored with Queen Elizabeth Prize for Engineering
NVIDIA's Jensen Huang and Bill Dally receive the Queen Elizabeth Prize for their pivotal work in AI and accelerated computing, marking a significant contribution to modern engineering.
NVIDIA's cuVS Boosts Faiss Vector Search Efficiency with GPU Acceleration
NVIDIA's cuVS integration with Faiss enhances GPU-accelerated vector search, offering faster index builds and lower search latency, crucial for managing large datasets.
NVIDIA Enhances PyTorch with NeMo Automodel for Efficient MoE Training
NVIDIA introduces NeMo Automodel to facilitate large-scale mixture-of-experts (MoE) model training in PyTorch, offering enhanced efficiency, accessibility, and scalability for developers.
Enhancing Biology Transformer Models with NVIDIA BioNeMo and PyTorch
NVIDIA's BioNeMo Recipes simplify large-scale biology model training with PyTorch, improving performance using Transformer Engine and other advanced techniques.
NVIDIA's R²D²: Revolutionizing Robot Manipulation with Advanced Task and Motion Planning
NVIDIA's R²D² advances robot manipulation with perception-guided task and motion planning, integrating vision, language, and GPU acceleration for enhanced adaptability in dynamic environments.
NVIDIA's OpenFold3 NIM: Advancing Biomolecular Structure Prediction
NVIDIA introduces OpenFold3 NIM, a transformative AI tool for biomolecular structure prediction, offering enhanced speed and accuracy for biopharma and biotech industries.