NVIDIA
NVIDIA Enhances Vector Search with GPU-Accelerated cuVS for Real-Time Data Retrieval
NVIDIA's cuVS introduces GPU-accelerated vector search, optimizing indexing and retrieval for AI applications. The latest release enhances performance with new algorithms and integrations.
NVIDIA's NeMo Framework Enables Weekend Training of Reasoning-Capable LLMs
NVIDIA introduces an efficient method to train reasoning-capable language models over a weekend using the NeMo framework, leveraging the Llama Nemotron dataset and LoRA adapters.
Financial Services Revolutionized by Agentic AI for Enhanced Efficiency and Security
Discover how financial services are leveraging agentic AI to boost productivity, enhance security, and improve customer service, according to NVIDIA's insights.
NVIDIA Introduces Advanced Robotic Simulation with Warp and Gaussian Splatting
Explore NVIDIA's innovative approach to robotic simulations using Warp and Gaussian Splatting, facilitating real-time digital twin creation for enhanced robotic perception and interaction.
Dynamic Knowledge Enhances AI Agents with Agentic RAG
Exploring how dynamic knowledge and Agentic RAG elevate AI agents' efficiency and adaptability beyond traditional RAG systems, revolutionizing their real-time decision-making capabilities.
Enhancing Inference Efficiency: NVIDIA's Innovations with JAX and XLA
NVIDIA introduces advanced techniques for reducing latency in large language model inference, leveraging JAX and XLA for significant performance improvements in GPU-based workloads.
NVIDIA NeMo Agent Toolkit Hackathon Showcases Innovative AI Solutions
The NVIDIA NeMo Agent Toolkit Hackathon highlighted groundbreaking AI projects, showcasing the tool's potential in logistics, software development, and travel planning through innovative multi-agent AI workflows.
Together AI Achieves Breakthrough Inference Speed with NVIDIA's Blackwell GPUs
Together AI unveils the world's fastest inference for the DeepSeek-R1-0528 model using NVIDIA HGX B200, enhancing AI capabilities for real-world applications.
UK's Isambard-AI Supercomputer Goes Live, Setting New Standards
The University of Bristol unveils Isambard-AI, the UK's most powerful AI supercomputer, delivering 21 exaflops of performance and ranking among the top in energy efficiency globally.
NVIDIA's CUTLASS 4.0: Advancing GPU Performance with New Python Interface
NVIDIA unveils CUTLASS 4.0, introducing a Python interface to enhance GPU performance for deep learning and high-performance computing, utilizing CUDA Tensors and Spatial Microkernels.
NVIDIA Introduces Safety Measures for Agentic AI Systems
NVIDIA has launched a comprehensive safety recipe to enhance the security and compliance of agentic AI systems, addressing risks such as prompt injection and data leakage.
Enhancing ML Models in Semiconductor Manufacturing with NVIDIA CUDA-X
NVIDIA's CUDA-X Data Science libraries optimize feature engineering in semiconductor manufacturing, enhancing ML model performance and reducing ETL processing time by up to 40%.
NVIDIA Isaac Advances AI Robotics in Healthcare Amid Worker Shortage
NVIDIA Isaac for Healthcare is revolutionizing AI-powered robotics to address the global healthcare worker shortage, enhancing medical procedures and patient care through advanced simulation and AI integration.
NVIDIA's CUTLASS 3.x Enhances GEMM Kernel Design with Modular Abstractions
NVIDIA's CUTLASS 3.x introduces a modular, hierarchical system for GEMM kernel design, improving code readability and extending support to newer architectures like Hopper and Blackwell.
NVIDIA Advances Robotics with New AI Training Models
NVIDIA's latest research introduces innovative AI models and workflows to enhance robot training, enabling efficient learning and adaptability across diverse environments.