NVIDIA
NVIDIA Triton Inference Server Excels in MLPerf Inference 4.1 Benchmarks
NVIDIA Triton Inference Server achieves exceptional performance in MLPerf Inference 4.1 benchmarks, demonstrating its capabilities in AI model deployment.
NVIDIA Simplifies Camera Calibration for Enhanced AI Multi-Camera Tracking
NVIDIA introduces streamlined camera calibration processes to boost the accuracy of AI-powered multi-camera tracking applications.
NVIDIA NIM Enhances RAG Applications for Veterinary AI
NVIDIA NIM improves retrieval-augmented generation (RAG) applications, streamlining AI solutions in specialized fields like veterinary science.
NVIDIA Unveils Spectrum-X to Enhance Large-Scale AI Workloads
NVIDIA introduces Spectrum-X, a high-performance Ethernet fabric, to optimize AI workloads and enhance data center efficiency.
NVIDIA and Partners Unveil NIM Agent Blueprints to Accelerate AI Application Development
NVIDIA and global partners have launched NIM Agent Blueprints to help enterprises build and deploy generative AI applications for various use cases.
NVIDIA NIM Agent Blueprints Propel Enterprise Generative AI
NVIDIA launches NIM Agent Blueprints to fast-track enterprise generative AI applications, giving businesses advanced AI tools to meet their objectives.
NVIDIA's NIM Agent Blueprint Transforms Drug Discovery with AI-Driven Virtual Screening
NVIDIA's NIM Agent Blueprint leverages generative AI for faster, cost-effective drug discovery, enhancing hit-to-lead transitions.
NVIDIA Introduces NIM Microservices for Generative AI in Japan and Taiwan
NVIDIA launches NIM microservices to support generative AI in Japan and Taiwan, enhancing regional language models and local AI applications.
NVIDIA Unveils New CUDA Libraries, Promises Major Speed and Efficiency Gains
NVIDIA introduces new CUDA libraries to enhance accelerated computing, offering substantial speed and energy efficiency improvements across various applications.
NVIDIA AI Workbench Simplifies GPU Utilization on Windows
NVIDIA's AI Workbench streamlines data science, ML, and AI projects across PCs, workstations, datacenters, and cloud environments.
NVIDIA's CUDA-Q Reduces Resources for Quantum Clustering Algorithms
NVIDIA's CUDA-Q platform enables significant resource reduction in quantum clustering algorithms, making them more feasible for near-term quantum computing applications.
NVIDIA to Showcase Data Center Innovations at Hot Chips 2024
NVIDIA engineers will unveil advancements in the Blackwell platform, liquid cooling, and AI-driven chip design at Hot Chips 2024.
AI21 Labs Unveils Jamba 1.5 LLMs with Hybrid Architecture for Enhanced Reasoning
AI21 Labs introduces Jamba 1.5, a new family of large language models that uses a hybrid architecture for superior reasoning and long-context handling.
Google Cloud Run Integrates NVIDIA L4 GPUs for Enhanced AI Inference Deployments
Google Cloud Run now supports NVIDIA L4 GPUs, NVIDIA NIM, and serverless AI inference deployments, optimizing performance and scalability for AI applications.
NVIDIA Unveils Mistral-NeMo-Minitron 8B: Compact Language Model with High Accuracy
NVIDIA releases Mistral-NeMo-Minitron 8B, a compact language model delivering state-of-the-art accuracy, optimized for various AI applications.