CLOUD COMPUTING
NVIDIA Dynamo Expands AWS Support for Enhanced AI Inference Efficiency
NVIDIA Dynamo now supports AWS services, offering developers enhanced efficiency for large-scale AI inference. The integration promises performance improvements and cost savings.
CoreWeave Marks Milestone with NVIDIA GB300 NVL72 Platform Deployment
CoreWeave becomes the first AI cloud provider to deploy NVIDIA's GB300 NVL72 systems, enhancing AI performance and expanding its cloud capabilities.
NVIDIA and AWS Join Forces to Enhance AI Training Scalability
NVIDIA Run:ai and Amazon SageMaker HyperPod integrate to streamline AI training, offering enhanced scalability and resource management across hybrid cloud environments.
NVIDIA and EBU Team Up to Advance Sovereign AI for European Broadcasters
NVIDIA collaborates with the European Broadcasting Union to develop sovereign AI frameworks, enhancing the autonomy and innovation of public broadcasters across Europe.
NVIDIA Dynamo Introduces GPU Autoscaling and Kubernetes Automation
At GTC 2025, NVIDIA unveils Dynamo, an open-source inference framework featuring GPU autoscaling, Kubernetes automation, and networking optimizations for AI deployment.
Anyscale Expands AI Compute Capabilities with New Multi-Cloud and AKS Support
Anyscale introduces enhanced AI compute solutions with support for Azure Kubernetes Service, Global Resource Scheduler, and upcoming multi-deployment management, optimizing resource utilization and scaling across cloud platforms.
NVIDIA and Microsoft Propel AI Innovation from Cloud to PC
NVIDIA and Microsoft collaborate to enhance AI applications, introducing breakthroughs from cloud to PC, with significant advancements in AI inferencing and development tools.
NVIDIA Air Services: Bridging Simulations with Real-World Applications
NVIDIA Air Services enhances simulation capabilities by integrating real-world applications, offering cloud-scale efficiency and seamless external connectivity for advanced data center infrastructure.
NVIDIA Introduces DGX Cloud Serverless Inference for Scalable AI Solutions
NVIDIA unveils DGX Cloud Serverless Inference, a new AI solution aimed at independent software vendors (ISVs) that enables seamless deployment across cloud environments with enhanced scalability and flexibility.
Enhancing Data Processing with NVIDIA KvikIO for Remote IO
NVIDIA's KvikIO offers high-performance remote IO capabilities, optimizing data processing for cloud workloads using object storage services like S3 and Azure Blob Storage.
CoreWeave Introduces NVIDIA Blackwell Cloud Instances for Enhanced AI Performance
CoreWeave has launched NVIDIA GB200 NVL72-based instances, marking the first general availability of NVIDIA Blackwell in the cloud, offering unprecedented AI performance and scalability.
NVIDIA DOCA 2.9 Revolutionizes AI and Cloud Infrastructure with Advanced Features
NVIDIA DOCA 2.9 introduces significant enhancements in AI and cloud computing, offering improved performance, security, and scalability for data centers and developers.
CoreWeave Secures $650 Million Investment to Enhance AI Cloud Platform
CoreWeave, an AI hyperscaler, garners $650 million in a minority investment led by top investors, showcasing confidence in its advanced cloud platform.
Japan's Cloud Giants Collaborate with NVIDIA to Revolutionize AI Infrastructure
Japan's leading cloud providers partner with NVIDIA to build AI infrastructure, enhancing industries like robotics, automotive, healthcare, and telecommunications with advanced computing and software solutions.
LangChain Unveils LangGraph Platform with Enhanced Deployment Options
LangChain introduces LangGraph Platform, offering various deployment options for scalable agent infrastructure, including self-hosted and cloud solutions, to meet diverse developer needs.