Ai Infrastructure News

Ai Infrastructure

Harvey AI Unveils Spectre Cloud Agent Platform for Enterprise Development

Legal AI startup Harvey reveals internal cloud agent platform Spectre, signaling infrastructure approach that could reshape enterprise AI deployment across industries.

by Luisa Crawford
Apr 08, 2026

Ai Infrastructure

Riot Platforms Sells $289M in Bitcoin as Mining Output Drops 4% in Q1

RIOT sold 3,778 BTC at $76,626 average while Q1 production fell to 1,473 coins. Hash rate jumped 26% but treasury shrinks 18% as miners pivot toward AI.

by Rongchai Wang
Apr 03, 2026

Ai Infrastructure

Ray 2.55 Adds Fault Tolerance for Large-Scale AI Model Deployments

Anyscale's Ray Serve LLM update enables DP group fault tolerance for vLLM WideEP deployments, reducing downtime risk for distributed AI inference systems.

by Joerg Hiller
Apr 03, 2026

Ai Infrastructure

Together AI Kernels Team Achieves 3.6x Performance Gains on NVIDIA Hardware

Together AI's kernel research team delivers major GPU optimization breakthroughs, cutting inference latency from 281ms to 77ms for enterprise AI deployments.

by Timothy Morano
Apr 02, 2026

Ai Infrastructure

NVIDIA Blackwell Ultra GPUs Crush MLPerf Benchmarks with 2.7x Performance Gains

NVIDIA's Blackwell Ultra GPUs set new MLPerf Inference records with 2.7x faster DeepSeek-R1 processing, hitting 2.5 million tokens per second across 288 GPUs.

by Iris Coleman
Apr 01, 2026

Ai Infrastructure

Bitfarms Becomes Keel Infrastructure, Completes Delaware Move Amid Bitcoin Exit

Former Bitcoin miner Bitfarms officially rebrands as Keel Infrastructure, completing U.S. redomiciliation as it pivots to 2.2GW AI data center business.

by Rongchai Wang
Apr 01, 2026

Ai Infrastructure

Oracle Brings NVIDIA B300 GPUs and xAI Grok to Government Cloud Regions

Oracle expands AI infrastructure for U.S. government customers with NVIDIA Blackwell Ultra GPUs and xAI Grok models in secure cloud regions.

by Peter Zhang
Mar 31, 2026

Ai Infrastructure

Filecoin (FIL) Onchain Cloud Hits Mainnet With 49 TiB Already Stored

Filecoin (FIL) launches programmable cloud storage for AI agents with onchain proofs, automatic payments, and two-copy replication at $2.50/TiB monthly.

by Lawrence Jengar
Mar 26, 2026

Ai Infrastructure

NVIDIA MIG Boosts AI Infrastructure ROI by 33% Over Time-Slicing

New NVIDIA benchmarks show Multi-Instance GPU partitioning achieves 1.00 req/s per GPU versus 0.76 for time-slicing in production AI workloads.

by Jessie A Ellis
Mar 26, 2026

Ai Infrastructure

NVIDIA Claims 1 Million X Efficiency Gains Across Six GPU Generations

NVIDIA details how Vera Rubin platform delivers 10x higher inference throughput per megawatt, reshaping AI data center economics and token factory revenue models.

by Rongchai Wang
Mar 25, 2026

Ai Infrastructure

Ray Serve Upgrade Delivers 88% Lower Latency for AI Inference at Scale

Anyscale announces major Ray Serve optimizations with HAProxy and gRPC, achieving 11.1x throughput gains for LLM inference workloads on enterprise deployments.

by Jessie A Ellis
Mar 25, 2026

Ai Infrastructure

NVIDIA Donates GPU Resource Driver to Kubernetes Open Source Project

NVIDIA transfers critical GPU allocation software to CNCF at KubeCon Europe, marking major shift toward community-governed AI infrastructure.

by Ted Hisokawa
Mar 24, 2026

Ai Infrastructure

NVIDIA Advances AI Infrastructure With Disaggregated LLM Inference on Kubernetes

NVIDIA details new Kubernetes deployment patterns for disaggregated LLM inference using Dynamo and Grove, promising better GPU utilization for AI workloads.

by Terrill Dicki
Mar 23, 2026

Ai Infrastructure

Together AI Upgrades Fine-Tuning Platform With Vision and Reasoning Support

Together AI adds tool calling, reasoning traces, and vision-language fine-tuning to its platform, with 6x throughput gains for 100B+ parameter models.

by Joerg Hiller
Mar 19, 2026

Ai Infrastructure

NVIDIA Unveils AI Grid Architecture for Distributed Edge Inference at GTC 2026

NVIDIA's AI Grid reference design enables telcos to cut inference costs by 76% and meet sub-500ms latency targets through distributed edge computing.

by Jessie A Ellis
Mar 18, 2026

AI INFRASTRUCTURE