AI INFRASTRUCTURE
NVIDIA Unveils BlueField-4 STX Storage Architecture for Agentic AI Workloads
NVIDIA launches BlueField-4 STX at GTC, promising 5x token throughput and 4x energy efficiency for AI infrastructure. Major cloud providers already on board.
NVIDIA Vera CPU Targets Agentic AI With 88-Core Design
NVIDIA launches Vera CPU with 88 custom cores and 1.2 TB/s memory bandwidth, claiming 50% faster performance than traditional CPUs for AI workloads.
NVIDIA Unveils Vera Rubin POD 40-Rack AI Supercomputer for Agentic Workloads
NVIDIA announces Vera Rubin POD featuring 1,152 GPUs across 40 racks, delivering 60 exaflops and 10x better inference performance per watt than Blackwell.
Together AI Launches Voice Agent Platform With Sub-700ms Latency
Together AI debuts unified voice agent infrastructure with Deepgram and Cartesia integrations, targeting enterprise deployments with end-to-end latency under 700ms.
NVIDIA Launches AI Cluster Runtime to Standardize GPU Kubernetes Deployments
NVIDIA's new open-source AI Cluster Runtime project delivers validated, reproducible Kubernetes configurations for GPU clusters, targeting H100 and Blackwell accelerators.
NVIDIA Nemotron 3 Super Hits Together AI With 1M Token Context Window
NVIDIA's 120B-parameter Nemotron 3 Super model now available on Together AI, offering 5x throughput gains for multi-agent AI systems and enterprise workloads.
NVIDIA GTC 2026 Kicks Off March 16 With AI Infrastructure Focus
Jensen Huang's keynote will unveil Vera Rubin architecture details as NVDA trades at $185.52. Here's what traders should watch from San Jose.
NVIDIA Bets $2B on Nebius AI Cloud Partnership, NBIS Stock Jumps
NVIDIA invests $2 billion in Nebius to build hyperscale AI cloud infrastructure, targeting 5 gigawatts of compute capacity by 2030. NBIS shares surge on the news.
Oracle Stock Jumps 2.8% as Q3 Revenue Hits $17.2B on AI Cloud Surge
Oracle reports 22% revenue growth and 44% cloud surge in Q3 FY26, raises FY27 guidance to $90B as RPO explodes 325% to $553B on AI contracts.
Oracle AI Data Centers to Create 8,000 Jobs Across Four US States
Oracle details workforce expansion plans for AI data centers in Michigan, New Mexico, Texas, and Wisconsin, with thousands of construction and permanent positions.
NVIDIA Megatron Core Gets Falcon-H1 Hybrid AI Architecture Support
Technology Innovation Institute integrates Falcon-H1 hybrid architecture and BitNet ternary training into NVIDIA's Megatron Core, enabling efficient large language model development.
NVIDIA Launches Open-Source NIXL Library to Speed AI Inference Data Transfers
NVIDIA releases Inference Transfer Library (NIXL), an open-source tool accelerating KV cache transfers for distributed AI inference across major cloud platforms.
NVIDIA AIConfigurator Slashes LLM Deployment Time With 38% Performance Gains
NVIDIA's open-source AIConfigurator tool optimizes LLM serving configurations in seconds, delivering 38% throughput improvements for disaggregated AI inference deployments.
Bitfarms (BITF) Hires Six Executives to Lead HPC/AI Pivot as Stock Drops 8%
Bitfarms adds six senior leaders from MARA, Brookfield, and Stronghold as it accelerates transition from Bitcoin mining to AI data center infrastructure.
FlashAttention-4 Hits 71% GPU Utilization on NVIDIA Blackwell B200
Together AI's FlashAttention-4 achieves 1,605 TFLOPs/s on B200 GPUs, up to 2.7x faster than Triton. New pipelining overcomes asymmetric hardware scaling bottlenecks.