Ai Infrastructure News | Blockchain.News

AI INFRASTRUCTURE

NVIDIA Unveils BlueField-4 STX Storage Architecture for Agentic AI Workloads
Ai Infrastructure

NVIDIA Unveils BlueField-4 STX Storage Architecture for Agentic AI Workloads

NVIDIA launches BlueField-4 STX at GTC, promising 5x token throughput and 4x energy efficiency for AI infrastructure. Major cloud providers already on board.

NVIDIA Vera CPU Targets Agentic AI With 88-Core Design
Ai Infrastructure

NVIDIA Vera CPU Targets Agentic AI With 88-Core Design

NVIDIA launches Vera CPU with 88 custom cores and 1.2 TB/s memory bandwidth, claiming 50% faster performance than traditional CPUs for AI workloads.

NVIDIA Unveils Vera Rubin POD 40-Rack AI Supercomputer for Agentic Workloads
Ai Infrastructure

NVIDIA Unveils Vera Rubin POD 40-Rack AI Supercomputer for Agentic Workloads

NVIDIA announces Vera Rubin POD featuring 1,152 GPUs across 40 racks, delivering 60 exaflops and 10x better inference performance per watt than Blackwell.

Together AI Launches Voice Agent Platform With Sub-700ms Latency
Ai Infrastructure

Together AI Launches Voice Agent Platform With Sub-700ms Latency

Together AI debuts unified voice agent infrastructure with Deepgram and Cartesia integrations, targeting enterprise deployments with end-to-end latency under 700ms.

NVIDIA Launches AI Cluster Runtime to Standardize GPU Kubernetes Deployments
Ai Infrastructure

NVIDIA Launches AI Cluster Runtime to Standardize GPU Kubernetes Deployments

NVIDIA's new open-source AI Cluster Runtime project delivers validated, reproducible Kubernetes configurations for GPU clusters, targeting H100 and Blackwell accelerators.

NVIDIA Nemotron 3 Super Hits Together AI With 1M Token Context Window
Ai Infrastructure

NVIDIA Nemotron 3 Super Hits Together AI With 1M Token Context Window

NVIDIA's 120B-parameter Nemotron 3 Super model now available on Together AI, offering 5x throughput gains for multi-agent AI systems and enterprise workloads.

NVIDIA GTC 2026 Kicks Off March 16 With AI Infrastructure Focus
Ai Infrastructure

NVIDIA GTC 2026 Kicks Off March 16 With AI Infrastructure Focus

Jensen Huang's keynote will unveil Vera Rubin architecture details as NVDA trades at $185.52. Here's what traders should watch from San Jose.

NVIDIA Bets $2B on Nebius AI Cloud Partnership, NBIS Stock Jumps
Ai Infrastructure

NVIDIA Bets $2B on Nebius AI Cloud Partnership, NBIS Stock Jumps

NVIDIA invests $2 billion in Nebius to build hyperscale AI cloud infrastructure, targeting 5 gigawatts of compute capacity by 2030. NBIS shares surge on the news.

Oracle Stock Jumps 2.8% as Q3 Revenue Hits $17.2B on AI Cloud Surge
Ai Infrastructure

Oracle Stock Jumps 2.8% as Q3 Revenue Hits $17.2B on AI Cloud Surge

Oracle reports 22% revenue growth and 44% cloud surge in Q3 FY26, raises FY27 guidance to $90B as RPO explodes 325% to $553B on AI contracts.

Oracle AI Data Centers to Create 8,000 Jobs Across Four US States
Ai Infrastructure

Oracle AI Data Centers to Create 8,000 Jobs Across Four US States

Oracle details workforce expansion plans for AI data centers in Michigan, New Mexico, Texas, and Wisconsin, with thousands of construction and permanent positions.

NVIDIA Megatron Core Gets Falcon-H1 Hybrid AI Architecture Support
Ai Infrastructure

NVIDIA Megatron Core Gets Falcon-H1 Hybrid AI Architecture Support

Technology Innovation Institute integrates Falcon-H1 hybrid architecture and BitNet ternary training into NVIDIA's Megatron Core, enabling efficient large language model development.

NVIDIA Launches Open-Source NIXL Library to Speed AI Inference Data Transfers
Ai Infrastructure

NVIDIA Launches Open-Source NIXL Library to Speed AI Inference Data Transfers

NVIDIA releases Inference Transfer Library (NIXL), an open-source tool accelerating KV cache transfers for distributed AI inference across major cloud platforms.

NVIDIA AIConfigurator Slashes LLM Deployment Time With 38% Performance Gains
Ai Infrastructure

NVIDIA AIConfigurator Slashes LLM Deployment Time With 38% Performance Gains

NVIDIA's open-source AIConfigurator tool optimizes LLM serving configurations in seconds, delivering 38% throughput improvements for disaggregated AI inference deployments.

Bitfarms (BITF) Hires Six Executives to Lead HPC/AI Pivot as Stock Drops 8%
Ai Infrastructure

Bitfarms (BITF) Hires Six Executives to Lead HPC/AI Pivot as Stock Drops 8%

Bitfarms adds six senior leaders from MARA, Brookfield, and Stronghold as it accelerates transition from Bitcoin mining to AI data center infrastructure.

FlashAttention-4 Hits 71% GPU Utilization on NVIDIA Blackwell B200
Ai Infrastructure

FlashAttention-4 Hits 71% GPU Utilization on NVIDIA Blackwell B200

Together AI's FlashAttention-4 achieves 1,605 TFLOPs/s on B200 GPUs, up to 2.7x faster than Triton. New pipelining overcomes asymmetric hardware scaling bottlenecks.