Ai Infrastructure News

Ai Infrastructure

NVIDIA Nemotron 3 Super Hits Together AI With 1M Token Context Window

NVIDIA's 120B-parameter Nemotron 3 Super model now available on Together AI, offering 5x throughput gains for multi-agent AI systems and enterprise workloads.

by Jessie A Ellis
Mar 12, 2026

Ai Infrastructure

NVIDIA GTC 2026 Kicks Off March 16 With AI Infrastructure Focus

Jensen Huang's keynote will unveil Vera Rubin architecture details as NVDA trades at $185.52. Here's what traders should watch from San Jose.

by Iris Coleman
Mar 12, 2026

Ai Infrastructure

NVIDIA Bets $2B on Nebius AI Cloud Partnership, NBIS Stock Jumps

NVIDIA invests $2 billion in Nebius to build hyperscale AI cloud infrastructure, targeting 5 gigawatts of compute capacity by 2030. NBIS shares surge on the news.

by James Ding
Mar 11, 2026

Ai Infrastructure

Oracle Stock Jumps 2.8% as Q3 Revenue Hits $17.2B on AI Cloud Surge

Oracle reports 22% revenue growth and 44% cloud surge in Q3 FY26, raises FY27 guidance to $90B as RPO explodes 325% to $553B on AI contracts.

by Joerg Hiller
Mar 11, 2026

Ai Infrastructure

Oracle AI Data Centers to Create 8,000 Jobs Across Four US States

Oracle details workforce expansion plans for AI data centers in Michigan, New Mexico, Texas, and Wisconsin, with thousands of construction and permanent positions.

by Iris Coleman
Mar 10, 2026

Ai Infrastructure

NVIDIA Megatron Core Gets Falcon-H1 Hybrid AI Architecture Support

Technology Innovation Institute integrates Falcon-H1 hybrid architecture and BitNet ternary training into NVIDIA's Megatron Core, enabling efficient large language model development.

by Lawrence Jengar
Mar 10, 2026

Ai Infrastructure

NVIDIA Launches Open-Source NIXL Library to Speed AI Inference Data Transfers

NVIDIA releases Inference Transfer Library (NIXL), an open-source tool accelerating KV cache transfers for distributed AI inference across major cloud platforms.

by Lawrence Jengar
Mar 10, 2026

Ai Infrastructure

NVIDIA AIConfigurator Slashes LLM Deployment Time With 38% Performance Gains

NVIDIA's open-source AIConfigurator tool optimizes LLM serving configurations in seconds, delivering 38% throughput improvements for disaggregated AI inference deployments.

by Terrill Dicki
Mar 10, 2026

Ai Infrastructure

Bitfarms (BITF) Hires Six Executives to Lead HPC/AI Pivot as Stock Drops 8%

Bitfarms adds six senior leaders from MARA, Brookfield, and Stronghold as it accelerates transition from Bitcoin mining to AI data center infrastructure.

by Peter Zhang
Mar 09, 2026

Ai Infrastructure

FlashAttention-4 Hits 71% GPU Utilization on NVIDIA Blackwell B200

Together AI's FlashAttention-4 achieves 1,605 TFLOPs/s on B200 GPUs, up to 2.7x faster than Triton. New pipelining overcomes asymmetric hardware scaling bottlenecks.

by Terrill Dicki
Mar 05, 2026

Ai Infrastructure

OpenAI Partners With Tata Group to Build 1GW AI Infrastructure in India

OpenAI launches India initiative with Tata Group, starting with 100MW data centers scaling to 1GW. ChatGPT Enterprise deployment planned for hundreds of thousands of TCS employees.

by Rongchai Wang
Mar 05, 2026

Ai Infrastructure

NVIDIA Releases Flash Attention Optimization Guide for Blackwell GPUs

NVIDIA's new cuTile framework delivers 1.6x speedups for Flash Attention on B200 GPUs, enabling faster LLM inference critical for AI infrastructure.

by Lawrence Jengar
Mar 05, 2026

Ai Infrastructure

Oracle Seeks Air Permits for $165B Project Jupiter AI Campus in New Mexico

Oracle submitted permit applications for natural gas microgrids at its massive AI data center tied to the $500B Stargate initiative with OpenAI.

by Alvin Lang
Mar 04, 2026

Ai Infrastructure

NVIDIA GTC 2026 Set for March 16-19 as Jensen Huang Teases Full AI Stack Reveal

NVIDIA's GTC 2026 conference runs March 16-19 in San Jose with 30,000+ attendees expected. CEO Jensen Huang keynote may unveil rumored new inference chip.

by Peter Zhang
Mar 03, 2026

Ai Infrastructure

NVIDIA Drops $2B on Coherent as Optical Tech Race Heats Up

NVIDIA invests $2 billion in Coherent to develop silicon photonics for AI data centers, part of a broader $4B push into optical interconnect technology.

by Alvin Lang
Mar 03, 2026

AI INFRASTRUCTURE