AI INFRASTRUCTURE
NVIDIA Nemotron 3 Super Hits Together AI With 1M Token Context Window
NVIDIA's 120B-parameter Nemotron 3 Super model now available on Together AI, offering 5x throughput gains for multi-agent AI systems and enterprise workloads.
NVIDIA GTC 2026 Kicks Off March 16 With AI Infrastructure Focus
Jensen Huang's keynote will unveil Vera Rubin architecture details as NVDA trades at $185.52. Here's what traders should watch from San Jose.
NVIDIA Bets $2B on Nebius AI Cloud Partnership, NBIS Stock Jumps
NVIDIA invests $2 billion in Nebius to build hyperscale AI cloud infrastructure, targeting 5 gigawatts of compute capacity by 2030. NBIS shares surge on the news.
Oracle Stock Jumps 2.8% as Q3 Revenue Hits $17.2B on AI Cloud Surge
Oracle reports 22% revenue growth and 44% cloud surge in Q3 FY26, raises FY27 guidance to $90B as RPO explodes 325% to $553B on AI contracts.
Oracle AI Data Centers to Create 8,000 Jobs Across Four US States
Oracle details workforce expansion plans for AI data centers in Michigan, New Mexico, Texas, and Wisconsin, with thousands of construction and permanent positions.
NVIDIA Megatron Core Gets Falcon-H1 Hybrid AI Architecture Support
Technology Innovation Institute integrates Falcon-H1 hybrid architecture and BitNet ternary training into NVIDIA's Megatron Core, enabling efficient large language model development.
NVIDIA Launches Open-Source NIXL Library to Speed AI Inference Data Transfers
NVIDIA releases Inference Transfer Library (NIXL), an open-source tool accelerating KV cache transfers for distributed AI inference across major cloud platforms.
NVIDIA AIConfigurator Slashes LLM Deployment Time With 38% Performance Gains
NVIDIA's open-source AIConfigurator tool optimizes LLM serving configurations in seconds, delivering 38% throughput improvements for disaggregated AI inference deployments.
Bitfarms (BITF) Hires Six Executives to Lead HPC/AI Pivot as Stock Drops 8%
Bitfarms adds six senior leaders from MARA, Brookfield, and Stronghold as it accelerates transition from Bitcoin mining to AI data center infrastructure.
FlashAttention-4 Hits 71% GPU Utilization on NVIDIA Blackwell B200
Together AI's FlashAttention-4 achieves 1,605 TFLOPs/s on B200 GPUs, up to 2.7x faster than Triton. New pipelining overcomes asymmetric hardware scaling bottlenecks.
OpenAI Partners With Tata Group to Build 1GW AI Infrastructure in India
OpenAI launches India initiative with Tata Group, starting with 100MW data centers scaling to 1GW. ChatGPT Enterprise deployment planned for hundreds of thousands of TCS employees.
NVIDIA Releases Flash Attention Optimization Guide for Blackwell GPUs
NVIDIA's new cuTile framework delivers 1.6x speedups for Flash Attention on B200 GPUs, enabling faster LLM inference critical for AI infrastructure.
Oracle Seeks Air Permits for $165B Project Jupiter AI Campus in New Mexico
Oracle submitted permit applications for natural gas microgrids at its massive AI data center tied to the $500B Stargate initiative with OpenAI.
NVIDIA GTC 2026 Set for March 16-19 as Jensen Huang Teases Full AI Stack Reveal
NVIDIA's GTC 2026 conference runs March 16-19 in San Jose with 30,000+ attendees expected. CEO Jensen Huang keynote may unveil rumored new inference chip.
NVIDIA Drops $2B on Coherent as Optical Tech Race Heats Up
NVIDIA invests $2 billion in Coherent to develop silicon photonics for AI data centers, part of a broader $4B push into optical interconnect technology.