What is ai infrastructure? ai infrastructure news, ai infrastructure meaning, ai infrastructure definition - Blockchain.News

Search Results for "ai infrastructure"

NVIDIA Blackwell Ultra GPUs Crush MLPerf Benchmarks with 2.7x Performance Gains

NVIDIA Blackwell Ultra GPUs Crush MLPerf Benchmarks with 2.7x Performance Gains

NVIDIA's Blackwell Ultra GPUs set new MLPerf Inference records with 2.7x faster DeepSeek-R1 processing, hitting 2.5 million tokens per second across 288 GPUs.

Together AI Kernels Team Achieves 3.6x Performance Gains on NVIDIA Hardware

Together AI Kernels Team Achieves 3.6x Performance Gains on NVIDIA Hardware

Together AI's kernel research team delivers major GPU optimization breakthroughs, cutting inference latency from 281ms to 77ms for enterprise AI deployments.

Ray 2.55 Adds Fault Tolerance for Large-Scale AI Model Deployments

Ray 2.55 Adds Fault Tolerance for Large-Scale AI Model Deployments

Anyscale's Ray Serve LLM update enables DP group fault tolerance for vLLM WideEP deployments, reducing downtime risk for distributed AI inference systems.

Riot Platforms Sells $289M in Bitcoin as Mining Output Drops 4% in Q1

Riot Platforms Sells $289M in Bitcoin as Mining Output Drops 4% in Q1

RIOT sold 3,778 BTC at $76,626 average while Q1 production fell to 1,473 coins. Hash rate jumped 26% but treasury shrinks 18% as miners pivot toward AI.

Harvey AI Unveils Spectre Cloud Agent Platform for Enterprise Development

Harvey AI Unveils Spectre Cloud Agent Platform for Enterprise Development

Legal AI startup Harvey reveals internal cloud agent platform Spectre, signaling infrastructure approach that could reshape enterprise AI deployment across industries.

NVIDIA Unveils Mission Control Software for Blackwell AI Supercomputers

NVIDIA Unveils Mission Control Software for Blackwell AI Supercomputers

NVIDIA's Mission Control bridges rack-scale GPU hardware with AI workload schedulers, enabling topology-aware job placement on GB200 and GB300 NVL72 systems.

Notion Slashes AI Embedding Costs 80% After Ditching Spark for Ray

Notion Slashes AI Embedding Costs 80% After Ditching Spark for Ray

Notion migrated from Spark on EMR to Ray, cutting embedding costs 80% and improving query latency 10x. Uber and Salesforce shared similar AI infrastructure wins.

NVIDIA Open-Sources Slinky to Run Slurm GPU Workloads on Kubernetes

NVIDIA Open-Sources Slinky to Run Slurm GPU Workloads on Kubernetes

NVIDIA's Slinky project enables running Slurm clusters on Kubernetes, already deployed on 8,000+ GPU systems for large-scale AI training infrastructure.

MiniMax M2.7 Brings 230B-Parameter AI Model to NVIDIA Infrastructure

MiniMax M2.7 Brings 230B-Parameter AI Model to NVIDIA Infrastructure

MiniMax releases M2.7, a 230B-parameter mixture-of-experts model optimized for NVIDIA GPUs with up to 2.7x throughput gains on Blackwell hardware.

NVIDIA NVbandwidth Tool Gets Multi-Node Support for AI Infrastructure Testing

NVIDIA NVbandwidth Tool Gets Multi-Node Support for AI Infrastructure Testing

NVIDIA's NVbandwidth benchmarking tool now supports multi-node GPU clusters, enabling developers to measure bandwidth across NVLink connections at 397+ GB/s.

Eigen Labs Launches Project Darkbloom to Turn Idle Macs Into AI Compute Network

Eigen Labs Launches Project Darkbloom to Turn Idle Macs Into AI Compute Network

New research initiative from Eigen Labs aims to route AI inference through underused Apple Silicon machines, claiming 50% cost reduction versus major providers.

NVIDIA Claims 35x Cost Reduction in AI Token Generation With Blackwell

NVIDIA Claims 35x Cost Reduction in AI Token Generation With Blackwell

NVIDIA's Blackwell architecture delivers $0.12 per million tokens versus $4.20 on Hopper, reshaping AI infrastructure economics for enterprise deployments.

Trending topics