MACHINE LEARNING
Together AI Showcases Nine Papers at ICML 2026 in Seoul
Together AI presents nine groundbreaking papers at ICML 2026, covering AI agents, model efficiency, and GPU optimization. Booth B714, July 6-11.
Ray Data 2.56 Enhances AI Pipelines with Zero OOM Errors
Ray Data 2.56 eliminates OOM errors, reduces memory pressure, and improves AI training/inference speeds by over 50%.
MiniMax M3 Debuts on NVIDIA: 1M Token Context, Multimodal AI
MiniMax M3, the 428B-parameter model, launches on NVIDIA infrastructure, offering long-context reasoning and multimodal workflows for enterprise AI.
NVIDIA's Nemotron 3 Ultra Redefines AI for Long-Running Agents
NVIDIA's Nemotron 3 Ultra, a 550B-parameter AI model, offers faster, cost-efficient reasoning for complex workflows, enabling long-running agents to perform better.
Mastering AI Agent Customization with NVIDIA Tools
NVIDIA unveils methods for customizing autonomous AI agents, from prompt engineering to advanced reinforcement learning.
NVIDIA, Google Cloud Expand AI Ecosystem for 100K+ Developers
NVIDIA and Google Cloud grow their joint AI developer community to over 100,000 members, offering new tools for cutting-edge AI applications.
NVIDIA FLARE Simplifies Federated Learning for ML Teams
NVIDIA FLARE removes barriers to federated learning adoption by simplifying workflows and enhancing compliance, privacy, and scalability.
NVIDIA NeMo RL Achieves 48% Speedup with End-to-End FP8 Precision Training
NVIDIA's new FP8 recipe for reinforcement learning delivers 48% faster training while matching BF16 accuracy, cutting AI infrastructure costs significantly.
Google Launches Gemini 3.1 Flash TTS with Audio Tags for AI Voice Control
Google releases Gemini 3.1 Flash TTS across developer platforms, featuring new audio tags for granular speech control and support for 70+ languages.
Anthropic's AI Researchers Outperform Humans 4x on Alignment Task
Anthropic's Claude models achieved 97% success rate on AI safety benchmark versus 23% human baseline, spending $18K over 800 hours of autonomous research.
NVIDIA Launches ALCHEMI Toolkit for GPU-Accelerated Chemistry Simulations
NVIDIA releases ALCHEMI Toolkit enabling researchers to build custom atomistic simulation workflows with up to 33x speedups for batched molecular dynamics on GPUs.
MiniMax M2.7 Brings 230B-Parameter AI Model to NVIDIA Infrastructure
MiniMax releases M2.7, a 230B-parameter mixture-of-experts model optimized for NVIDIA GPUs with up to 2.7x throughput gains on Blackwell hardware.
Anthropic Reveals Claude Code Tool Design Philosophy Behind AI Agent Development
Anthropic engineers detail how they build and refine AI agent tools for Claude Code, introducing progressive disclosure techniques that shape AI development.
NVIDIA nvCOMP Cuts AI Training Checkpoint Costs by $56K Monthly
New GPU compression library reduces LLM training checkpoint sizes by 25-40%, saving teams up to $222K monthly on large-scale model training infrastructure.
Notion Slashes AI Embedding Costs 80% After Ditching Spark for Ray
Notion migrated from Spark on EMR to Ray, cutting embedding costs 80% and improving query latency 10x. Uber and Salesforce shared similar AI infrastructure wins.