Machine Learning News

Machine Learning

AI Inference Costs Drop 40% With New GPU Optimization Tactics

Together AI reveals production-tested techniques that cut inference latency by 50-100ms and reduce per-token costs by up to 5x through quantization and smart decoding.

by Jessie A Ellis
Jan 23, 2026

GPU Waste Crisis Hits AI Production as Utilization Drops Below 50%

New analysis reveals production AI workloads achieve under 50% GPU utilization, with CPU-centric architectures blamed for billions in wasted compute resources.

by Joerg Hiller
Jan 22, 2026

Anthropic Releases Full AI Constitution for Claude Under Open License

Anthropic publishes Claude's complete training constitution under CC0 license, detailing AI safety priorities and ethical guidelines as company eyes $350B valuation.

by Joerg Hiller
Jan 22, 2026

LangChain Tackles AI Agent Observability Gap With New Insights Tool

LangChain launches Insights Agent to analyze 100k+ daily traces from AI agents, addressing the critical gap between data collection and actionable understanding.

by James Ding
Jan 21, 2026

GitHub Copilot Gains Cross-Agent Memory System in Public Preview

GitHub launches memory feature for Copilot agents, enabling AI assistants to learn from past interactions and share knowledge across coding, CLI, and code review workflows.

by Alvin Lang
Jan 16, 2026

LangChain Unveils Four Multi-Agent Architecture Patterns for AI Development

LangChain releases comprehensive guide to multi-agent AI systems, detailing subagents, skills, handoffs, and router patterns with performance benchmarks.

by Felix Pinkston
Jan 16, 2026

Multi-Node GPU Training Guide Reveals 72B Model Scaling Secrets

Together AI details how to train 72B parameter models across 128 GPUs, achieving 45-50% utilization with proper network tuning and fault tolerance.

by Jessie A Ellis
Jan 13, 2026

Selecting the Optimal Open-Source Model for Production Applications

Explore the criteria for choosing the right open-source model for production, balancing quality, cost, and speed, while considering legal and technical factors.

by James Ding
Jan 09, 2026

Character.ai Unveils Efficient Techniques for Large-Scale Pretraining

Character.ai reveals methods for making large-scale pretraining more efficient, including Squinch, dynamic clamping, and Gumbel Softmax.

by Tony Kim
Dec 24, 2025

Revolutionizing Semiconductor Defect Detection with AI-Powered Models

NVIDIA leverages generative AI and vision foundation models to enhance semiconductor defect classification, addressing limitations of traditional CNNs and improving manufacturing efficiency.

by Luisa Crawford
Dec 17, 2025

NVIDIA Unveils Nemotron 3: Innovations in AI Model Efficiency and Accuracy

NVIDIA introduces Nemotron 3, an advanced AI model offering enhanced reasoning and efficiency through its hybrid Mamba-Transformer architecture and reinforcement learning capabilities.

by Zach Anderson
Dec 15, 2025

Agent Engineering: Bridging the Gap Between Development and Production

Agent engineering is emerging as a crucial discipline in developing reliable AI systems. Learn how it combines product thinking, engineering, and data science for non-deterministic systems.

by Lawrence Jengar
Dec 10, 2025

AutoJudge Revolutionizes LLM Inference with Enhanced Token Processing

AutoJudge introduces a novel method to accelerate large language model inference by optimizing token processing, reducing human annotation needs, and improving processing speed with minimal accuracy loss.

by Caroline Bishop
Dec 05, 2025

NVIDIA's ToolOrchestra: Revolutionizing AI with Small Orchestration Agents

NVIDIA's ToolOrchestra employs small orchestration agents to optimize AI tasks, achieving superior performance and cost-efficiency. Discover how this innovation is reshaping AI paradigms.

by Iris Coleman
Dec 02, 2025

NVIDIA Introduces Interactive AI Agent for Enhanced Machine Learning Efficiency

NVIDIA unveils an AI agent that accelerates machine learning tasks using GPU technology, simplifying workflows and boosting efficiency through modular design and language model integration.

by Rongchai Wang
Nov 07, 2025
