MACHINE LEARNING
Multi-Node GPU Training Guide Reveals 72B Model Scaling Secrets
Together.ai details how to train 72B parameter models across 128 GPUs, achieving 45-50% utilization with proper network tuning and fault tolerance.
Selecting the Optimal Open-Source Model for Production Applications
Explore the criteria for choosing the right open-source model for production, balancing quality, cost, and speed, while considering legal and technical factors.
Character.ai Unveils Efficient Techniques for Large-Scale Pretraining
Character.ai reveals innovative methods for optimizing large-scale pretraining, focusing on techniques like Squinch, dynamic clamping, and Gumbel Softmax, to enhance efficiency in AI model training.
Revolutionizing Semiconductor Defect Detection with AI-Powered Models
NVIDIA leverages generative AI and vision foundation models to enhance semiconductor defect classification, addressing limitations of traditional CNNs and improving manufacturing efficiency.
NVIDIA Unveils Nemotron 3: Innovations in AI Model Efficiency and Accuracy
NVIDIA introduces Nemotron 3, an advanced AI model offering enhanced reasoning and efficiency through its hybrid Mamba-Transformer architecture and reinforcement learning capabilities.
Agent Engineering: Bridging the Gap Between Development and Production
Agent engineering is emerging as a crucial discipline in developing reliable AI systems. Learn how it combines product thinking, engineering, and data science for non-deterministic systems.
AutoJudge Revolutionizes LLM Inference with Enhanced Token Processing
AutoJudge introduces a novel method to accelerate large language model inference by optimizing token processing, reducing human annotation needs, and improving processing speed with minimal accuracy loss.
NVIDIA's ToolOrchestra: Revolutionizing AI with Small Orchestration Agents
NVIDIA's ToolOrchestra employs small orchestration agents to optimize AI tasks, achieving superior performance and cost-efficiency. Discover how this innovation is reshaping AI paradigms.
NVIDIA Introduces Interactive AI Agent for Enhanced Machine Learning Efficiency
NVIDIA unveils an AI agent that accelerates machine learning tasks using GPU technology, simplifying workflows and boosting efficiency through modular design and language model integration.
Character.AI's Kaiju: Scaling Conversational Models with Efficiency and Safety
Character.AI's Kaiju models offer a scalable and efficient solution for conversational AI, focusing on safety and engagement through innovative architectural features.
Understanding the Rise of Graph Neural Networks in AI
Graph Neural Networks (GNNs) are reshaping AI by enhancing data interpretation and improving applications. Learn how GNNs are crucial in advancing machine learning models.
Large Reasoning Models Struggle with Instruction Adherence, Study Reveals
A recent study by Together AI unveils that large reasoning models often fail to comply with instructions during reasoning, highlighting significant challenges in AI model adherence.
Harnessing AI for Crypto Trading: Opportunities and Challenges
Explore how AI trading tools are transforming crypto markets, enhancing decision-making, and automating execution. Understand the benefits and challenges of integrating AI into trading strategies.
ATLAS: Revolutionizing LLM Inference with Adaptive Learning
Together.ai introduces ATLAS, a system enhancing LLM inference speed by adapting to workloads, achieving 500 TPS on DeepSeek-V3.1.
Enhancing Text-to-SQL Models Using Tinker and Ray
Discover how Tinker and Ray are utilized to fine-tune text-to-SQL models, enhancing AI capabilities in generating efficient SQL queries.