TOGETHER AI
NVIDIA Nemotron 3 Nano Omni Launches on Together AI for Multimodal AI
Together AI integrates NVIDIA Nemotron 3 Nano Omni, a multimodal AI model, offering developers scalable, efficient reasoning across video, audio, and text.
Together AI Launches Wan 2.7 Video Suite at $0.10 Per Second
Alibaba's Wan 2.7 AI video model hits Together AI with text-to-video now live, image-to-video and editing tools coming soon at competitive pricing.
Together AI Kernels Team Achieves 3.6x Performance Gains on NVIDIA Hardware
Together AI's kernel research team delivers major GPU optimization breakthroughs, cutting inference latency from 281ms to 77ms for enterprise AI deployments.
Together AI Upgrades Fine-Tuning Platform With Vision and Reasoning Support
Together AI adds tool calling, reasoning traces, and vision-language fine-tuning to its platform, with 6x throughput gains for 100B+ parameter models.
Together AI Launches Voice Agent Platform With Sub-700ms Latency
Together AI debuts unified voice agent infrastructure with Deepgram and Cartesia integrations, targeting enterprise deployments with end-to-end latency under 700ms.
NVIDIA Nemotron 3 Super Hits Together AI With 1M Token Context Window
NVIDIA's 120B-parameter Nemotron 3 Super model now available on Together AI, offering 5x throughput gains for multi-agent AI systems and enterprise workloads.
Together AI Upgrades GPU Clusters With Autoscaling and Self-Healing Features
Together AI adds enterprise-grade autoscaling, RBAC, observability dashboards, and self-healing node repair to GPU Clusters as company pursues $1B funding round.
Together AI's CDLM Achieves 14.5x Faster AI Inference Without Quality Loss
Consistency Diffusion Language Models solve two critical bottlenecks in AI inference, delivering up to 14.5x latency improvements while maintaining accuracy on coding and math tasks.
Together AI Achieves 40% Faster LLM Inference With Cache-Aware Architecture
Together AI's new CPD system separates warm and cold inference workloads, delivering 35-40% higher throughput for long-context AI applications on NVIDIA B200 GPUs.
Together AI Drops Largest Open Dataset for Training Coding Agents
TogetherCoder-Preview releases 161K verified coding trajectories achieving 59.4% on SWE-Bench, giving developers unprecedented training data for AI agents.
Together AI Opens Evaluations to OpenAI, Anthropic, Google Models
Together Evaluations now benchmarks proprietary AI models from OpenAI, Anthropic, and Google against open-source alternatives, claiming 10x cost savings.
Together AI Launches DSGym Framework for Training Data Science AI Agents
Together AI's DSGym framework benchmarks LLM agents on 90+ bioinformatics tasks and 92 Kaggle competitions. Their 4B parameter model matches larger rivals.
Together AI Integrates Rime Voice Models for Enhanced TTS Solutions
Together AI announces integration of Rime Arcana v2 and Mist v2 models to improve text-to-speech capabilities, offering enhanced expressivity and pronunciation control for enterprise applications.
NVIDIA's Nemotron 3 Nano Now Available on Together AI
NVIDIA's Nemotron 3 Nano, a cutting-edge reasoning model, is now accessible via Together AI, offering enhanced performance for multi-agent systems.
TorchForge RL Pipelines Now Operable on Together AI's Cloud
Together AI introduces TorchForge RL pipelines on its cloud platform, enhancing distributed training and sandboxed environments with a BlackJack training demo.