LLMs
NVIDIA's ComputeEval 2025.2 Challenges LLMs with Advanced CUDA Tasks
NVIDIA expands ComputeEval with 232 new CUDA challenges, testing LLMs' capabilities in complex programming tasks. Discover the impact on AI-assisted coding.
Solana (SOL) Bench: Evaluating LLMs' Competence in Crypto Transactions
Solana (SOL) introduces Solana Bench, a benchmark for assessing how effectively LLMs execute complex crypto transactions on the Solana blockchain.
Exploring Context Engineering in AI Agent Development
Discover how context engineering is transforming AI agent development by optimizing information management through strategies like writing, selecting, compressing, and isolating context.
Exploring Open Source Reinforcement Learning Libraries for LLMs
An in-depth analysis of leading open-source reinforcement learning libraries for large language models, comparing frameworks like TRL, Verl, and RAGEN.
NVIDIA Enhances AnythingLLM with RTX AI PC Acceleration
NVIDIA's latest integration of RTX GPUs with AnythingLLM offers faster performance for local AI workflows, enhancing accessibility for AI enthusiasts.
Open-Source AI: Mixture-of-Agents Alignment Revolutionizes Post-Training for LLMs
Mixture-of-Agents Alignment (MoAA) is a groundbreaking post-training method that enhances large language models by leveraging open-source collective intelligence, as detailed in a new ICML 2025 paper.
NVIDIA NIM Microservices Revolutionize Scientific Literature Reviews
NVIDIA's NIM microservices for LLMs are transforming the process of scientific literature reviews, offering enhanced speed and accuracy in information extraction and classification.
Efficient Meeting Summaries with LLMs Using Python
Learn how to create detailed meeting summaries using AssemblyAI's LeMUR framework and large language models (LLMs) with just five lines of Python code.
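For context, the five-line pattern the article describes looks roughly like the sketch below. It is a minimal illustration assuming the assemblyai Python SDK; the API key placeholder and file name are hypothetical.

    import assemblyai as aai

    aai.settings.api_key = "YOUR_API_KEY"  # hypothetical placeholder key
    # Transcribe the meeting recording (local file path or URL)
    transcript = aai.Transcriber().transcribe("meeting_recording.mp3")
    # Ask LeMUR to have an LLM summarize the transcript
    result = transcript.lemur.task("Summarize this meeting in concise bullet points.")
    print(result.response)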
Exploring the Impact of LLM Integration on Conversation Intelligence Platforms
Discover how integrating Large Language Models (LLMs) revolutionizes Conversation Intelligence platforms, enhancing user experience, customer understanding, and decision-making processes.
Enhancing LLMs for Domain-Specific Multi-Turn Conversations
Explore the challenges and solutions in fine-tuning Large Language Models (LLMs) for effective domain-specific multi-turn conversations, as detailed by together.ai.
Exploring Model Merging Techniques for Large Language Models (LLMs)
Discover how model merging improves the efficiency of large language models by combining multiple fine-tuned checkpoints into a single model, repurposing training resources and improving task-specific performance, according to NVIDIA's insights.
Innovative LoLCATs Method Enhances LLM Efficiency and Quality
Together.ai introduces LoLCATs, a novel approach for linearizing LLMs that improves efficiency while preserving model quality, promising significant gains in AI model development.
Llama 3.1 405B Achieves 1.5x Throughput Boost with NVIDIA H200 GPUs and NVLink
NVIDIA's latest parallelism techniques boost Llama 3.1 405B throughput by 1.5x on NVIDIA H200 Tensor Core GPUs with NVLink Switch, improving AI inference performance.
NVIDIA GH200 NVL32: Revolutionizing Time-to-First-Token Performance with NVLink Switch
NVIDIA's GH200 NVL32 system shows significant improvements in time-to-first-token performance for large language models, enhancing real-time AI applications.
AI21 Labs Unveils Jamba 1.5 LLMs with Hybrid Architecture for Enhanced Reasoning
AI21 Labs introduces Jamba 1.5, a new family of large language models leveraging hybrid architecture for superior reasoning and long context handling.