Llm News

Llm

NVIDIA Introduces Efficient Fine-Tuning with NeMo Curator for Custom LLM Datasets

NVIDIA's NeMo Curator offers a streamlined method for fine-tuning large language models (LLMs) with custom datasets, enhancing machine learning workflows.

by Felix Pinkston
Aug 01, 2024

Llm

LangSmith Introduces Flexible Dataset Schemas for Efficient Data Curation

LangSmith now offers flexible dataset schemas, enabling efficient and iterative data curation for LLM applications, as announced by LangChain Blog.

by Zach Anderson
Jul 31, 2024

Llm

Codestral Mamba: NVIDIA's Next-Gen Coding LLM Revolutionizes Code Completion

NVIDIA's Codestral Mamba, built on Mamba-2 architecture, revolutionizes code completion with advanced AI, enabling superior coding efficiency.

by Jessie A Ellis
Jul 25, 2024

Llm

Enhancing LLM Tool-Calling Performance with Few-Shot Prompting

LangChain's experiments reveal how few-shot prompting significantly boosts LLM tool-calling accuracy, especially for complex tasks.

by Alvin Lang
Jul 25, 2024

Llm

NVIDIA and Meta Collaborate on Advanced RAG Pipelines with Llama 3.1 and NeMo Retriever NIMs

NVIDIA and Meta introduce scalable agentic RAG pipelines with Llama 3.1 and NeMo Retriever NIMs, optimizing LLM performance and decision-making capabilities.

by Peter Zhang
Jul 24, 2024

Llm

Enhancing Agent Planning: Insights from LangChain

LangChain explores the limitations and future of planning for agents with LLMs, highlighting cognitive architectures and current fixes.

by Alvin Lang
Jul 21, 2024

Llm

NVIDIA NeMo Enhances LLM Capabilities with Hybrid State Space Model Integration

NVIDIA NeMo introduces support for hybrid state space models, significantly enhancing the efficiency and capabilities of large language models.

by Tony Kim
Jul 18, 2024

Llm

NVIDIA NeMo Curator Enhances Non-English Dataset Preparation for LLM Training

NVIDIA NeMo Curator simplifies the curation of high-quality non-English datasets for LLM training, ensuring better model accuracy and reliability.

by Timothy Morano
Jul 12, 2024

Llm

WordSmith Enhances Legal AI Operations with LangSmith Integration

WordSmith leverages LangSmith for prototyping, debugging, and evaluating LLM performance, enhancing operations for in-house legal teams.

by Joerg Hiller
Jul 09, 2024

Llm

LangChain: Understanding Cognitive Architecture in AI Systems

Explore the concept of cognitive architecture in AI, outlining various levels of autonomy and their applications in LLM-driven systems.

by Zach Anderson
Jul 06, 2024

Llm

Understanding the Role and Capabilities of AI Agents

Explore the concept of AI agents, their varying degrees of autonomy, and the importance of agentic behavior in LLM applications, according to LangChain Blog.

by Rebeca Moen
Jun 30, 2024

Llm

Ensuring Integrity: Secure LLM Tokenizers Against Potential Threats

NVIDIA's AI Red Team highlights the risks and mitigation strategies for securing LLM tokenizers to maintain application integrity and prevent exploitation.

by Caroline Bishop
Jun 28, 2024

Llm

LangChain Introduces Self-Improving Evaluators for LLM-as-a-Judge

LangChain's new self-improving evaluators for LLM-as-a-Judge aim to align AI outputs with human preferences, leveraging few-shot learning and user feedback.

by Luisa Crawford
Jun 27, 2024

Llm

IBM Research Unveils Cost-Effective AI Inferencing with Speculative Decoding

IBM Research has developed a speculative decoding technique combined with paged attention to significantly enhance the cost performance of large language model (LLM) inferencing.

by Luisa Crawford
Jun 25, 2024

Llm

Character.AI Enhances AI Inference Efficiency, Reduces Costs by 33X

Character.AI announces significant breakthroughs in AI inference technology, reducing serving costs by 33 times since launch, making LLMs more scalable and cost-effective.

by Rebeca Moen
Jun 21, 2024