Llm News

Llm

LangChain Celebrates Two Years: Reflecting on Milestones and Future Directions

LangChain marks its second anniversary, highlighting its evolution from a Python package to a leading company in LLM applications, and introduces LangSmith and LangGraph.

by Rebeca Moen
Oct 25, 2024

Llm

Boosting LLM Performance on RTX: Leveraging LM Studio and GPU Offloading

Explore how GPU offloading with LM Studio enables efficient local execution of large language models on RTX-powered systems, enhancing AI applications' performance.

by Tony Kim
Oct 23, 2024

Llm

NVIDIA Unveils Llama 3.1-Nemotron-70B-Reward to Enhance AI Alignment with Human Preferences

NVIDIA introduces Llama 3.1-Nemotron-70B-Reward, a leading reward model that improves AI alignment with human preferences using RLHF, topping the RewardBench leaderboard.

by Felix Pinkston
Oct 06, 2024

Llm

NVIDIA and Outerbounds Revolutionize LLM-Powered Production Systems

NVIDIA and Outerbounds collaborate to streamline the development and deployment of LLM-powered production systems with advanced microservices and MLOps platforms.

by Lawrence Jengar
Oct 03, 2024

Llm

Ollama Enables Local Running of Llama 3.2 on AMD GPUs

Ollama makes it easier to run Meta's Llama 3.2 model locally on AMD GPUs, offering support for both Linux and Windows systems.

by Iris Coleman
Sep 27, 2024

Llm

LangGraph.js v0.2 Enhances JavaScript Agents with Cloud and Studio Support

LangChain releases LangGraph.js v0.2 with new features for building and deploying JavaScript agents, including support for LangGraph Cloud and LangGraph Studio.

by Ted Hisokawa
Sep 04, 2024

Llm

TEAL Introduces Training-Free Activation Sparsity to Boost LLM Efficiency

TEAL offers a training-free approach to activation sparsity, significantly enhancing the efficiency of large language models (LLMs) with minimal degradation.

by Zach Anderson
Sep 01, 2024

Llm

AMD Radeon PRO GPUs and ROCm Software Expand LLM Inference Capabilities

AMD's Radeon PRO GPUs and ROCm software enable small enterprises to leverage advanced AI tools, including Meta's Llama models, for various business applications.

by Felix Pinkston
Aug 31, 2024

Llm

NVIDIA's Blackwell Platform Breaks New Records in MLPerf Inference v4.1

NVIDIA's Blackwell architecture sets new benchmarks in MLPerf Inference v4.1, showcasing significant performance improvements in LLM inference.

by Joerg Hiller
Aug 29, 2024

Llm

MIT Research Unveils AI's Potential in Safeguarding Critical Infrastructure

MIT's new study reveals how large language models (LLMs) can efficiently detect anomalies in critical infrastructure systems, offering a plug-and-play solution.

by Joerg Hiller
Aug 27, 2024

Llm

Understanding Decoding Strategies in Large Language Models (LLMs)

Explore how Large Language Models (LLMs) choose the next word using decoding strategies. Learn about different methods like greedy search, beam search, and more.

by Darius Baruo
Aug 22, 2024

Llm

Strategies to Optimize Large Language Model (LLM) Inference Performance

NVIDIA experts share strategies to optimize large language model (LLM) inference performance, focusing on hardware sizing, resource optimization, and deployment methods.

by Iris Coleman
Aug 22, 2024

Llm

NVIDIA Unveils Pruning and Distillation Techniques for Efficient LLMs

NVIDIA introduces structured pruning and distillation methods to create efficient language models, significantly reducing resource demands while maintaining performance.

by Rebeca Moen
Aug 16, 2024

Llm

LangSmith Enhances LLM Apps with Dynamic Few-Shot Examples

LangSmith introduces dynamic few-shot example selectors, allowing for improved LLM app performance by dynamically selecting relevant examples based on user input.

by Rongchai Wang
Aug 07, 2024

Llm

Character.AI Enters Agreement with Google, Announces Leadership Changes

Character.AI announces a strategic agreement with Google and key leadership changes to accelerate the development of personalized AI products.

by Lawrence Jengar
Aug 03, 2024