LARGE LANGUAGE MODELS
Anthropic Drops Long-Context Premium as Claude 4.6 Models Hit 1M Tokens
Claude Opus 4.6 and Sonnet 4.6 now offer full 1M token context windows at standard API pricing, eliminating the long-context premium entirely.
Anthropic Discovers 'Assistant Axis' to Prevent AI Jailbreaks and Persona Drift
Anthropic researchers map neural 'persona space' in LLMs, finding a key axis that controls AI character stability and blocks harmful behavior patterns.
Optimizing Large Language Models with NVIDIA's TensorRT: Pruning and Distillation Explained
Explore how NVIDIA's TensorRT Model Optimizer utilizes pruning and distillation to enhance large language models, making them more efficient and cost-effective.
NVIDIA Enhances Local LLM Experience on RTX PCs with New Tools and Updates
NVIDIA introduces optimizations for running large language models locally on RTX PCs with tools like Ollama and LM Studio, enhancing AI applications' performance and privacy.
NVIDIA Launches Secure AI General Availability with Enhanced Protection for Large Language Models
NVIDIA announces the general availability of its Secure AI solution, focusing on protecting large language models with enhanced security features.
NVIDIA's AI Sales Assistant: Insights and Innovations
Explore the development and key learnings from NVIDIA's AI sales assistant, leveraging large language models and retrieval-augmented generation to streamline sales workflows.
NVIDIA Enhances TensorRT-LLM with KV Cache Optimization Features
NVIDIA introduces new KV cache optimizations in TensorRT-LLM, enhancing performance and efficiency for large language models on GPUs by managing memory and computational resources.
Enhancing Large Language Models with NVIDIA Triton and TensorRT-LLM on Kubernetes
Explore NVIDIA's methodology for optimizing large language models using Triton and TensorRT-LLM, while deploying and scaling these models efficiently in a Kubernetes environment.
NVIDIA NVLink and NVSwitch Enhance Large Language Model Inference
NVIDIA's NVLink and NVSwitch technologies boost large language model inference, enabling faster and more efficient multi-GPU processing.
Deploying Trillion Parameter AI Models: NVIDIA's Solutions and Strategies
Explore NVIDIA's strategies for deploying trillion-parameter AI models, including parallelism techniques and the Blackwell architecture.
Enhancing AI's Operational Efficiency: Breakthroughs from Microsoft Research and Peking University
Researchers from Microsoft Research and Peking University have developed new methods that improve LLMs' ability to follow complex instructions and generate high-quality graphic designs, marking notable advances in AI operational efficiency.
How Jailbreak Attacks Compromise ChatGPT and AI Models' Security
Recent studies reveal the vulnerability of large language models like GPT-4 to jailbreak attacks. Defense strategies such as self-reminders are being developed to mitigate these risks, underscoring the need for stronger AI security and ethical safeguards.
TOFU: How AI Can Forget Your Privacy Data
TOFU, a benchmark for machine unlearning, tackles the challenge of making AI systems forget specific, unwanted data while retaining their overall knowledge.
Navigating the Resource Efficiency of Large Language Models: A Comprehensive Survey
A survey examines the resource efficiency of Large Language Models (LLMs) like OpenAI's ChatGPT, addressing their high computational demands and proposing optimization strategies.
How LLMs Are Reshaping Agent-Based Modeling and Simulation
LLMs are reshaping agent-based modeling, enhancing simulations in social, economic, and cyber domains with advanced AI integration.