Model Optimization News | Blockchain.News

Optimizing Large Language Models with NVIDIA's TensorRT: Pruning and Distillation Explained

Explore how NVIDIA's TensorRT Model Optimizer uses pruning and distillation to shrink large language models, making them more efficient and cost-effective to deploy.

Enhancing AI Model Efficiency: Torch-TensorRT Speeds Up PyTorch Inference

Discover how Torch-TensorRT optimizes PyTorch models for NVIDIA GPUs, doubling inference speed for diffusion models with minimal code changes.