Search results for
quantization
Nexa AI Enhances DeepSeek R1 Distill Performance with NexaQuant on AMD Platforms
Nexa AI introduces NexaQuant technology for DeepSeek R1 Distills, optimizing performance on AMD platforms with improved inference capabilities and reduced memory footprint.
FLUX.1 Kontext Revolutionizes Image Editing with Low-Precision Quantization
Black Forest Labs introduces FLUX.1 Kontext, optimized with NVIDIA's TensorRT for enhanced image editing performance using low-precision quantization on RTX GPUs.
Enhancing Large Language Models: NVIDIA's Post-Training Quantization Techniques
NVIDIA's post-training quantization (PTQ) advances performance and efficiency in AI models, leveraging formats like NVFP4 for optimized inference without retraining, according to NVIDIA.