Topic
Model Optimization
Learn Model Optimization as a connected topic across chapters, concepts, simulations, and interview reasoning.
10 Concepts2 Articles30 min
Overview
Learn Model Optimization as a connected topic across chapters, concepts, simulations, and interview reasoning.
How this topic helps
Ai
Deep Learning
Inference
Llm
Learning Path in this Topic
Series that contain articles from Model Optimization. Select a path to filter the article list.
Articles
2 matched articles
Article 1Types of LLM Quantization: By Timing, Scope, and MappingTLDR: There is no single "best" LLM quantization. You classify and choose quantization along three axes: when you quantize (timing), what you quantize (scope), and how values are encoded (mapping). In17 min
Article 2LLM Model Quantization: Why, When, and How to Deploy Smaller, Faster ModelsTLDR: Quantization converts high-precision model weights and activations (FP16/FP32) into lower-precision formats (INT8 or INT4) so LLMs run with less memory, lower latency, and lower cost. The key is13 min