Abstract Algorithms

  • Home
  • All Posts
  • All Series
  • About

Category

transformers

1 article in this category

Practical LLM Quantization in Colab: A Hugging Face Walkthrough

TLDR: This is a practical, notebook-style quantization guide for Google Colab and Hugging Face. You will quantize real models, run inference, compare memory/latency, and learn when to use 4-bit NF4 vs safer INT8 paths. 📖 What You Will Build in Thi...

Mar 12, 2026•10 min read

Abstract Algorithms

Exploring the fascinating world of algorithms, data structures, and software engineering through clear explanations and practical examples.

Popular Topics

  • System Design
  • architecture
  • event-driven-architecture
  • data-engineering
  • Microservices
  • distributed systems

Author

Abstract Algorithms

@abstractalgorithms

© 2026 Abstract Algorithms. All rights reserved.

Powered by Hashnode