Chapters
Engineering chapters for understanding systems.
Distributed systems, AI infrastructure, data structures, and system design explained with calm, production-minded depth.
Find a chapter
Continue reading
Pick up a chapter

LLM Skills vs Tools: The Missing Layer in Agent Design
TLDR: A tool is a single callable capability (search, SQL, calculator). A skill is a reusable mini-workflow that coordinates multiple tool calls with policy, guardrails, retries, and output structure.

Little's Law: The Secret Formula for System Performance
TLDR: Little's Law (\(L = \lambda W\)) connects three metrics every system designer measures: \(L\) = concurrent requests in flight, \(\lambda\) = throughput (RPS), \(W\) = average response time. If l

Fine-Tuning LLMs: The Complete Engineer's Guide to SFT, LoRA, and RLHF
TLDR: A pretrained LLM is a generalist. Fine-tuning makes it a specialist. Supervised Fine-Tuning (SFT) teaches it your domain's language through labeled examples. LoRA does the same with 99% fewer tr
Discovery shortcuts
Move into Discovery
System Design
Backend Systems
AI Infrastructure
Distributed Systems
Tradeoff Reasoning
Recent chapters
Editorial reading rhythm
283 chapters
Featured chapter
ANN Index Types Explained: When to Choose Flat, HNSW, IVF, or IVF-PQ
TLDR: If your dataset is small and correctness is critical, use Flat. If you need high recall with low latency and enough RAM, use HNSW. If your corpus is huge and memory is your bottleneck, use IVF-P
Read chapter
Softmax Function Explained: From Raw Scores to ProbabilitiesTLDR: Softmax converts a vector of raw scores (logits) into a valid probability distribution by exponentiating each value and dividing by the total. Subtracting the max before exponentiating prevents 23 minSystems exploration
Follow the concept continuity
Browse all chapters
Softmax Function Explained: From Raw Scores to ProbabilitiesTLDR: Softmax converts a vector of raw scores (logits) into a valid probability distribution by exponentiating each value and dividing by the total. Subtracting the max before exponentiating prevents 23 min readContinue
Read one chapter, then follow the next related system.
Open Discovery
