Home

Topic

architecture

66 articles across 31 sub-topics

Sub-topic

Microservices Architecture: Decomposition, Communication, and Trade-offs

TLDR: Microservices let teams deploy and scale services independently — but every service boundary you draw costs you a network hop, a consistency challenge, and an operational burden. The architecture pays off only when your team and traffic scale h...

21 min read
System Design HLD Example: Web Crawler

System Design HLD Example: Web Crawler

TLDR: A distributed web crawler must balance global throughput with per-domain politeness. The architectural crux is the URL Frontier, which manages priority and rate-limiting across a distributed fetcher pool. By combining Bloom Filters for URL dedu...

17 min read

System Design HLD Example: Distributed Job Scheduler

TLDR: A distributed job scheduler ensures tasks fire reliably using a durable Job Store with a next_fire_time index. To handle multiple scheduler instances without double-firing, we use optimistic row-level locking (UPDATE WHERE status='SCHEDULED'). ...

16 min read

Distributed Transactions: 2PC, Saga, and XA Explained

TLDR: Distributed transactions require you to choose a consistency model before choosing a protocol. 2PC and XA give atomic all-or-nothing commits but block all participants on coordinator failure. Saga gives eventual consistency with explicit compen...

25 min read

Modernization Architecture Patterns: Strangler Fig, Anti-Corruption Layers, and Modular Monoliths

TLDR: Large-scale modernization usually fails when teams try to replace an entire legacy platform in one synchronized rewrite. The safer approach is to create seams, translate old contracts into stable new ones, and move traffic gradually with measur...

12 min read

Integration Architecture Patterns: Orchestration, Choreography, Schema Contracts, and Idempotent Receivers

TLDR: Integration failures usually come from weak contracts, unsafe retries, and missing ownership rather than from choosing the wrong transport. Orchestration, choreography, schema contracts, and idempotent receivers are patterns for making cross-bo...

14 min read

Sub-topic

Big Data

6 articles

Medallion Architecture: Bronze, Silver, and Gold Layers in Practice

Medallion Architecture: Bronze, Silver, and Gold Layers in Practice

TLDR: Medallion Architecture solves the "data swamp" problem by organizing a data lake into three progressively refined zones — Bronze (raw, immutable), Silver (cleaned, conformed), Gold (aggregated, business-ready) — so teams always build on a trust...

22 min read
Kappa Architecture: Streaming-First Data Pipelines

Kappa Architecture: Streaming-First Data Pipelines

TLDR: Kappa architecture replaces Lambda's batch + speed dual codebases with a single streaming pipeline backed by a replayable Kafka log. Reprocessing becomes replaying from offset 0. One codebase, no drift. TLDR: Kappa is the right call when your t...

20 min read
Big Data 101: The 5 Vs, Ecosystem, and Why Scale Breaks Everything

Big Data 101: The 5 Vs, Ecosystem, and Why Scale Breaks Everything

TLDR: Traditional databases fail at big data scale for three concrete reasons — storage saturation, compute bottleneck, and write-lock contention. The 5 Vs (Volume, Velocity, Variety, Veracity, Value) frame what makes data "big." A layered ecosystem ...

20 min read

Lambda Architecture Pattern: Balancing Batch Accuracy with Streaming Freshness

TLDR: Lambda architecture is justified when replay correctness and sub-minute freshness are both non-negotiable despite dual-path complexity. TLDR: Lambda architecture is a fit only when you need both low-latency views and deterministic recompute fro...

13 min read

Big Data Architecture Patterns: Lambda, Kappa, CDC, Medallion, and Data Mesh

TLDR: A serious data platform is defined less by where files are stored and more by how changes enter the system, how serving layers are materialized, and who owns quality over time. Lambda, Kappa, CDC, Medallion, and Data Mesh are patterns for makin...

15 min read

Data Warehouse vs Data Lake vs Data Lakehouse: Which One to Choose?

TLDR: Warehouse = structured, clean data for BI and SQL dashboards (Snowflake, BigQuery). Lake = raw, messy data for ML and data science (S3, HDFS). Lakehouse = open table formats (Delta Lake, Iceberg) that bring SQL performance to raw storage — the ...

14 min read

Sub-topic

Hld

5 articles

System Design HLD Example: Real-Time Leaderboard

TLDR: Real-time leaderboards for 10M+ active users require an in-memory ranking engine. Redis Sorted Sets (ZSET) are the industry standard, providing $O(\log N)$ updates and rank lookups via an internal Skip List data structure. Relational databases ...

15 min read

System Design HLD Example: Hotel Booking System (Airbnb)

TLDR: A robust hotel booking system must guarantee atomicity in inventory subtraction. The core trade-off is Consistency vs. Availability: we prioritize strong consistency for the booking path (PostgreSQL with Optimistic Locking) while allowing event...

14 min read
System Design HLD Example: URL Shortener (TinyURL and Bitly)

System Design HLD Example: URL Shortener (TinyURL and Bitly)

TLDR: A URL shortener is a read-heavy system (100:1 ratio) that maps long URLs to short, unique aliases. The core scaling challenge is generating unique IDs without database contention—solved using a Range-Based ID Generator or a Distributed Counter ...

17 min read

System Design HLD Example: Search Autocomplete (Google/Amazon)

TLDR: Search autocomplete must respond in sub-10ms to feel "instant." The core trade-off is Latency vs. Data Freshness: we use an offline pipeline (Spark) to pre-calculate prefix-to-suggestion mappings and store them in Redis Sorted Sets (or a specia...

14 min read

System Design HLD Example: News Feed (Home Timeline)

TLDR: A news feed system builds personalized timelines by combining content publishing, graph relationships, and ranking. The scalability crux is the fan-out amplified write path: a single celebrity post can trigger 100M writes. A hybrid fan-out stra...

18 min read