Topic
Data Engineering
Learn Data Engineering as a connected topic across chapters, concepts, simulations, and interview reasoning.
10 Concepts17 Articles5h 26m
Overview
Learn Data Engineering as a connected topic across chapters, concepts, simulations, and interview reasoning.
How this topic helps
Architecture
Big Data
System Design
#apache Spark
Learning Path in this Topic
Series that contain articles from Data Engineering. Select a path to filter the article list.
Articles
17 matched articles
Article 4Modern Table Formats: Delta Lake vs Apache Iceberg vs Apache HudiTLDR: Delta Lake, Apache Iceberg, and Apache Hudi are open table formats that wrap Parquet files with a transaction log (or snapshot tree) to deliver ACID guarantees, time travel, schema evolution, an24 min
Article 5Medallion Architecture: Bronze, Silver, and Gold Layers in PracticeTLDR: Medallion Architecture solves the "data swamp" problem by organizing a data lake into three progressively refined zones — Bronze (raw, immutable), Silver (cleaned, conformed), Gold (aggregated, 23 min
Article 6Kappa Architecture: Streaming-First Data PipelinesTLDR: Kappa architecture replaces Lambda's batch + speed dual codebases with a single streaming pipeline backed by a replayable Kafka log. Reprocessing becomes replaying from offset 0. One codebase, n21 minPage 1 of 3