Abstract AlgorithmsAbstract Algorithms

  • Home
  • All Posts
  • All Series
  • About

Category

apache spark

3 articles

Big Data Engineering: Your Complete Learning Roadmap

TLDR: πŸ—ΊοΈ You want to learn Big Data Engineering, but the ecosystem feels overwhelming. This roadmap breaks down 11 posts across 4 phases: Foundations β†’ Architecture β†’ Pipelines β†’ Advanced. Start with the 5 Vs and Data Lakes, then tackle Lambda Archi...

Mar 28, 2026β€’18 min read
Modern Table Formats: Delta Lake vs Apache Iceberg vs Apache Hudi

Modern Table Formats: Delta Lake vs Apache Iceberg vs Apache Hudi

TLDR: Delta Lake, Apache Iceberg, and Apache Hudi are open table formats that wrap Parquet files with a transaction log (or snapshot tree) to deliver ACID guarantees, time travel, schema evolution, an

Mar 28, 2026β€’27 min read

Apache Spark for Data Engineers: RDDs, DataFrames, and Structured Streaming

TLDR: Apache Spark distributes Python DataFrame jobs across a cluster of executors, using lazy evaluation and the Catalyst query optimizer to process terabytes with the same code that works on gigabytes. Master partitioning, shuffle-awareness, and St...

Mar 28, 2026β€’20 min read

Abstract Algorithms

Exploring the fascinating world of algorithms, data structures, and software engineering through clear explanations and practical examples.

Navigation

  • Home
  • All Posts
  • All Series
  • About

Series

  • Software Engineering Principles
  • Machine Learning Fundamentals
  • Low-Level Design Guide
  • LLM Engineering
  • How It Works: Internals Explained
  • Data Structures and Algorithms
  • Big Data Engineering
  • Architecture Patterns for Production Systems
  • Agentic AI: LangChain and LangGraph

Popular Topics

  • Roadmap
  • learning path
  • langchain
  • Python
  • llm
  • big data

Author

Abstract Algorithms

Abstract Algorithms

@abstractalgorithms

Β© 2026 Abstract Algorithms. All rights reserved.

Powered by Hashnode