Surfalytics
Streaming & Real-time · Advanced · ⏱ 30–40 hours

Data Streaming Mastery

A comprehensive hands-on course that goes from Kafka fundamentals to production-grade real-time pipelines. Every module has exercises, solutions, and Docker environments.

Kafka · Flink · Spark Streaming · Real-time · Docker

What you’ll build

A production-grade real-time data pipeline built module by module: Kafka cluster, stream processing with Flink or Spark Structured Streaming, stateful operations, windowing, and sink connectors to a warehouse. Every module includes exercises and solutions.

Skills you’ll practice

  • Kafka: brokers, topics, Kafka Connect, Schema Registry
  • Stream processing: Flink or Spark Structured Streaming
  • Stateful operations: aggregations, joins, windowing
  • End-to-end pipeline: source → Kafka → processor → sink
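
To give a feel for the windowing skill listed above, here is a minimal sketch of a tumbling-window count in plain Python. It is not taken from the course material: the event shape `(key, timestamp_ms)`, the function name `tumbling_window_counts`, and the sample data are all assumptions made for illustration. Flink and Spark Structured Streaming do the same bucketing, and also handle watermarks, late events, and fault-tolerant state, none of which this sketch attempts.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

# Hypothetical event shape: (key, event time in milliseconds).
Event = Tuple[str, int]


def tumbling_window_counts(
    events: Iterable[Event], window_ms: int
) -> Dict[Tuple[int, str], int]:
    """Count events per key in fixed, non-overlapping event-time windows."""
    counts: Dict[Tuple[int, str], int] = defaultdict(int)
    for key, ts_ms in events:
        # Align the timestamp down to the start of its window.
        window_start = ts_ms - (ts_ms % window_ms)
        counts[(window_start, key)] += 1
    return dict(counts)


# Made-up sample data, standing in for records read from a Kafka topic.
sample_events = [
    ("page_view", 1_000),
    ("page_view", 4_500),
    ("click", 4_900),
    ("page_view", 5_000),
    ("click", 9_999),
    ("click", 10_000),
]

result = tumbling_window_counts(sample_events, window_ms=5_000)
for (window_start, key), count in sorted(result.items()):
    print(window_start, key, count)
```

Each event falls into exactly one window because the window start is computed by rounding the timestamp down to a multiple of `window_ms`. An event at 9,999 ms belongs to the window starting at 5,000 ms, while one at 10,000 ms opens the next window.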