Surfalytics
All pet projects
Data Engineering beginner ⏱ 4–6 hours

Polars: Fast DataFrame Library

Learn Polars — the Rust-based DataFrame library that outperforms Pandas on large datasets with a clean, expressive API.

PolarsPythonDataFramesPerformance
View project on GitHub

What you’ll build

A set of data manipulation exercises in Polars covering loading, filtering, grouping, joins, and window functions — all benchmarked against equivalent Pandas code. A strong addition to any Python data engineering portfolio.

Skills you’ll practice

  • Polars eager vs lazy API (LazyFrame)
  • Expressions: select, filter, groupby, join, window
  • Reading Parquet, CSV, and JSON efficiently
  • When to use Polars vs Pandas vs Spark