Surfalytics
Data Engineering · Intermediate · ⏱ 6–8 hours

Trino: Distributed SQL Query Engine

Deploy Trino with Docker and as a standalone install, then connect it to multiple data sources and run dbt against it through the dbt-trino adapter.

Trino · Docker · SQL · Distributed Query

What you’ll build

A running Trino cluster (both Docker and standalone installs) querying data across multiple sources. You’ll wire up the dbt-trino adapter and run cross-source SQL queries, the kind of setup used at companies that federate data across many systems.

Skills you’ll practice

  • Trino architecture: coordinator, workers, catalogs
  • Deploying distributed systems locally with Docker
  • Federated queries across heterogeneous data sources
  • Connecting dbt to Trino for transformations
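
To make the catalog and federated-query ideas above concrete, here is a minimal sketch in plain Python (standard library only). It renders the contents of two Trino catalog files and builds a cross-catalog join using Trino's catalog.schema.table naming. The catalog name pg, the PostgreSQL host, database, credentials, and the tickets table are all hypothetical placeholders, not part of the project; the tpch catalog uses Trino's built-in TPC-H connector.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Catalog:
    """One Trino catalog, i.e. one file at etc/catalog/<name>.properties."""
    name: str
    properties: dict[str, str]

    def render(self) -> str:
        # Catalog files are plain key=value lines.
        return "\n".join(f"{k}={v}" for k, v in self.properties.items()) + "\n"


def table(catalog: str, schema: str, name: str) -> str:
    """Fully qualified table name in Trino's catalog.schema.table form."""
    return f"{catalog}.{schema}.{name}"


# Built-in TPC-H data generator: needs no external system.
tpch = Catalog("tpch", {"connector.name": "tpch"})

# Hypothetical PostgreSQL source; host, database, and credentials are placeholders.
pg = Catalog("pg", {
    "connector.name": "postgresql",
    "connection-url": "jdbc:postgresql://postgres:5432/shop",
    "connection-user": "trino",
    "connection-password": "change-me",
})

# A federated join: one side comes from the tpch catalog, the other from
# PostgreSQL. The tickets table and its customer_id column are assumed.
federated_sql = f"""
SELECT c.name, count(*) AS open_tickets
FROM {table(tpch.name, "tiny", "customer")} AS c
JOIN {table(pg.name, "public", "tickets")} AS t
  ON t.customer_id = c.custkey
GROUP BY c.name
""".strip()

if __name__ == "__main__":
    for cat in (tpch, pg):
        print(f"# etc/catalog/{cat.name}.properties")
        print(cat.render())
    print(federated_sql)
```

Once a cluster is running with these catalogs in place, the printed query could be pasted into the Trino CLI or sent through any Trino client. The coordinator plans the join and the workers read from both sources, so neither dataset has to be copied into the other system first.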