๐ Milestone Unlocked: Finished the Data Engineering Zoomcamp!
In 10 weeks, I moved from scripting to architecting systems. We built real production-grade infrastructure using Spark, Kafka, Airflow, and Kestraโnot just hobby projects.
Capstone: A Storage Hard Drive Dashboard using real failure data from Backblaze
Stack: Terraform + Docker infra, Airflow orchestration, dbt modeling, Streamlit viz.
Key Lessons:
โ
๏ธ "It works on my laptop" isn't a strategy.
โ
Need IaC, partitioning, clustering, and strict error handling.
โ
dbt ensures reproducible, tested models.
โ
Infra is invisible workโif it breaks, your code fails.
Take the leap! Itโs challenging but by week 10, pieces click into place. Seeing my pipeline run autonomously felt like crossing the finish line. ๐
Thanks Data Talks Club team! On to the next challenge!
My project: https://github.com/ammartin8/hard_drive_analytics_dashboard
#mastodon #fediverse #data #spark #dataengineering #ai #technology #datatools #datapipelines #fedihire #thursday #sql #observability #etl #python #github