Antonio Alvarado Hernández

@tnotstar
1 Followers
35 Following
46 Posts
Yet another defector programmer!!
homepagehttps://tnotstar.dev

🚀 Just finished my #DEZoomcamp project! I built an end-to-end pipeline to process population frequencies from the European Variation Archive (EVA).

The Stack:
🛠️ Orchestration: #Bruin (Asset-based & lightweight)
📥 Ingestion: On-the-fly Python filtering
⚡ DWH: #ManticoreSearch for sub-second variant lookups
📊 UI: #Gradio dashboard
🐳 Env: #Docker & Codespaces & Cloud Run

Efficiency > Big Budgets. 🧬

🔗 https://github.com/tnotstar/data-engineering-zoomcamp-2026-project-attempt-1

#DataEngineering #Python #OpenSource #LearningInPublic #DataTalksClub

GitHub - tnotstar/data-engineering-zoomcamp-2026-project-attempt-1: First project attempt for Data Engineering Zoomcamp

First project attempt for Data Engineering Zoomcamp - tnotstar/data-engineering-zoomcamp-2026-project-attempt-1

GitHub

Module 7 of Data Engineering Zoomcamp done!

- Kafka producers and consumers
- PyFlink tumbling and session windows
- Real-time taxi data analysis
- Redpanda as Kafka replacement

My solution: https://github.com/tnotstar/data-engineering-zoomcamp-2026-07-streaming

Free course by @DataTalksClub: https://github.com/DataTalksClub/data-engineering-zoomcamp/

GitHub - tnotstar/data-engineering-zoomcamp-2026-07-streaming: Seventh homework for Data Engineering Zoomcamp

Seventh homework for Data Engineering Zoomcamp. Contribute to tnotstar/data-engineering-zoomcamp-2026-07-streaming development by creating an account on GitHub.

GitHub

⚡ Module 6 of Data Engineering Zoomcamp done!

- Batch processing with Spark 🔥
- PySpark & DataFrames
- Parquet file optimization
- Spark UI on port 4040

My solution: https://github.com/tnotstar/data-engineering-zoomcamp-2026-06-batch

Free course by @DataTalksClub: https://github.com/DataTalksClub/data-engineering-zoomcamp

#dezoomcamp

dlt Workshop of Data Engineering Zoomcamp done!

- REST API pipelines with @dltHub
- AI-assisted pipeline building
- DuckDB as local data warehouse
- dlt Dashboard & marimo notebooks

My solution: https://github.com/tnotstar/data-engineering-zoomcamp-2026-workshop-1

Free course by @DataTalksClub: https://github.com/DataTalksClub/data-engineering-zoomcamp/

GitHub - tnotstar/data-engineering-zoomcamp-2026-workshop-1: First workshop for Data Engineering Zoomcamp

First workshop for Data Engineering Zoomcamp. Contribute to tnotstar/data-engineering-zoomcamp-2026-workshop-1 development by creating an account on GitHub.

GitHub

Module 5 of Data Engineering Zoomcamp done! 🫡

- Data Platforms with Bruin
- End-to-end ELT pipelines
- Data quality & lineage
- Deployment to BigQuery

Free course by @DataTalksClub: https://github.com/DataTalksClub/data-engineering-zoomcamp/

#dezoomcamp

GitHub - DataTalksClub/data-engineering-zoomcamp: Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼 - DataTalksClub/data-engineering-zoomcamp

GitHub

Just wrapped up Module 4 of the #dezoomcamp with @DataTalksClub!

Dived deep into Analytics Engineering with dbt:
* Built production-ready models for NYC taxi data
* Managed lineage for 43M+ FHV records
* Wrote data tests to catch schema drifts

The "T" in ELT is powerful! 🚀

#DataEngineering #dbt #learninginpublic

What a hard day!! Now submitting my first homework for #dezoomcamp by @DataTalksClub 💯
Finally, the third project review for the #aidevtools zoomcamp by @DataTalksClub! 🥳
Now, the second one project for the #aidevtools zoomcamp by @DataTalksClub! 🫡
Per-reviewed my first classmate's project for the #aidevtools zoomcamp by @DataTalksClub! 😅