Hi All! I started posting about my data engineering learning journey and thought it would be great to share it here as well!

🚀 Week 3 of Data Engineering Zoomcamp by DataTalksClub complete! I'm really enjoying how hands-on and practical the course is so far!

This week I focused on data warehousing with #Google #BigQuery. Coming from the world of #Microsoft Azure, it was a great experience to get familiar with BigQuery's serverless architecture and how it manages and processes big data at scale. Here's what I learned:

✅️ Created external tables from GCS bucket data sources (CSV/Parquet)
✅️ Used partitioning/clustering to save on cost & speed up SQL query processing
✅️ Used both #Docker & #Kestra to orchestrate the extraction, transfer, and loading of 20+ million NYC taxi records into a GCS bucket
✅️ Understood the advantages of columnar storage and query optimization

Check out my work here: https://github.com/ammartin8/data_engineering_zoom_camp/blob/main/modules/module_3/project_03/README.md

#googlecloud #dataengineering #microsoft #cloud #bigdata #dataanalytics #fedihire #linux #data

After migrating Kestra from Java 21 to Java 25, we see a significant improvement in memory usage.
It uses 35% less heap and 12% less metaspace!
Upgrading always brings benefits ;)
#java #kestra
https://github.com/kestra-io/kestra/pull/14221
Chore/java 25 by loicmathieu · Pull Request #14221 · kestra-io/kestra

Stop fighting complex data pipelines. Kestra is an open-source, event-driven orchestrator that's infinitely scalable. 🧩 Define workflows with simple YAML or a no-code UI, then sync with Git. #opensource #kestra #orchestration #dataops #devops

Looking for a cheaper or free/self-hosted alternative to #zapier.

Anyone got some direction to point me to?

I’ve seen #n8n and #Automatisch and #kestra and they all seem nice, but they have varying and very limited integration with other services (the one big area where Zapier wins, but the price is just ridiculous for a private individual)

Spun up an LXC and installed #kestra, an “Open Source, Declarative Orchestration Platform” (https://kestra.io/). The little YouTube influencer videos and Kestra’s own tutorial videos didn’t indicate how many little feature locks 🔐 were littering the application…

Like there aren’t enough orchestration engines… sheesh.


In my new blog article, I show how the open-source orchestration tool Kestra helps make complex processes efficient: YAML-based, flexible, and cloud-ready.

🔧 Ideal for DevOps, data pipelines, or as an AI agent replacement.

📘 Read now: https://www.marcogriep.de/posts/kestra-orchestrierungs-tool-zur-optimierung-von-workflows/

#Kestra #WorkflowAutomation #DevOps #OpenSource #TechBlog

Kestra - Orchestration Tool for Optimizing Workflows

Data pipelines must not only be robust and scalable, but also easy to manage and maintain. This is exactly where Kestra comes in: an orchestration tool that simplifies creating and managing workflows.

Griep Marco - IT Consulting, Software Development & IT Training

✅ Just finished Module 2 of the #DataEngineering Zoomcamp!
I’ve built workflow automation with Kestra and learned how to schedule and monitor jobs.
Loving the simplicity of its declarative approach!
#Kestra #Orchestration #DataPipelines #DataTalksClub #Zoomcamp

💡 Exploring Kestra in the Zoomcamp!
Its YAML-based workflows make automation intuitive and scalable.
I just scheduled my first data pipeline. This tool is promising!
#Kestra #WorkflowOrchestration #DataPipelines #DataTalksClub #zoomcamp

🚀 I’ve just started Module 2 - Workflow Orchestration in the #DataEngineering Zoomcamp!
This week is all about Kestra, a modern workflow orchestrator.
Looking forward to automating data pipelines with it! 🔥
#Kestra #DataPipelines #DataEngineering #DataTalksClub #zoomcamp

Amazing weeks learning about workflow orchestration using #kestra.
Thanks to Will Russell for his amazing master classes and #DataTalksClub for organizing it.

Please join if you want to have fun with data engineering tasks 😀