🌟 Just wrapped up the homework for Batch 5 of the Zoomcamp!
I processed and analyzed the yellow_tripdata_2024-10.parquet and taxi_zone_lookup.csv datasets using PySpark and Spark SQL. Feels great to finish a hands-on project! 🏆
#DataEngineering #Zoomcamp #DataTalks #ETL #PySpark #SparkSQL
📈 Spark SQL is amazing!
Today I worked on SQL queries within PySpark to analyze and transform large datasets. This is such a powerful tool for data engineering! 🚀
#DataEngineering #Zoomcamp #PySpark #DataTalks #SparkSQL
💥 Today, I started using Spark on GCP with PySpark.
Worked with yellow_tripdata_2024-10.parquet and taxi_zone_lookup.csv to process data. Learning how Spark handles big data in the cloud is incredible! 🚗
#DataEngineering #Zoomcamp #DataTalks #PySpark #BigData #Spark
🚀 I’ve just started the Data Engineering Zoomcamp by @DataTalksClub!
This module focuses on ETL processing with Spark, Spark SQL, and DataFrames. Excited to dive into big data processing and learn how to use Spark at scale! 🔥
#DataEngineering #Zoomcamp #DataTalks #PySpark

Happy New Year to all! 🎉 Starting 2024 with a bang, we've just posted the final sessions of our DataTopics #RootsConf interviews. Delve into the dynamic world of AI and Data Insights with our latest podcast episode. 🤖💼 🎧 #DataTalks #TechTrends2024

https://www.datatopics.io/

DataTopics Unplugged

Welcome to the cozy corner of the tech world where ones and zeros mingle with casual chit-chat. DataTopics Unplugged is your go-to spot for relaxed discussions around tech, news, data, and society. Dive into conversations that should flow as smooth...
