Title: P1: MLOps course, MLFlow [2024-05-29 Wed]
And I wrote code that enhance ⚑️ word-capitalize ⚑️
command to capitalize first letter of current line and
current word.
😢 #dailyreport #mlopszoomcamp #zoomcamp #emacs #ml #mlops

Title: P0: MLOps course, MLFlow [2024-05-29 Wed]
Today I have been learning MLFlow and doing homework in
MLOps Zoomcamp course. ☯

MLFlow is much more interesting for me than Kubeflow.
MLFlow just store artifacts and all data and track
experiments. (❀ˆᴗˆ)

Next target for me is to learn DVC and A/B testing. β›³

For Emacs I learned how to execute code remotely from Org
src block, you just add :dir with /ssh:host: β§‚ #dailyreport #mlopszoomcamp #zoomcamp #emacs #ml #mlops

πŸš•πŸ’‘ The model is up and running! It predicts ride durations for NY Yellow Taxi trips, and I’m loving the MLOps journey. Now focusing on deploying the model and automating the process. #DataScience #AI #MachineLearning #MLOps #ZoomCamp #DataTalksClub
πŸ“ŠπŸ’» Just completed the linear regression model to predict ride durations based on data from Jan-Feb 2023. Now on to tuning and integrating the model into a Docker container. Next steps ahead! #MachineLearning #DataScience #MLOps #ZoomCamp #DataTalksClub
πŸ—½πŸš– Starting with the NY Yellow Taxi dataset from Jan-Feb 2023! Preparing to build a regression model to predict ride durations. Time to dive into the data and start exploring! #MLOps #ZoomCamp #DataTalksClub #MachineLearning
🌟 Just wrapped up the homework for Batch 5 of the Zoomcamp!
I processed and analyzed the yellow_tripdata_2024-10.parquet and taxi_zone_lookup.csv datasets using PySpark and Spark SQL. Feels great to finish a hands-on project! πŸ†
#DataEngineering #Zoomcamp #DataTalks #ETL #PySpark #SparkSQL
πŸ“ˆ Spark SQL is amazing!
Today I worked on SQL queries within PySpark to analyze and transform large datasets. This is such a powerful tool for data engineering! πŸš€
#DataEngineering #Zoomcamp #PySpark #DataTalks #SparkSQL
πŸ’₯ Today, I started using Spark on GCP with PySpark.
Worked with yellow_tripdata_2024-10.parquet and taxi_zone_lookup.csv to process data. Learning how Spark handles big data in the cloud is incredible! πŸš—
#DataEngineering #Zoomcamp #DataTalks #PySpark #BigData #Spark
πŸš€ I’ve just started the Zoomcamp Data Engineering by @DataTalksClub!
This module focuses on ETL processing with Spark, Spark SQL, and DataFrames. Excited to dive into big data processing and learn how to use Spark at scale! πŸ”₯
#DataEngineering #Zoomcamp #DataTalks #PySpark
πŸ”§ Now we’re building the pipeline! πŸ› οΈ Transforming the #NYCTaxi data into something useful by processing it with DLT and sending it to DuckDB. πŸš‚πŸ’Ύ Stay tuned as we turn raw data into insights! #DataEngineering #DuckDB #DLT" #Zoomcamp