The next #PyData #Helsinki #meetup is on 19 March 2026 at Elisa, Ratavartijankatu 5.

On the menu: optimisation problems in electricity markets, a deep dive into #DeltaLake read performance without a cluster, two lightning talks, and an intro to Data & AI at our sponsor Elisa.

And of course our already traditional quiz, where the main challenge is understanding how answers are scored.

https://www.meetup.com/pydatahelsinki/events/313128987/?isFirstPublish=true

#DataScience #DataEngineering #Python

PyData Helsinki meetup at Elisa, Thu, Mar 19, 2026, 5:00 PM | Meetup

We're excited to announce that the next PyData Helsinki meetup will be hosted by **Elisa** on Thursday, 19 March 2026! **Important:** Because of venue security requirement

Meetup

My most frequently asked question in 2025 was: how do you design for high-throughput ingestion with #deltalake?

Over the last week I took the time to write down exactly how I design for throughput in this blog post: https://www.buoyantdata.com/blog/2026-01-02-design-for-throughput.html

High-throughput data ingestion with the Buoyant Architecture

Delta Lake allows for building high-throughput applications, especially for append-only workloads as part of a medallion architecture. In this post we review the high-throughput data ingestion architecture deployed by Buoyant Data using oxbow, which separates write and transaction management for efficiency when bringing data into the bronze layer.

Spark SQL for Data Engineering 1: I am going to start Spark SQL sessions as a series. #sparksql

Spark SQL Part 1: I am going to start Spark SQL sessions as a series. #sparksql #deltalake #pyspark Databricks Notebooks code for ... source

https://quadexcel.com/wp/spark-sql-for-data-engineering-1-i-am-going-to-start-spark-sql-sessions-as-series-sparksql/

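A statistical analysis API for retail, built with Quarkus - Qiita

Introduction: This is the day 22 article of the TRIAL&RetailAI Advent Calendar 2025. Yesterday's article was @akaitigo's "I tried building AgentSkills for Quarkus+Kotlin development: project generation + error analysis skills". I...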

Qiita

🎉 Excited to Share a Milestone!

I’m thrilled to announce that I’ve successfully earned the Databricks Academy Platform Administrator accreditation! This achievement validates my ability to manage and administer the Databricks Lakehouse Platform effectively.

Looking forward to continuing to explore advanced analytics, AI/ML integration, and generative AI workflows within Databricks.

#Databricks #DataEngineering #Spark #DeltaLake #UnityCatalog

https://credentials.databricks.com/bb9c88f7-d446-4a30-9115-b7bf5df5567c#acc.XmSM63zT

Accredible • Certificates, Badges and Blockchain

Home of digital credentials

Accredible • Recipient Portal

^ Less than a day to sign up and get our take on #SAP Business Data Cloud and the Delta Sharing ecosystem

#databricks #businessdatacloud #deltalake #deltasharing #dremio #webinar

kafka-delta-ingest was the project that spawned the development of #deltalake for #rustlang, also known as delta-rs.

Last week I decommissioned the last of those processes. I have since made our ingestion even cheaper, but kafka-delta-ingest will always hold a spot in our history.

https://brokenco.de/2025/10/30/kafka-delta-ingest-was-fun.html

Based Lake, a petabyte-scale low-latency data lake

I had a chat today about building large-scale, low-latency data retrieval systems around AWS S3. In doing so I got to share a bit of the talk proposal I submitted to Data and AI Summit this year about real-life work that has made it into production.

brokenco.de

Tomorrow at 7am PT (14:00 UTC) I'll be doing some #deltalake hacking with some other 🦀 folks on #deltalive!

https://www.twitch.tv/agentdero/schedule

📣I wrote a blog 🚀📰:

The Data Surrender Trap: How Enterprises Are Losing Control in the AI Gold Rush—and the Simple Fix 👇

https://www.softinio.com/post/the-data-surrender-trap/

#AI #Deltalake #DataEngineering #DataGovernance #ArtificialIntelligence

The Data Surrender Trap: How Enterprises Are Losing Control in the AI Gold Rush—and the Simple Fix

Avoid the data-surrender trap: keep data in-place with open standards and governance, share securely, and bring AI to your data—not the other way around.

Salar Rahmanian

@Schneems I don't know about best practices, but once upon a while ago I wrote why we re-export some symbols in #deltalake https://brokenco.de/2023/07/26/rust-re-export.html
