Cold data is read-only - that's the assumption baked into basically every tiering solution on the market. ColdFront breaks it.

UPDATE & DELETE on archived rows work through standard SQL. A GDPR deletion on five-year-old events is a single DELETE statement. No restore cycle.

DuckDB in-process, Apache Iceberg on any S3. Stock unpatched #PostgreSQL 16/17/18. Beta now, PostgreSQL License. Led by Jimmy Angelakos.

📖 https://github.com/pgEdge/coldfront

#OpenSource #DataEngineering #ApacheIceberg #DuckDB

⚡ You have a 2 GB CSV and want to query it in SQL without installing anything.
DuckDB loads it, analyzes it, and hooks into pandas in three lines. 👇
https://gomoot.com/duckdb-analizza-file-csv-json-e-parquet-senza-server-ne-configurazione/

#dati #duckdb #opensource #sql

pdo_duckdb: a native PDO driver for DuckDB, the in-process analytical database. Query DuckDB's columnar engine through the standard PDO API, with a native appender for bulk loads and optional result streaming for big scans.

https://github.com/iliaal/pdo_duckdb
Write-up: https://ilia.ws/blog/pdo-duckdb-a-pdo-driver-for-duckdb

#php #duckdb #pdo

ElePHPants now also love Ducks! 🐘 💓 🦆

After using DuckDB for many years successfully in production, I've decided to build a native PHP DuckDB PDO driver:

https://github.com/thomas-0816/pdo-duckdb-php

#php #duckdb

DuckDB Internals: Why is DuckDB Fast?

Walk through DuckDB's internals: how it skips serialization overhead, parses and optimizes SQL, and stores data in columnar row groups with zone maps.
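
To make "row groups with zone maps" concrete, here is a toy sketch in plain Python. It is not DuckDB's implementation: the `RowGroup` class, the helper names, and the group size of 4 are all hypothetical. It shows only the idea that per-group min/max statistics let a scan skip whole groups that cannot match a filter.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class RowGroup:
    """A chunk of one column plus its zone map (min/max of the chunk)."""
    values: List[int]
    min_value: int
    max_value: int


def build_row_groups(column: List[int], group_size: int) -> List[RowGroup]:
    # Split the column into fixed-size chunks and record min/max for each.
    groups = []
    for start in range(0, len(column), group_size):
        chunk = column[start:start + group_size]
        groups.append(RowGroup(chunk, min(chunk), max(chunk)))
    return groups


def scan_greater_than(groups: List[RowGroup], threshold: int) -> Tuple[List[int], int]:
    # Return matching values and how many groups actually had to be read.
    matches: List[int] = []
    scanned = 0
    for group in groups:
        if group.max_value <= threshold:
            continue  # zone map proves no value in this group can match
        scanned += 1
        matches.extend(v for v in group.values if v > threshold)
    return matches, scanned


column = list(range(1, 13))  # hypothetical column holding 1..12
groups = build_row_groups(column, group_size=4)
matches, scanned = scan_greater_than(groups, threshold=9)
print(matches, scanned)
```

With sorted or clustered data, most groups are ruled out by their min/max alone, so the filter touches only a fraction of the column.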

pgEdge ColdFront: #PostgreSQL data tiering. Hot data in the heap, cold to Apache Iceberg on S3 - up to 90% lower storage cost.

The cold tier is writable. UPDATE & DELETE on cold rows work in standard SQL. No restore cycle, no rehydration. No app changes.

DuckDB runs in-process. No daemon, no sidecar. PostgreSQL License, beta now. Led by @vyruss.

Press release: 👉 https://www.pgedge.com/press-releases/pgedge-announces-coldfront-for-postgresql

GitHub: 🔗 https://github.com/pgEdge/coldfront

#OpenSource #DataEngineering #ApacheIceberg #DuckDB

a-case-for-sqlite-as-a-file-format

Everything seems to be turning columnar. It's not just Parquet, but also Arrow and DuckDB.

I'm still trying to grasp how it works, but it's an interesting development.

#columnar #parquet #arrow #duckdb

DuckDB: the flashiest thing since the iPhone-in-a-box stunt. 🦆💨 Let's spend 18 minutes marveling at how a research project became the #database equivalent of avocado toast: every startup's favorite. 🍞📈
https://www.greybeam.ai/blog/duckdb-internals-part-1 #DuckDB #AvocadoToast #Innovation #Startups #ResearchProject #HackerNews #ngated