🎩 Ah, behold the "DuckDB #community extension for prefiltered #HNSW using ACORN-1" — because the alphabet soup of #developer jargon just wasn't dense enough. 🤦‍♂️ GitHub has unleashed yet another "game-changer" to ensure your code doesn't just run, but performs a high-wire act while juggling flaming chainsaws. 🔥🤹‍♀️
https://github.com/cigrainger/duckdb-hnsw-acorn #DuckDB #ACORN1 #gamechanger #HackerNews #ngated
GitHub - cigrainger/duckdb-hnsw-acorn

Contribute to cigrainger/duckdb-hnsw-acorn development by creating an account on GitHub.

GitHub
GitHub - cigrainger/duckdb-hnsw-acorn

Contribute to cigrainger/duckdb-hnsw-acorn development by creating an account on GitHub.

GitHub

Show HN: DuckDB community extension for prefiltered HNSW using ACORN-1

https://github.com/cigrainger/duckdb-hnsw-acorn

#duckdb #github

Victor M (@victormustar)

Hugging Face에서 hf-mount가 공개되었다. 5TB급 데이터셋을 로컬 폴더처럼 마운트해 필요한 부분만 DuckDB로 질의하고, 모델 저장소를 ls/cat처럼 탐색할 수 있는 등 대용량 데이터·모델 작업을 더 편하게 해주는 새로운 도구로 소개됐다.

https://x.com/victormustar/status/2036476453370380416

#huggingface #hfmount #duckdb #opensource #aiinfrastructure

Victor M (@victormustar) on X

Now available on Hugging Face: hf-mount 🧑‍🚀 The team really cooked, still wrapping my head everything possible but you can do things like: - mount a 5TB dataset as a local folder and query only the parts you need with DuckDB (✅ works) - browse any model repo with ls/cat like

X (formerly Twitter)
Anyone else having a load of breaking changes with the new geometry handling in #duckdb >1.5.0? #rspatial
Announcing DuckDB 1.5.0

We are releasing DuckDB version 1.5.0, codenamed “Variegata”. This release comes with a friendly CLI (a new, more ergonomic command line client), support for the VARIANT type, a built-in GEOMETRY type, along with many other features and optimizations. The v1.4.0 LTS line (“Andium”) will keep receiving updates until its end-of-life in September 2026.

DuckDB
DuckDB 1.5 with spatial updates: DuckDB 1.5 arrives with significant updates for #geospatial: GEOMETRY becomes a built-in data type, storage switches to #WKB with shredding for better compression, #CRS awareness is now part of the type system, and more! Here is a rundown of the most relevant new features in #DuckDB 1.5.
https://spatialists.ch/posts/2026/03/22-duckdb-15-with-spatial-updates/ #GIS #GISchat #geospatial #SwissGIS
DuckDB 1.5 with spatial updates – Spatialists – geospatial news

DuckDB 1.5 arrives with significant updates for #geospatial: GEOMETRY becomes a built-in data type, storage switches to #WKB with shredding for better compression, #CRS awareness is now part of the type system, and more! Here is a rundown of the most relevant new features in #DuckDB 1.5.

Spatialists – geospatial news
🌕 Hacker News 完整數據存檔:Hugging Face 開源數據集詳解
➤ 實時更新的技術社羣數位史料庫
https://huggingface.co/datasets/open-index/hacker-news
本文介紹了託管於 Hugging Face 的 Hacker News 完整數據存檔專案。該數據集收錄了自 2006 年至今的所有論壇內容,包含貼文、評論及各類交互數據,總數超過 4,700 萬筆。透過自動化的管道,系統每 5 分鐘同步一次最新數據,確保開發者能獲取即時資訊。資料以 Parquet 格式儲存,並按年月分層管理,極大提升了查詢與分析的效率,是研究科技趨勢與討論熱度的極佳數據源。
+ 終於有一個能夠直接透過 SQL 查詢的 Hacker News 數據集了,對於想分析多年技術趨勢的工程師來說非常實用。
+ 每 5 分鐘同步一次的架構很穩健,透過 Parquet 分層儲存的作法,既節省了儲存成本,也讓後續的數據挖掘變得輕鬆許多。
#資料工程 #開源數據集 #Hacker News #Python #DuckDB
open-index/hacker-news · Datasets at Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

¡Pruebas brutales con DuckDB! El MacBook Neo (el más barato de Apple) rivaliza con servidores en la nube hasta 4x más RAM en cargas de bases de datos pesadas. En cold runs, ¡hasta 2.8x más rápido que instancias cloud! Big Data en tu mochila 💻🚀 #MacBookNeo #DuckDB #AppleSilicon

Ever tried filtering on COUNT() with WHERE and hit a confusing error?

The problem: WHERE runs before GROUP BY, so aggregate functions don't exist yet. That's where HAVING comes in. It filters after grouping.

WHERE = filter individual rows
HAVING = filter aggregates

Read more: https://jamalhansen.com/blog/having-filtering-grouped-results

#SQL #Python #DuckDB #DataScience #Programming

HAVING: Filtering Grouped Results

WHERE filters rows before grouping; HAVING filters after. Need "only cities with more than 10 customers"? That's HAVING.

Jamal Hansen