A truly magic and beautiful generic object identifier (ID) syntax:

⭐️$TIMESTAMP*$CONTEXT*$RANDOM❤️

They auto-cluster related data unavoidably, without requiring any "data-janitor" actions to work.

It transforms any digital data collection into a #DataSoil, which can easily be shaped into a #DataGarden.

I call them "CollisionFriendly IDs" (or #CFIDs for short)

It also provides auto dup-detection

+ #fuzzy #mnemonic search

+ Stays shorter than default path+filename S3 IDs
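
A minimal sketch of how such an ID could be assembled and taken apart again. The timestamp format (compact UTC), the 8-character hex random part, and the function names `make_cfid` / `parse_cfid` are my assumptions for illustration, not part of the syntax above. The surrounding markers from the syntax line can be passed as `prefix` and `suffix`.

```python
import secrets
from datetime import datetime, timezone


def make_cfid(context, now=None, random_part=None, prefix="", suffix=""):
    """Build TIMESTAMP*CONTEXT*RANDOM (hypothetical helper).

    `now` is assumed to be a timezone-aware UTC datetime.
    """
    now = now or datetime.now(timezone.utc)
    stamp = now.strftime("%Y%m%dT%H%M%SZ")  # assumed format: sorts chronologically
    rnd = random_part if random_part is not None else secrets.token_hex(4)
    return f"{prefix}{stamp}*{context}*{rnd}{suffix}"


def parse_cfid(cfid, prefix="", suffix=""):
    """Split an ID back into (timestamp, context, random)."""
    core = cfid[len(prefix):len(cfid) - len(suffix)]
    stamp, rest = core.split("*", 1)      # timestamp is the first field
    context, rnd = rest.rsplit("*", 1)    # random part is the last field
    return stamp, context, rnd
```

Because the timestamp comes first, a plain lexical sort of such IDs groups objects created around the same time, and objects sharing a context stay easy to filter by substring.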

#ahalodeck

I have manually created several hundred thousand filesystem objects on my storage(s).

And far more blocks, which the filesystem must keep assigned and working properly.

The whole world trusts the filesystem to keep those bits in order and accessible.

Why not trust the filesystem to do the same with more objects - and split everything into xattr-annotated objects?

It'd be doing the same thing: Simply for /more/ objects.
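
To make "xattr-annotated objects" concrete, here is a small sketch using Python's standard `os.setxattr` / `os.getxattr` / `os.listxattr` calls (Linux only). The helper names `annotate` and `read_annotations` are hypothetical, and it assumes the filesystem has user xattrs enabled.

```python
import os


def annotate(path, pairs):
    """Store each key/value pair as a user.* extended attribute."""
    for key, value in pairs.items():
        os.setxattr(path, f"user.{key}", value.encode("utf-8"))


def read_annotations(path):
    """Return all user.* extended attributes of a file as a dict."""
    result = {}
    for name in os.listxattr(path):
        if name.startswith("user."):
            raw = os.getxattr(path, name)
            result[name[len("user."):]] = raw.decode("utf-8", errors="replace")
    return result
```

The same attributes are then visible to any other xattr-aware tool, since they live on the filesystem object and not inside an application database.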

#AHAlodeck

https://cloud.arkthis.com/index.php/s/AHAlodeck2026?dir=/videos

A 50-minute show-and-tell about #AHAlodeck and how #xattrs on #ZFS can be used as a #DataCentric catalog/environment.

Enjoy!

Perfect for #longterm preservation #dltp - and any other digital data collections.

#AHAlodeck R&D news:

- Testdataset: 386.5 GB
- 120.000 filesystem objects (files and folders)
- 7616839 key/value entries gathered/extracted as #xattrs.

Result: A 55 MB .tar.bz2 #holotar.

Fully search-and-filter-able by any of the 7.6 million xattrs.

Extraction step required only once (6h)
Afterwards:
Indexing done in 60 SECONDS! 🤩 🥇
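
The post does not say how a #holotar is built. As one possible reading, here is a sketch of a metadata-only archive using Python's `tarfile`: every member has zero payload bytes, and its key/value pairs travel as pax extended headers. The `SCHILY.xattr.` key prefix is the one GNU tar uses for xattrs; the function names and the extra `original_size` key are my assumptions.

```python
import io
import tarfile

XATTR_PREFIX = "SCHILY.xattr.user."  # pax keyword prefix used by GNU tar --xattrs


def build_metadata_tar(entries):
    """entries: iterable of (name, size, {key: value}); returns .tar.bz2 bytes."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w:bz2", format=tarfile.PAX_FORMAT) as tar:
        for name, size, pairs in entries:
            info = tarfile.TarInfo(name=name)
            info.size = 0  # metadata only: no file content is stored
            headers = {XATTR_PREFIX + k: v for k, v in pairs.items()}
            headers[XATTR_PREFIX + "original_size"] = str(size)  # assumed extra key
            info.pax_headers = headers
            tar.addfile(info)
    return buf.getvalue()


def read_metadata_tar(data):
    """Return {member name: {key: value}} from an archive built above."""
    result = {}
    with tarfile.open(fileobj=io.BytesIO(data), mode="r:bz2") as tar:
        for member in tar:
            result[member.name] = {
                k[len(XATTR_PREFIX):]: v
                for k, v in member.pax_headers.items()
                if k.startswith(XATTR_PREFIX)
            }
    return result
```

Key/value text of this kind is highly repetitive (the same keys appear on every object), which is the sort of input bzip2 compresses well.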

Same data (key/value pairs) viewed in a different application:

`apt install eiciel`

That's a #DataCentric paradigm in action! You can literally shoot any mixed (annotated) data at that.

It really puts fun back into using data daily.

#AHAlodeck

Handed in my presentation proposal for #nttw10 #notimetowait

Presenting "#DataGardens" 🌻️🌻️🌻️
Powered by #AHAlodeck to host any arbitrary data, offload key/value pairs to the filesystem, and transform data into annotated, related object graphs.

Pure "#ScienceFaction"!
#WeAllHaveBigDataNow

in #ZFS on #Linux, #Proxmox, #Debian, #Ubuntu #FOSS #FinF #ObjectStorage

Downloading a #wikidata dump for testing #ahalodeck and #xattrs on #zfs long-term storage... 😎

Found this while doing so:

https://www.wikidata.org/wiki/Wikidata:Lists/lexemes/en

It's a statistical breakdown of the #English #language. #Interesting.

#AHAlodeck R&D news:

Created holotar-copy of a real-world mixed audio collection (recording production and digitized tracks archive): ~120.000 filesystem objects (files/folders)

(donated by recording engineers)

filesystem metadata-only = 175 MB (.tar)
+exiftool metadata as xattrs = 252 MB (.tar)

...compressed (.tar.bz2): **6,3 MB** ( 🤯 😎 🤓 💾 ❗)

**This is AMAZING!**

I can browse, order, catalog and manifest any object in this collection using standard #GNU #Linux tools.
@beet_keeper

#AHAlodeck: My best friend wrote a mini-basic fulltext indexer for #xattrs in the shell.

Took ~1.5 seconds (!) to index ~14.000 files with "de-embedded" music tags.

🥳 🎉
I love xattrs.
Imagine doing this with embedded metadata tags? 😉
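
The indexer mentioned above was written in the shell and is not shown here. To illustrate the general idea of a fulltext index over key/value annotations, here is a separate minimal sketch: an inverted index from lowercase word tokens to paths. The names `build_index` and `search` and the in-memory input format are assumptions for illustration.

```python
import re
from collections import defaultdict


def build_index(annotations):
    """annotations: {path: {key: value}} -> {token: set of paths}."""
    index = defaultdict(set)
    for path, pairs in annotations.items():
        for key, value in pairs.items():
            # Index the words of both the key and the value, case-insensitively.
            for token in re.findall(r"\w+", f"{key} {value}".lower()):
                index[token].add(path)
    return index


def search(index, query):
    """Return the paths whose annotations contain every word of the query."""
    tokens = re.findall(r"\w+", query.lower())
    if not tokens:
        return set()
    hits = [index.get(token, set()) for token in tokens]
    return set.intersection(*hits)
```

Since the tags already sit outside the files as plain key/value pairs, building such an index never has to open or parse the media files themselves.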

https://git-annex.branchable.com/git-annex-metadata/

git-annex seems to have built-in key-value #metadata annotation capabilities... 😻 #AHAlodeck ahoi!