Why we do this work…

by @beet_keeper

There aren’t many rewards in a discipline that is about taking the long-term view, but occasionally something comes up that you can take some pride in.

Last month, Ed Summers put out a call on Mastodon (digipres.club): he was wrestling with a CD-R format that was difficult to recognize. The disks likely held precious data belonging to his late brother.

Much of the search area had already been examined and narrowed down by folks in the community, including Misty de Meo, Roxi Ruuska, Ethan Gates, and Johan van der Knijff, who all contributed suggestions and analysis.

Ed was able to share a copy of one of his disk images, and I had some time that I could dedicate to taking a look as well.

Long story short, we were able to identify the disks, and Ed has written up the background here: https://inkdroid.org/2026/06/12/tascam/

The situation might be familiar to others: a digital file that isn’t recognized by the major file format identification tools, and yet, because of its context, you know it is something that might be important.

I have had different experiences with these types of files: sometimes they are valuable (and you want to look after them), sometimes they are not (and it can still benefit you to get rid of them). The process of finding this out often follows a similar path.

In this instance, the files turned out to be incredibly valuable, and I wanted to elaborate on the path of discovery. Even though it really isn’t very sophisticated, I hope it will be helpful to those with unidentified digital records who might find the task of identifying them quite daunting.


#DAW #digipres #DigitalForensics #digitalHeritage #DigitalPreservation #internetArchive #MusicProduction #personalDigitalArchiving #TASCAM #TEAC #WebArchives #webArchiving

So, the #WaybackMachine does have snapshots of the Institute of Urban Homesteading, whose motto is "Agitate, Educate, Pollinate!"

https://web.archive.org/web/20260000000000*/https://iuhoakland.com/

#SolarPunkSunday #WebArchives #UrbanHomesteading

how can we use the internet to study the internet? look forward to co-learning workshop on digital methods for studying internet culture at Seoul National University, 8-9th June 2026 - with Chamee Yang @lbngr Jisu Lee & more! ✨🐰🐨🦔🐢🐬 https://jonathangray.org/2026/05/28/snu-workshop
#seoul #internetstudies #internetarchive #webarchives #digitalmethods
what is the datafied web? how to study it? and reimagine it? we chat in Internet Histories journal ✨🐰🦔🐨🐢🦉🐿️🦙🦆💬 https://jonathangray.org/2026/05/18/datafied-web
with @nthylstrup Miglė Bareikytė, Carolin Gerlitz, @sebgiessmann @annehelmond Ian Milligan & Valérie Schafer.
come for the datafied web - stay for the AI #slop #adtech #deepfakes #bots #leaks #trolls #bubbles #linkrot #fintech #osint #dataloss #softwaredecay #venturecapital #platformlabour #webarchives #diywebsites #algorithms #surveillancecapitalism #poeticcomputation & more 😋
article on “the datafied web” in Internet Histories

An article on “the datafied web” that I co-authored has just been published in the Internet Histories journal.

Jonathan W. Y. Gray

@WvOostveen Maybe it sometimes nudges people into taking out a subscription after all? Good journalism (#journalistiek) is important.

I almost always add an archive link myself, so that everyone can read the article and it is also preserved for posterity.

There is also a handy add-on available for that:

https://github.com/dessant/web-archives

#archive #webarchives

National Library of Finland: Principles for Finnish Web Archive content selection published. “The National Library of Finland is responsible for the diverse and representative preservation of online material. To make this work more transparent, we produced a document entitled Content selection for the Finnish Web Archive, outlining the principles for content selection in thematic and continuous […]

https://rbfirehose.com/2026/05/13/national-library-of-finland-principles-for-finnish-web-archive-content-selection-published/

ResearchBuzz: Firehose

Tom’s Hardware: Internet archival sites struggling to preserve the internet because of skyrocketing hard drive prices due to the AI boom — Wayback Machine and Wikimedia punished by stratospheric storage pricing and stricter anti-scraping measures blocking the wrong bots. “The internet is getting harder to archive because the AI boom has caused a storage crisis, with both NAND and mechanical […]

https://rbfirehose.com/2026/05/09/toms-hardware-internet-archival-sites-struggling-to-preserve-the-internet-because-of-skyrocketing-hard-drive-prices-due-to-the-ai-boom-wayback-machine-and-wikimedia-punished-by-stratosphe/

ResearchBuzz: Firehose
📯This week in #DigitalHistoryOFK: Jane Winters (University of London) presents “From ‘digital dark age’ to ‘an age of historical abundance’?” on born-digital cultural heritage. Exploring web archives, social media & personal data, the talk reflects on preservation, infrastructures & historical value.
📅6 May 2026, 16–18 CET (Zoom)
ℹ️Info: https://dhistory.hypotheses.org/13238
#4memory #WebArchives #DigitalPreservation
@historikerinnen
@histodons
@digitalhumanities
ChallengeAction - AWS WAFV2

Specifies that AWS WAF should run a Challenge check against the request to verify that the request is coming from a legitimate client session:

New blog via SAA from my colleague and me about our process for archiving #warc files from Archive-It #digipres #webarchives https://saaers.wordpress.com/2026/04/22/an-approach-to-backing-up-internet-archive-web-crawls/
An Approach to Backing up Internet Archive Web Crawls

By Susan Borda and Scott Witmer. Adapted from the DPC Digital Preservation Workflow Webinar series, March 2026. The University of Michigan Library web archiving initiative began as a pilot prog…

bloggERS!