Wayback Machine Director: We Are ‘Collateral Damage’ in the Fight Between AI Companies and Publishers

https://fed.brid.gy/r/https://blog.archive.org/2026/05/06/wayback-machine-director-we-are-collateral-damage-in-the-fight-between-ai-companies-and-publishers/


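Unfortunately, it was not preserved in the #WaybackMachine.

"IPA reporting proxy service, now open

For anyone who has found something that looks like a vulnerability but finds writing the report and dealing with the intake desk a hassle.
Start by sending a DM."
https://x.com/WATab2000/status/2052174236807286973?
"[Sad news] IPA reporting proxy service made private for violating guidelines"
https://x.com/yousukezan/status/2052204314219913527?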


Publishers fighting AI companies are increasingly colliding with web preservation systems ⚖️

In PRESERVING THE WEB IN THE AGE OF AI, @masnick.com examines how attempts to block AI scraping restrict access to archiving tools like the Wayback Machine, creating collateral damage for the public record.

🎧 Listen on the #FutureKnowledge #podcast ⤵️
https://futureknowledge.transistor.fm/episodes/preserving-the-web-in-the-age-of-ai

#WaybackMachine

@internetarchive

Well, it seems it didn't even take a quarter of a year for reality to bite:

»Brewster Kahle, founder of the Internet Archive and the Wayback Machine, the most important archiving projects in the history of the internet, told 404 Media that the skyrocketing costs of storage is “a very real issue costing us time and money.”

“We have found that the preferred 28-30TB drives are just not available or at very high price,” Kahle said. “We gather over 100 terabytes of new materials each day, and we have over 210 Petabytes of materials already archived on machines that need continuous upgrades and maintenance, so we need to constantly get new hard drives.” «

https://www.404media.co/the-ai-hard-drive-shortage-is-making-it-more-expensive-and-harder-to-archive-the-internet/

#WaybackMachine #InternetArchive

The AI Hard Drive Shortage Is Making It More Expensive and Harder to Archive the Internet

The Internet Archive, Wikimedia, academics, and hobby archivists are having trouble finding hard drives or are having to pay extremely high prices for them.

404 Media

RIP Ask Jeeves. The natural-language search engine founded in 1996 was rebranded as Ask in 2006, and officially shut down on May 1.

Here are the Wayback Machine’s first and last captures of the site.

When websites disappear, the historical record can disappear with them. The #WaybackMachine preserves that history – capturing the web so its past remains accessible.

Explore 30 years of web history: https://web.archive.org

#90s #90sNostalgia #WebHistory #WebDesign

Love the Wayback Machine? ❤️
Here’s your chance to stand up for it 📣

When news can't be archived, we all lose part of the public record 🕳️📰

Tell major publishers: keep journalism in the #WaybackMachine.
✍️ Sign here: https://savethearchive.com/NewsLeaders

This campaign is a project of @fight

I lost a backup. The Wayback Machine had my back. I scraped the site, got my files back and donated 50 buckeroos. Thanks #waybackmachine

On World Press Freedom Day, a Call to Keep the News Preserved | Internet Archive Blogs

TIL that in 2001, my old tripod.com site was hacked.

I was looking up my old site, because I was trying to remember an old #freeware #DOS game that I used to play that never got past the demo stage.

And after I looked up my site on the #WaybackMachine, this popup came up. And while I wasn't above putting up joke popups, this is not my style at all. I took my site very seriously!

I must have had a third-party hit counter or something that was compromised. The 2004 capture doesn't seem to show it, and I later migrated to my uncle's domain (when he complained about my site trying to install a dialer on his computer), before abandoning the site completely after a few years. Man, it took me 25 years to find out my website had been hacked. Nooooooooo! I think I still have the HTML files, so I'll look up my code.

#tripod #00s #nostalgia #oldweb #scareware

I have discovered some Italian #copyleft music labels dating back to the early 2000s (my own SubTerra was founded in 2006) that I had never heard of. Only one of them still seems to be active. As soon as I have time, I will use the #WayBackMachine to visit their sites, which are no longer online, to try to find out more and track down some of the artists.

#musicacopyleft #indie #indieitaliano #mastomusica