Chris's Wiki :: blog/web/WebScrapingItsNotJustLoad

I donʼt know whoʼs running the #ArchiveTeam #ArchiveBot, but you just donʼt load someoneʼs server like that. The load is several times greater than what Googlebot and Baidu generate. Itʼs not the first dodgy scraper found on GitHub, and I canʼt trust that people arenʼt using them as theftbots.
"The seized domains – Justicehomeland[.]org, Handala-Hack[.]to, Karmabelow80[.]org, and Handala-Redwanted[.]to – were used by the MOIS in furtherance of attempted psychological operations" #ArchiveTeam

RE: https://bsky.app/profile/did:plc:v63y3emmbz5dlvzkejca2mcc/post/3mhh37nqqpd2z

@jonny Nice. The torrents are seeding smoothly. https://sciop.net/tags/smithsonian

However, I'm not getting any downloads from the webseed, at least in Transmission 4.0.6 (38c164933e). It's looking for a URL like https://smithsonian-open-access.s3.amazonaws.com/media/NASM-2000-9391.tif . Do they really have all files in a single directory?

#digipres #ArchiveTeam
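
For context on the webseed question above, here is a rough sketch of how a client following BEP 19 (HTTP/FTP seeding) derives the URL it requests for each file. The function name `webseed_urls`, the torrent name `media`, and the bare bucket root used as the webseed base are assumptions made for illustration; they are not taken from the actual torrent. Real clients may also differ in details such as how they treat a base URL without a trailing slash.

```python
from urllib.parse import quote


def webseed_urls(base_url, torrent_name, file_paths):
    """Sketch of BEP 19 URL construction (hypothetical helper, not client code).

    base_url     -- one entry from the torrent's "url-list"
    torrent_name -- the "name" field of the info dictionary
    file_paths   -- list of path-component lists for a multi-file torrent,
                    or an empty list for a single-file torrent
    """
    if not file_paths:
        # Single-file torrent: a base ending in "/" gets the name appended,
        # otherwise the base is taken to point at the file itself.
        if base_url.endswith("/"):
            return [base_url + quote(torrent_name)]
        return [base_url]

    # Multi-file torrent: base is treated as a root folder, and the client
    # appends the torrent name, a slash, and each file's path.
    # Simplification: a missing trailing slash is added here.
    root = base_url if base_url.endswith("/") else base_url + "/"
    return [
        root + quote(torrent_name) + "/" + "/".join(quote(part) for part in path)
        for path in file_paths
    ]


# Assumed values for illustration only.
urls = webseed_urls(
    "https://smithsonian-open-access.s3.amazonaws.com/",
    "media",
    [["NASM-2000-9391.tif"]],
)
print(urls[0])
```

Under these assumptions, a torrent whose top-level folder is `media` would make the client request every file directly under `/media/`, which matches the kind of URL quoted above. If the bucket stores objects under deeper prefixes, the request would miss.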

smithsonian - Tag - SciOp

Preserving Public Information

Archive Team has finished crawling goo.gl

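I just saw on Hacker News that Archive Team has finished crawling goo.gl: "ArchiveTeam has finished archiving all goo.gl short links (archiveteam.org)". The link points to the tracker page, and the run currently looks complete, but the "goo.gl" project page has not been updated yet; it will probably be updated later once someone confirms. Previously, in "goo.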

Gea-Suan Lin's BLOG

Digital archival projects are crucial in the fight against fascism. I wrote about the why and the how.

And if you're reading this, that means you have a computer, so you too can contribute!

https://carefullmusings.bearblog.dev/the-urgency-of-digital-archiving/

#ArchiveTeam #SciOp #fascism #archive #resistance #DigitalPreservation

The urgency of digital archiving

I'll keep this one short and sweet: I'll go over the why and the how. Fascists are grappling for power around the world. The US is only the most notable example...

Care-full musings
@raffaele just a heads up: if you wanna help save what’s there you can use an #ArchiveTeam warrior http://warrior.archiveteam.org and set it to work on the goo.gl project
ArchiveTeam Warrior

@slashdot good, one less stupid link shortening service that breaks the web.

Luckily those good folks at #ArchiveTeam have been trying to archive them as much as possible.

You can help save what's on these short-sighted services by running your own VM that archives websites and uploads them to the @internetarchive

Check out ArchiveTeam and get the software here: https://wiki.archiveteam.org/

#archiveteam has put some huge work into their applications recently.

Consider running an #ArchiveTeam_Warrior
https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior