VERY urgent that you guys backup and archive AngelFire sites, we probably have like 2 weeks.

@ocean Archive Team are on it, it seems: https://wiki.archiveteam.org/index.php/Angelfire . Probably best to go check on the status over there on their IRC. It's very easy to contribute to Archive Team rescue ops, they have a VM image for volunteers to run that joins their swarm and contributes some bandwidth to the effort.

EDIT: looks like maybe not actively crawling it right now, but if Angelfire's death is making you have activist feelings about preserving the old web, ArchiveTeam is the crew for you! They have a swarm of crawlers that can be retasked to new things, they may just need someone who cares to write the necessary code to spin up the crawl. Not sure if you can go from zero to useful in the timeframe Angelfire needs, but if you want to be a preservation activist in general, you'll probably have more impact with them than alone.

Angelfire - Archiveteam

@danderson @ocean how do i get that vm image? i have a few servers i could throw that on
@Li @danderson @ocean hey, check their docs on how to run the Archive Team Warrior: https://wiki.archiveteam.org/index.php/ArchiveTeam_Warrior#Basic_usage_(virtual_machine) There's also docker-based deployments as well, but the basic one is the VM indeed.
ArchiveTeam Warrior - Archiveteam

@imrehg @danderson @ocean i installed it but i dont see angelfire in the list :?
Fish in the Percolator (@[email protected])

@[email protected] @[email protected] I was looking at that project, but it says its status is "On Hiatus", so I don't think it's currently active?

Fosstodon
@danderson @ocean I was looking at that project, but it says its status is "On Hiatus", so I don't think it's currently active?
@imrehg @ocean The wiki sometimes lags ground truth, especially for things that suddenly become endangered. It's possible AT aren't on this, but the IRC channel would be the place to find out for sure.

@danderson @ocean yeah, wikis gonna be lagging, definitely gonna check in there.

In the Warrior interface that Angelfire project is not listed as an option (as flagged by others too), hence it seems indeed effectively not working.

IRC it is, at the next opportunity:)

@imrehg @ocean If you have permanent resources to spare, my suggestion would be to set the warrior project to "ArchiveTeam's choice" (or whatever the exact text is). That lets them decide what project(s) most urgently need archival capacity and readjust on the fly as urgent things happen.

According to the wiki (so no idea if accurate :) ), currently team's choice is a weighted mix of archiving Telegram groups, Roblox Groups (kinda like facebook walls I think?) and a general ongoing crawl of large sites that are semi-endangered or would be hard to archive in a hurry (e.g. due to aggressive rate limits).

Obviously requires some trust in archiveteam to not screw around, and I can't do that evaluation for you. But I decided that, with a VM boundary, I trusted their intentions enough to let them pick the priorities.

@danderson @ocean yeah, I usually do that Team Choice, it's very heavy telegram at the moment, as you said :)
@imrehg @ocean But yeah if Angelfire doesn't currently show up in the project list, that's a good indicator that there isn't an active crawl, either because it's not online yet or maybe it's done? idk if angelfire sites can still change or if it's effectively static at this point
@danderson @ocean I've looked through the scraper (last change 7 years ago), and fixed things up rough-and-dirty (will clean up and push it to GitHub there too, shortly). The scraper runs, but since ArchiveTeam has no jobs out, it just idles. I guess it will need central coordination, or maybe I've missed something? 🤔
@imrehg @ocean yeah I think once a working scraper build is up, there also needs to be something generating seed URLs to start from (iirc the wiki suggested starting with the sitemap?), and then archiveteam people need to push some buttons to open up the new job and maybe move some of the swarm over to it? Not sure, I've never MC'd a crawl just observed other AT crawls happen