The events that have happened here today marks a new era for the internet. Websites that derive their value from user-submitted content are continuing their move towards sealing off that content and information from the world. Their libraries were created by everyone, but will soon be available to only those who pay.

My greatest advice to those concerned: Start archiving. If you come across something you believe is valuable: make a copy.

If you follow a guide on how to install and configure software and you found it helpful: make a copy.

If you come across a video you really enjoy and find yourself coming back to again and again: make a copy.

If it means anything to you: make a copy, or you'll wish you did.

Steal This Comic

xkcd
@DJGummikuh @n4 I hate that Kindle has a near-stranglehold on the eBook market. At least No Starch Press does better, which is why I buy so many tech books from them.
@matir @DJGummikuh @n4 at least a few years ago it was pretty easy to export a plain epub from Kindle purchased books
@n4 Do you know of any good software that makes this user friendly? Like a personal archive.org that would handle archiving, searching, and serving.

@rick While I do not have a list readily available, I can provide some ideas.

For images, use right-click > save. If the website blocks right-click, then you can inspect element and find the image in the page source.

For text/static webpages, use right-click > save page as. Copy/paste works well for strictly text stuff.

Keep it simple: If you're archiving something, then store it in a widely supported and accessible format.

(1/2)

@rick For videos, use yt-dlp[1]. There is a GUI frontend available for it which should help make it more user friendly[2].

[1] https://github.com/yt-dlp/yt-dlp
[2] https://github.com/ErrorFlynn/ytdlp-interface

(2/2)

GitHub - yt-dlp/yt-dlp: A feature-rich command-line audio/video downloader

A feature-rich command-line audio/video downloader - yt-dlp/yt-dlp

GitHub

@rick @n4 it's mainly focused for creating bibliographies but Zotero is pretty good at grabbing webpages.

It's not a proper archiving tool but it works well for me.

https://www.zotero.org/

Zotero | Your personal research assistant

Zotero is a free, easy-to-use tool to help you collect, organize, cite, and share research.

@rick @n4 For video I've found yt-dlp works really well, though it is command-line based.

https://github.com/yt-dlp/yt-dlp

GitHub - yt-dlp/yt-dlp: A feature-rich command-line audio/video downloader

A feature-rich command-line audio/video downloader - yt-dlp/yt-dlp

GitHub

@rick @n4 If you’re a Mac/iOS user, I can recommend DEVONthink from @devontechnologies

It will store anything from Markdown documents and archived web pages, to videos and audio. Everything is indexed and searchable, and you can make it sync automatically between all your devices. The desktop app is a little daunting at first, as it does a ton of other stuff, but you can ignore most of the menus and advanced features and just dump info into it with the OS’s share button, and find it with the search box.

Also, it is rock solid. You can put gigabytes of data into it and it will stay lightning fast.

@rick @n4 I use Monolith for webpages. It makes a single html file of a webpage.
From their GitHub

“You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive…lets browsers render the saved page exactly the way it was on the Internet, even when no network connection is available.”

https://github.com/Y2Z/monolith

I use the #Rust CLI but there also seems to be a chrome extension

GitHub - Y2Z/monolith: ⬛️ CLI tool and library for saving complete web pages as a single HTML file

⬛️ CLI tool and library for saving complete web pages as a single HTML file - Y2Z/monolith

GitHub

@n4

Lol.

Been doing this for years on the PC. 😀
More difficult on a smartphone.

@HistoPol @n4 I think there are some apps that make it easy to connect to a NAS. I wonder if at home hardware beyond portable devices will make a comeback.

@grumpasaurus

Yes, that seems like a good idea. I've hardly been using online storage anyway.

What we need to do is, however, is to donate to the #WaybackMachine.

Also, I used to have a program to download a complete website.

@n4

@grumpasaurus @HistoPol @n4
You mean I won't be an anachronism for being the only non gamer still using a deektop with tower.
@the5thColumnist @HistoPol @n4 you can still be special! I mean the last time I dealt with a NAS those things are compact and pretty straight forward. With a mobile app backing up data from the can should be a cinch!
@HistoPol @grumpasaurus @n4 we use a (physically) tiny Synology NAS in our small printing business. Dead easy to use, and will bring home when we retire sometime next year. I take nighty backups to the cloud, but weekly backups to portable hard drives. I am fed up with corporations like Adobe insisting you should save stuff to their cloud, thus making yourself a hostage

@TonyDF

Sounds interesting.
Do you have a product link?
@grumpasaurus @n4

@n4 @HistoPol @grumpasaurus the version we have have appears to have been discontinued DS419slim
Synology_SA_26_03 | Synology Inc.

Synology Product Security Advisory

@HistoPol @grumpasaurus @n4 Yes! I love it, and have been a fan of Synology products for over 15 years

@TonyDF

#NAS for private /SMB use

Ok.
I could still by a new rack.
The question is, when does support run out? Seems to be a 2019 model.

Also: what kind of RAID fo you use?
Are you using HDD or SSDs (faster but shorter-lived, IMO. Flash-chip technology only guarantee about 1m accesses, not all that much.)
@grumpasaurus @n4

@HistoPol If it really comes down to it, I'll take a screenshot when I'm on my phone. It's not ideal, but at least it's accessible. Something is better than nothing.

@n4

I have, in fact, been doing this a lot, too.
The reason being that I use it to document certain IT issues or ask questions.

Regarding the news, however, I am ever more experiencing error message like, "#screenshot is not allowed due to security guidelines," etc.

@n4
💯 but disk space was so limited back at some critical times. 😐

@dexter It is very unfortunate that we sometimes lack the time, resources, or foresight to archive information before it is lost.

Fortunately, we learn from our past and exercise greater judgement when it comes to recognizing what is valuable/important to us in the future.

Additionally, we are lucky to live in a time where data storage is inexpensive, and continues to become less expensive over time.

@n4 @ianRobinson don’t half of us in IT already do this. I’ve a chunk of hard drive committed to various downloads and other bits of software to make things work as so used to companies pulling downloads/activations. What you want to use that 5 year old software? We didn’t mean it when we said “lifetime”…

@jonbradbury @ianRobinson You would be surprised at the lack of backups in some IT departments. It is also sometimes unfortunate that some software installers do not include all of the files required to get the software running, such as downloading those resources from the internet at installation time.

I think what you are doing is very wise. Keep up the good work!

@n4 @ianRobinson I probably wouldn’t based on how often I’m telling people off for crappy backups. Very belt and braces here. Apple has been quicker than you’d like at discontinuing application downloads, and seen various across Mac/windows where expensive software scuppered because you can’t find an update patch somewhere that’s critical to the software, or disabled activation servers that people expected to last forever on their lifetime software

@jonbradbury @ianRobinson I hold a lot of respect for those who take the time to reverse engineer these things, create workarounds, and release working alternatives for everyone.

I love the cooperativeness and creativity that people share online. It makes me grateful that I get the opportunity to experience and benefit from it.

@n4 even some of that is being pulled now. Put AutoCad 2013 (iirc) on win7 on a MBP for a customer last year, tricky job as half links to the right DLs gone. Anyway sorted that, kept the DLs, customer asked for same again this year. Installed but activation servers turned off, customers miffed, well I own it I'll download a pirate copy, only all the pirate links are dead ends and the downloads don't exist. The pirates only want the latest versions it turns out. OK I've got 2003 he says 1/2
@n4 only whilst 2003 will install on W7 it promptly crashes, well documented fault at the time w7 came out, all the known fix downloads are pulled, a company even did a $300 paid for fix in period, only that doesn't exist anymore... God knows what the computer history museum for this period will look like when all the software is subscription, if it even exists on your machine to even begin with... 2/2
@n4 not me quickly grabbing the mp3 for a podcast episode my toddler loves 🤣

@n4 @sebsauvage : that’s one of the reason why I created a browser that create a permanent cache you can fully browse while offline.

Ok, it’s command-line (because I like it that way) but it is invaluable. And, with all the CSS and the cruft removed, you would be surprised how little disk space you need (I use it every single day, synchronizing hundreds of RSS feeds with all their links synchronized and all the page I’ve ever read with their picture and I use ±10Go/year

https://sr.ht/~lioploum/offpunk/

offpunk: An offline first command-line browser for the smolnet

@ploum @sebsauvage That's a really cool project. Well done!
@ploum @n4 @sebsauvage Might be worth wrapping with #Emacs and #Eww at some point, both tagging and bookmarking this in case I get around to it at some point.

@lispi314 @n4 @sebsauvage : don’t hesitate to bring the discussion on the devel list:

https://lists.sr.ht/~lioploum/offpunk-devel

I’m currently in the process of splitting the code in several components to make it more modulable and usable by other projects.

~lioploum/offpunk-devel archives — lists.sr.ht

@n4 just recently, ongoing events made me go out, buy another hard drive and download my entire YouTube favourites list .
Of almost 1700 videos. It went a lot more smoothly than expected.
@n4 fortunately there's a wealth of tools available to do so!
Youtube-dlp and gallery-dl are two of my favorites!
@n4 that's the definition of life!

@n4

Also: for a long storage don't trust CDs, USB memories or even single hard disks.

@GustavinoBevilacqua Blue Ray’s were the best, right?

I’ve heard of some special ones designed for long term storage but I cannot recall the name right now.

@n4

@GustavinoBevilacqua @n4

This is where I'm at.

I have many terabytes of stuff from over the years.

I also have 15 years worth of journals that weigh about 40 pounds. I bought a book scanner to preserve them.

I ultimately want my archive to be cloud-based.

I think cloud options will grow larger and more affordable over time.

About 7 years ago, I tried to slowly upload my ripped CD collection to Evernote. They didn't stop me but weren't too pleased. The process was slow as molasses too.

@Hawkmoon @n4

What about giving everything to archive org?

@n4 Good advice! I cherish the people who did this already 40-50 years ago and still has a copy in their basement.

By the way, also make a note about the original source location. You may well (as I have) come back years later and wonder where you got it.

@n4 Joplin + browser extension work very well to archive web content
@n4 biggest thing I can afford atm is a 1TB ext drive, but it's already in the post. Planning to back up like, a LOT of youtube videos. Don't know if I'll have space for much else.
@n4 I have started doing that and also taking notes from them so even if the article is lost my knowledge and understanding of it will not
@n4 Is there a selfhostable version of archive.is or the like?
@n4 i warmly recommend running an omnom instance, it makes it easy to make a full snapshot of the page you are viewing: https://github.com/asciimoo/omnom/ look at this example https://links.ctrlc.hu/bookmarks (registrations are closed on this instance you should run your own for your community)
GitHub - asciimoo/omnom: A webpage bookmarking and snapshotting service

A webpage bookmarking and snapshotting service. Contribute to asciimoo/omnom development by creating an account on GitHub.

GitHub
@n4

...and push a copy into the Internet Archive, for the rest of the world's benefit.

@n4

🍬 One saying I learned from my assignment in Earth was that not a single burned in the Library of Alexandria was lost due to fire; their knowledge all perished of neglect when scholars ignored their contents and did not copy them. 🍬

@n4 back in the Usenet days, I filled Zip disks with stuff I downloaded. The always-on Internet connection made me lazy…
@n4 Evernote does a passable job at this
@n4 A big thing for me has been cloning git repos. I can't store and maintain everything, but especially anything niche or maintained by an individual on a personal server or anything I think is at risk of DMCA, I try to take a local clone of.