OFC all this #MassSurveillance and #DataHoarding csn only be used to amass "#Blackmail|ing Material"...
#USpol #politricks #SurveillanceState #data #blackmailing #politricks #politics #tech #fascism #cyberfascism #GamersNexus

How to Create an Offline Version of Websites Using Kiwix and ZIM Files
#zim #kiwix #offline #SelfHosting #DataHoarding #WebScraping
> Access web content without the Internet — wherever you go.
https://wiki.openzim.org/wiki/Zimit
https://github.com/openzim/zimit
⚙️ Optional: Create Your Own ZIM File
Don’t see the website you want in the Kiwix library? No problem!
Use Zimit to generate your own ZIM file.
Steps:
https://noted.lol/convert-any-website-into-a-zim-file-zimit/
💡 Note: Not all websites are easily portable to ZIM format, especially dynamic sites with login systems or lots of JavaScript.
RE: https://mastodon.social/@Migueldeicaza/115713071946346627
Holy crap. What a case study in why it is important to decouple data and devices from corporate monopolies!
Quite happy to have found a few utilities that are helpful in sifting through a full storage device.
- NCDU 'NCurses Disk Usage'
https://dev.yorhel.nl/ncdu
- ripgrep & ripgrep-all
https://github.com/phiresky/ripgrep-all
Thanks @fschaap for the suggestions to find duplicate data. Looking into them. In addition I also found fslint.
Thanks @nicorikken for the idea of having a shell script to monitor changes & a trigger for cleaning up. Will try this as well.
Bleeping Computer: American Archive of Public Broadcasting fixes bug exposing restricted media. “A vulnerability in the American Archive of Public Broadcasting’s website allowed downloading of protected and private media for years, with the flaw quietly patched this month. BleepingComputer was tipped about the flaw by a cybersecurity researcher who asked to remain anonymous, stating that the […]
whoops, spent all night downloading Wikipedia articles to read on my offline Kindle Paperwhite: level 1 and 2 vital articles (https://en.wikipedia.org/wiki/Wikipedia:Vital_articles), and a lot of random stuff on computing/psychology/politics
I picked pre-ChatGPT revisions (2022-11-29 or earlier) for shits n giggles but Wikipedia's built-in PDF downloads were broken for older article revisions ime, so I had the choice of using wget or SingleFile. I picked the former and just cleaned my filenames