@catsalad @dangoodin @cyberlyra

Oops left out the important link dealing with webscraping https://www.reclaimcontrol.tech/ #Security #WebScrape

Reclaim Control | over your technology, your devices, your data, your digital life.

#Webscrape
I then though about how maybe I should #develop a #program to #scrape a #webcomics #HTML and create a structured file with links to each page. Something that could probably just run on a cron job and produce a #YAMl or #JSON file containing a predefined structure that can be read by an #android app.

I'm trying to #webscrape this website. But each file is stored in a randomly named folder. Anyone have any advice on how to achieve this? For instance the first file I found is in:

media/obcno431/all-regions-abs-sa4-snapshot_september-2022.xlsx

and the second file I found is in:

media/0tbbwy30/all-regions-abs-sa4-snapshot_july-2022.xlsx

At first I thought they were #hashes of the filename but I can't get a matching hash.

#webscraping #dataengineering #whydotheydothis

For the #30DayMapChallenge Day 2 I wanted to draw connecting lines between prehistoric sites in Scotland. This has taken me down a wild rabbit hole of learning how to #webscrape but has distracted me from building an actual map. I'd live to finish creating this dataset to explore at some other point in the challenge but I think it might be time to try something else for today's map 😅

I'm on the hunt for #software that monitors the #clipboard for links, does a #webscrape for title/summary info, then adds the info as text into a holding file hosted on a cloud share.

Is there such a beast (of some sort), or am I going to have to get my #python #coding hat on?

Preferably #FOSS and works on #Linux