FOSS webscraper - Lemmy.World

Not OP. This was posted to r/selfhosted on Reddit and might be useful to some. Original post: https://www.reddit.com/r/selfhosted/comments/1glf06d/comment/lw1e4zd/

GitHub - jaypyles/Scraperr: Self-hosted webscraper.

Scraperr is a self-hosted web application that allows users to scrape data from web pages by specifying elements via XPath. Users can submit URLs and the corresponding elements to be scraped, and the results will be displayed in a table.
From the table, users can download an Excel sheet of a job’s results or rerun the job.
View the docs.
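
For anyone unfamiliar with XPath, here is a rough sketch of the general idea of pulling elements out of a page by XPath and laying them out as table rows. This is not Scraperr's code: the `PAGE` markup, the `ELEMENTS` mapping, and the `scrape` helper are all made up for illustration, and it uses Python's standard-library `xml.etree.ElementTree`, which only supports a subset of XPath and needs well-formed markup.

```python
import xml.etree.ElementTree as ET

# Hypothetical page content; a real job would fetch this from a submitted URL.
PAGE = """
<html>
  <body>
    <div class="product"><h2>Widget</h2><span class="price">9.99</span></div>
    <div class="product"><h2>Gadget</h2><span class="price">24.50</span></div>
  </body>
</html>
"""

# Hypothetical element definitions: a column name mapped to an XPath expression.
ELEMENTS = {
    "name": ".//div[@class='product']/h2",
    "price": ".//div[@class='product']/span[@class='price']",
}


def scrape(page, elements):
    """Return one dict per table row, keyed by the element names."""
    root = ET.fromstring(page.strip())
    # Collect the text of every node matched by each XPath expression.
    columns = {
        name: [node.text for node in root.findall(xpath)]
        for name, xpath in elements.items()
    }
    # Line the columns up position by position to form rows.
    return [dict(zip(columns, values)) for values in zip(*columns.values())]


rows = scrape(PAGE, ELEMENTS)
for row in rows:
    print(row)
```

Real-world HTML is rarely well-formed XML, so in practice you would likely want a more forgiving parser with full XPath support (for example `lxml`) instead of `ElementTree`.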

Welcome to the Scraperr Docs

A guide on how to use Scraperr.

Yes, looks very interesting.