Foss webscraper (github.com)
from AustralianSimon@lemmy.world to selfhosted@lemmy.world on 09 Nov 00:05
https://lemmy.world/post/21798848

Not OP. This was posted to self hosted on reddit and might be useful to some.

Original post - www.reddit.com/r/selfhosted/comments/…/lw1e4zd/

#selfhosted

threaded - newest

tgxn@lemmy.tgxn.net on 09 Nov 07:22 next collapse

project is here github.com/jaypyles/Scraperr

MaggiWuerze@feddit.org on 09 Nov 08:45 next collapse

Scraperr is a self-hosted web application that allows users to scrape data from web pages by specifying elements via XPath. Users can submit URLs and the corresponding elements to be scraped, and the results will be displayed in a table.
From the table, users can download an excel sheet of the job’s results, along with an option to rerun the job.
View the docs.

GravitySpoiled@lemmy.ml on 09 Nov 20:57 collapse

An excel sheet? …

MaggiWuerze@feddit.org on 09 Nov 21:07 next collapse

🤷‍♂️

ChapulinColorado@lemmy.world on 09 Nov 21:23 collapse

Maybe it’s just a CSV?

Blxter@lemmy.zip on 09 Nov 12:17 collapse

Yes looks very interesting.