484
save this rule (lemmy.dbzer0.com)
you are viewing a single comment's thread
view the rest of the comments
[-] anton@lemmy.blahaj.zone 5 points 11 months ago

It won't save everything, but if a script follows every link recursively, most content should be reached that way. That's kind of what Google does but for one site instead of the internet.

If there is a search function try very simple queries.

The alternative of brute forcing links would be unfeasible, even if you are not rate limited by the site, due to the exponential complexity.

If you want to do something please look into api/scraping etikette like exponential back off.

this post was submitted on 13 Dec 2023
484 points (100.0% liked)

196

16591 readers
2397 users here now

Be sure to follow the rule before you head out.

Rule: You must post before you leave.

^other^ ^rules^

founded 1 year ago
MODERATORS