this post was submitted on 03 Jun 2025
The original post: /r/datahoarder by /u/clickbatedubs on 2025-06-03 01:34:21.

I'm trying to figure out how to completely mirror an archived version of a site from the Wayback Machine. Basically I want to download the whole thing, sorta like HTTrack or ArchiveBox does, but from the Wayback Machine copy instead of the live site.

I’ve tried wayback-downloader and the Strawberry fork, but neither worked well for anything large. The best I’ve gotten is a few scattered pages, plus a ton of broken links and missing assets that load fine on the actual Wayback Machine.

Anyone know a good way to actually pull a full, working snapshot of a site from Wayback? Preferably something that works decently with big sites too.
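
For reference, here is a rough sketch of the kind of thing being asked for, not a tested solution for big sites. It assumes the Wayback Machine's public CDX endpoint (`/cdx/search/cdx`) for listing captures and the `id_` URL modifier for fetching the original, unrewritten bytes. The function names, the `mirror` output folder, and the one-second delay are all made up for illustration. It does not rewrite links inside the saved HTML, and it ignores query strings when choosing file names, so distinct URLs can collide.

```python
import json
import time
import urllib.error
import urllib.parse
import urllib.request
from pathlib import Path

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def cdx_query_url(domain, before):
    """Build a CDX query that lists one successful capture per unique URL."""
    params = {
        "url": f"{domain}/*",           # everything under the domain
        "output": "json",
        "fl": "timestamp,original",     # only the fields used below
        "filter": "statuscode:200",     # skip redirects and errors
        "collapse": "urlkey",           # keep one row per unique URL
        "to": before,                   # ignore captures after this timestamp
    }
    return CDX_ENDPOINT + "?" + urllib.parse.urlencode(params, safe="/*:")


def raw_snapshot_url(timestamp, original):
    """The id_ modifier asks for the archived bytes without Wayback's rewriting."""
    return f"https://web.archive.org/web/{timestamp}id_/{original}"


def local_path(original, root="mirror"):
    """Map an archived URL to a file path (query strings are ignored here)."""
    parsed = urllib.parse.urlsplit(original)
    rel = parsed.path.lstrip("/")
    if not rel or rel.endswith("/"):
        rel += "index.html"
    return Path(root) / (parsed.hostname or "unknown-host") / rel


def mirror(domain, before, root="mirror", delay=1.0):
    """Download every listed capture; failed fetches are skipped, not retried."""
    with urllib.request.urlopen(cdx_query_url(domain, before)) as resp:
        rows = json.load(resp)
    for timestamp, original in rows[1:]:    # first row is the column header
        target = local_path(original, root)
        if target.exists():
            continue                        # lets an interrupted run resume
        try:
            with urllib.request.urlopen(raw_snapshot_url(timestamp, original)) as page:
                data = page.read()
        except (urllib.error.URLError, OSError):
            continue
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_bytes(data)
        time.sleep(delay)                   # be gentle with archive.org
```

Something like `mirror("example.com", "20200101")` would be the entry point. For a big site the capture list alone can be huge, so paging the CDX query and adding retries would probably be needed on top of this.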
