this post was submitted on 19 Feb 2025
 
The original post: /r/datahoarder by /u/EngagedWorldWizard on 2025-02-19 06:22:18.

There are obviously a lot of very sophisticated DevOps types on here, but I was just wondering if there is a need for a script to reassemble sites from archive.org that have already been taken down. (usaid.gov was the one I was working on, until I found out about the ArchiveTeam Warrior project, which I am now running.)

(I would still love to know if you got usaid.gov.)

Anyway, it is a somewhat tricky job with a large site, because you have to gather the most recent snapshot of each file (the snapshots are all in directories by date), then piece it all together and have it play nicely. If there is interest in this, let me know, because I was making pretty good progress on it.
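
To give a rough idea of the "most recent snapshot of each file" step, here is a minimal Python sketch. It is not the script mentioned above, just an illustration under some assumptions: the snapshots are already downloaded into a local folder with one subdirectory per capture, each named with a 14-digit Wayback-style timestamp (YYYYMMDDhhmmss), and the function names `latest_snapshots` and `assemble` are made up for this example. Link rewriting and the other "play nicely" parts are not covered.

```python
import shutil
from pathlib import Path


def latest_snapshots(root: Path) -> dict[str, Path]:
    """Map each site-relative path to its copy in the newest snapshot directory."""
    latest: dict[str, Path] = {}
    # Assumption: directory names are fixed-width timestamps (YYYYMMDDhhmmss),
    # so sorting them as strings also sorts them chronologically.
    for snap_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        for file in snap_dir.rglob("*"):
            if file.is_file():
                # Later directories overwrite earlier entries, so the newest wins.
                latest[file.relative_to(snap_dir).as_posix()] = file
    return latest


def assemble(root: Path, out: Path) -> int:
    """Copy the newest version of every file into one tree; return the file count."""
    chosen = latest_snapshots(root)
    for rel, src in chosen.items():
        dest = out / rel
        dest.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(src, dest)
    return len(chosen)
```

Calling `assemble(Path("snapshots"), Path("site"))` would then leave a single merged tree in `site/`, where a file that only exists in an older capture is still kept and a file captured several times comes from its latest capture.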

(I know there is a Ruby gem that is supposed to do this, but my results were not that good.)
