this post was submitted on 02 Oct 2024
 
The original post: /r/datahoarder by /u/DisclosedForeclosure on 2024-10-02 00:41:41.

Hi, I'm new to this. What would be the best way today to mirror an old classic message board?

I'm talking about a scenario where I don't have access to the board's DB or server, so it's purely an HTML backup. The Web Archive does a good job of mirroring single specific pages but doesn't seem to be a good fit for message boards with thousands of subpages.

I started using Cyotek to scan and download all subpages, but it's quite slow (8 files per minute) and I don't know how to make it skip certain subpages. For example, it needlessly crawls every user's profile page and can get stuck there for many hours. Another issue is that the downloaded HTML files still contain absolute asset links (JS/CSS) that reference the old domain. Relative links would make the backed-up site more portable, but I'm not sure whether any site downloader automates that substitution.
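
To make the two problems concrete, here is a rough Python sketch of what I have in mind: a URL filter that skips profile pages, and a post-processing step that turns absolute asset links into relative ones. Everything specific in it is an assumption for illustration: the `SKIP_PATTERNS` regexes, the `forum.example.com` domain, and the function names `should_skip` and `relativize` are hypothetical, and a real board's URLs may look different. Using a regex on HTML is also crude, so treat this as a starting point rather than a finished tool.

```python
import re

# Hypothetical URL patterns for pages not worth crawling (e.g. user profiles).
# Adjust these to match the actual board software.
SKIP_PATTERNS = [
    re.compile(r"[?&]action=profile"),
    re.compile(r"/member\.php"),
]


def should_skip(url: str) -> bool:
    """Return True if the URL matches any of the skip patterns."""
    return any(p.search(url) for p in SKIP_PATTERNS)


def relativize(html: str, old_domain: str, depth: int = 0) -> str:
    """Rewrite href/src links that point at old_domain into relative paths.

    depth is how many directories below the site root the HTML file sits,
    so a file at depth 1 gets a "../" prefix on every rewritten link.
    """
    prefix = "../" * depth
    pattern = re.compile(
        r'''(\b(?:href|src)\s*=\s*["'])(?:https?:)?//'''
        + re.escape(old_domain)
        + r"/",
        re.IGNORECASE,
    )
    # Keep the attribute name and opening quote, drop scheme and domain.
    return pattern.sub(lambda m: m.group(1) + prefix, html)


if __name__ == "__main__":
    page = '<link href="http://forum.example.com/css/style.css">'
    print(relativize(page, "forum.example.com"))
    print(should_skip("http://forum.example.com/member.php?u=5"))
```

The filter would run before each download and the rewrite after each file is saved. Links to other domains are left untouched because the pattern only matches the one domain passed in.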
