this post was submitted on 02 Sep 2024
1 points (100.0% liked)

It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
 
The original post: /r/datahoarder by /u/automaton11 on 2024-09-01 21:05:47.

Hey everyone. I figured someone here might be able to help guide me. I'm trying to mirror some pages from a forum at https://ampgarage.com and having an issue.

Here is an example of a page I am trying to mirror. If you scroll through, you can see that some posts include attachments which are unavailable unless the user is logged in, which my mirror reflects.

I signed in on firefox and exported my cookies with the cookies.txt extension, which I passed to my httrack command, but the mirror still failed to get the attachments, showing the same red bar as if I wasn't signed in.

I did ask chatgpt, which provided a number of possible alternate avenues, but I don't understand them well, and so it will take me a while to investigate each possible solution. So I figured since this seems like a relatively simple site, maybe someone on here might be able to give me more direct advice.

Here is the httrack command I used:

httrack "https://ampgarage.com/forum/viewtopic.php?f=5&t=32047" -O "/home/automaton11/amp_garage_mirror/test2/" "+*.ampgarage.com/" --sockets 1 --max-rate 50K -%c.2 --cookies /home/automaton11/Desktop/cookies.txt -r1 --user-agent="Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/15.0 Safari/605.1.15"

NOTE: The site is unstable and has been going up and down for the past few days (why I want to archive it). The biggest issues seem to be with the 'Dumble Files' and 'Dumble Discussion' sections so if you need an example of the problem, here is a page that seems to have a lot more uptime.

Thanks a lot for your help

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here