The original post: /r/datahoarder by /u/wiloma on 2024-08-12 20:04:19.
My friend deleted her blog but wants some pictures back.
Ameblo stores its images a little oddly, like so:
https://stat.ameba.jp/user_images/YYYYMMDD-directory/XXnumbered-directory/BLOGNAME/YYnumbered-directory/ZZnumbered-directory/Single-letter-folder/foo.jpg
The easiest approach for my porous brain would be to replace the numbered directories with wildcards and grab anything under a BLOGNAME folder. But this is well beyond a simple download manager like JDownloader or DTA. HTTrack requires the original HTML link she posted, doesn't it? Which of course I don't have. Nothing else in the wiki seems any more promising. Beautiful Soup, maybe?
How would you recommend I proceed? I can hack a little in shell or Python if necessary (I'm on a Gentoo machine), but I'm not sure how to set up the process. Ideally I'd just download every image under BLOGNAME and its subdirectories, and she and I can sort through them later.
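
For what it's worth, here is a rough Python sketch of the "anything under BLOGNAME" filter. HTTP itself has no way to expand wildcards in a URL path, so instead of globbing the server this scans text that still mentions the image URLs and keeps the ones matching the layout above. Where that text would come from (an archived copy of a blog page, a saved HTML file, and so on) is an assumption on my part, and `find_blog_images` and `myblog` are made-up names for illustration.

```python
import re


def find_blog_images(text, blogname):
    """Return the unique image URLs in `text` that sit under `blogname`.

    Assumes the layout described in the post:
    user_images/YYYYMMDD/XX/BLOGNAME/YY/ZZ/<single letter>/<file>
    """
    # One path segment: anything up to a slash, whitespace, quote or bracket.
    seg = r"[^/\s\"'<>]+"
    pattern = re.compile(
        r"https?://stat\.ameba\.jp/user_images/"
        r"\d{8}/"                       # YYYYMMDD directory
        + seg + "/"                     # XX directory
        + re.escape(blogname) + "/"     # the blog's own folder
        + seg + "/" + seg + "/"         # YY and ZZ directories
        + r"[A-Za-z]/"                  # single-letter folder
        + seg + r"\.(?:jpe?g|png|gif)",  # image file name
        re.IGNORECASE,
    )
    # dict.fromkeys drops duplicates while keeping first-seen order.
    return list(dict.fromkeys(m.group(0) for m in pattern.finditer(text)))


if __name__ == "__main__":
    # Hypothetical sample input; real input would be whatever saved text exists.
    sample = (
        '<img src="https://stat.ameba.jp/user_images/20240101/12/myblog/ab/cd/j/o123.jpg">'
        '<img src="https://stat.ameba.jp/user_images/20240101/12/otherblog/ab/cd/j/o999.jpg">'
    )
    for url in find_blog_images(sample, "myblog"):
        print(url)
```

The resulting list could then be written to a file and handed to whatever downloader is convenient. Whether the image files are still being served after the blog was deleted is a separate question this does not answer.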