this post was submitted on 20 Jan 2025
1 points (100.0% liked)

It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
 
The original post: /r/datahoarder by /u/Mammoth_Inspector_36 on 2025-01-19 17:59:46.

Hi everyone.

A few years ago I begun archiving https://www.russian-records.com/

I remember having an issue before with the embedded audio links not exposing themselves to jdownloader or linkgrabber extension for chrome, i usually use both in tandem. In the end I was able to essentially webscrape the site using both tools, where Jdownloader's find Url function would help me find the artist/year./title etc so i could input the id3 tag.

The best I cant find now is this, however the audio contains no ID3 tags, nor a link in jdownloader to point to them.

https://www.russian-records.com/data/media/

Now though....I cant seem to expose any mp3s the old way, which is strange, i must be missing something id done before.

This is how each mp3/album cover link presents itself alone.

https://www.russian-records.com/details.php?image_id=11904

Clicking on each to download maybe over 100,000 audio files would be a bit much for sure, question is how was I automating it before. I've only ever used Jdownloader 2 and linkgrabber extension for chrome.

Just to be on the safe/legal side, I'm only downloading the out of copyright/public domain stuff of course..

https://preview.redd.it/l9pv69qdszde1.png?width=1120&format=png&auto=webp&s=377b3249bea215229f12ec6b53699f5c58560ce9

Before, I used to use linkgrabber here to filter the links, then use Jdownloader2 to do a deepcrawl, although that no longer seems to work

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here