this post was submitted on 15 Jul 2024
1 points (100.0% liked)

It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
 
The original post: /r/datahoarder by /u/2600_yay on 2024-07-15 03:58:39.

Does anyone have any backups of the old Los Alamos Arxiv server from the mid-2000s or so? I know it's a long shot – someone having a backup of a probably-large server of academic papers from the 2000s (or I'll even take earlier backups too from the 1990s!), but wanted to ask here as maybe one of you knows someone who knows someone who knows someone with a tape drive backup somewhere - haha

https://web.archive.org/web/20070520024759/http://lanl.arxiv.org/

The preprint server 'only' got about 50,000 papers per year sent to it in the mid-2000s, per this Arxiv history blurb here, which is 'only' about 200 GB, which I know is small now, but would have been quite the feat to back up personally, back in the day. Or if anyone knows of a search for bibliographic records that would have existed on lanl.arxiv.org back in the day – obviously the search server which used to run on :8081 - isn't usable via the Wayback Machine backups, I'm all ears. Basically, I'm trying to find a few papers with a certain substring mentioned in the paper that no longer exist on the live/current arxiv.org website.


Somewhat related to LANL: does anyone have any of the old 'libraries' backed up from sites like the DoE? The Department of Energy used to have a library online with a few hundred thousand papers in it. Alas, that too was taken down many years ago.


In general, if anyone has a list of what large corpora of scientific literature used to exist online - from National Laboratories, the DoE, or other science R&D orgs - I'm all ears.

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here