It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
4026
 
 
The original post: /r/datahoarder by /u/adobe-is-a-free-elf on 2025-01-31 10:51:52.

Cheers folks, I have spent all morning trying to fetch information from the following italian archive website: http://archivicomunali.lazio.beniculturali.it/comunali/HAPConsole.aspx?

For starters, the current version of the website is broken.

However a archived version from 2016 on archive.org still works!

https://i.imgur.com/RBcDYZ9.png

Each click is a new request (e.g http://archivicomunali.lazio.beniculturali.it/comunali/AJAXHierarchy.ashxid=hap:localhost%2Fxw%2Fascomunali%2Fhier%2F120841&AspxAutoDetectCookieSupport=1) that will return the contents of the folder (of course archive.org adds their proxy beforehand (in this case https://web.archive.org/web/20250131100802).

It works fine until we reach a document, because thed document page that is requested has not been archived.

https://i.imgur.com/1elPu9H.png

This is an example of trying to access https://web.archive.org/web/20160507223948/http://archivicomunali.lazio.beniculturali.it/comunali/AJAXDocument.ashx with the payload being <document uri:"hap:localhost/xw/ascomunali/hier/120868" mode="hierarchy" />"

I tried using curl and wget to no avail. Any help is appreciated. Please help me find my great grandfather birth records! It would mean the world to me and my family.

4027
 
 
The original post: /r/datahoarder by /u/NotGoodWithWordses on 2025-01-31 01:40:09.

There is a Rumble channel that has uploaded and continues to upload all the footage from January 5th and January 6th 2021. There's footage of the riots as well as some of the individual who planted the pipe bombs. What would be the best way of archiving the entire channel?

https://rumble.com/c/CHASubcommitteeOnOversightRepublicanMajority

I worry that within the next few months the channel might be removed entirely or made inaccessible. I've been able to archive a few of the more important videos, however there are about 52k videos in total on the channel and it seems daunting to try and save as much as possible.

4028
 
 
The original post: /r/datahoarder by /u/nevin_2 on 2025-01-31 07:26:15.

I'm working on backing up all of my movies and TV shows to make a Plex server and have them digitally as I would rather that than have them as discs. Still going to keep the discs as a backup but what would be better to use Seagate IronWolf 4TB NAS Internal Hard Drive HDD – CMR 3.5 Inch SATA 6Gb/s 5900 RPM 64MB Cache for RAID Network Attached Storage ST4000VNZ06

Seagate BarraCuda 4TB Internal Hard Drive HDD ST4000DMZ04/DM004

I am leaning towards the first one as it's $10 cheaper but which one would last longer and be better overall for my case

also, 1 TB is nowhere near enough lol

4029
 
 
The original post: /r/datahoarder by /u/pavoganso on 2025-01-31 06:13:38.

Did anyone manage to make backups before they were deleted?

4030
 
 
The original post: /r/datahoarder by /u/OnePersonExists on 2025-01-31 05:41:04.

The title says it all, I was trying to get a video but I saw it had blob. I did the network thing, but it had no m3u8. When I put in m3, I got MP4 instead. How do I download that?

I even got the screenshot:

https://preview.redd.it/3x8jyroeo9ge1.jpg?width=624&format=pjpg&auto=webp&s=82254b20633433e71ea4a220fb8912d049684ed8

4031
 
 
The original post: /r/datahoarder by /u/Never_Sm1le on 2025-01-31 02:40:30.

There's a server part dealer near my home offer these deal on Lunar New Year discount. What should I get?

Seagate Enterprise Capacity v7 ST12000NM0127 ~137$

Seagate Exos ST12000NM0127 ~157$

WD Enterprise Ultrastar DC HC520 HUH721212ALE604 ~125$

WD Enterprise Ultrastar DC HC530 14TB - WUH721414ALE6L4 ~147$

As far as I can tell, the two Seagate are new and the same, just different brandings

The two WDs are apperently recertified

All have 2 years warranty

Sidenote: I'm in South East Asia so no Server Part Deal or Amazon here

4032
 
 
The original post: /r/datahoarder by /u/uefcommand on 2025-01-31 01:52:39.

I found my server no power lights zero lights on anything. I tried to change outlets and power cables and nothing.. ideas?

4033
 
 
The original post: /r/datahoarder by /u/garn05 on 2025-01-30 23:13:48.
4034
 
 
The original post: /r/datahoarder by /u/daburgr on 2025-01-30 23:12:23.

Hi all, I am using "Batch Link Downloader" extension to download a heap of files from an archive site, however, overnight my pc had shutdown unexpectedly. I want to continue the downloads but I don't want to download what I already have which is a large portion.

There are way too many files to manually check what has and hasn't been downloaded and I already have a large portion downloaded with another large portion to download. Batch link downloader unfortunately doesn't seem to have a way to check if I already have the file in the download folder and just downloads the file again, creating a duplicate with the added (1) at the end of the file name.

Does anyone know of a workaround or another extension that can help me avoid downloading what I already have? I am using Brave browser if that helps.

Thanks

4035
 
 
The original post: /r/datahoarder by /u/liger_0 on 2025-01-30 22:57:46.

I'm looking to get a multi-drive USB enclosure so that I can use some smaller capacity drives I've got laying around (8TB, 4TB, 3TB) for non-critical storage (dumping steam games I barely play/haven't played yet onto it) without having to have a bunch of power bricks for SATA adapters plugged in. However, I'm fairly ignorant of what I should get. I found this on Amazon but I'm not familiar with the company.

4036
 
 
The original post: /r/datahoarder by /u/dekoalade on 2025-01-30 22:45:33.

I was thinking to buy an USB to SATA Adapter. Do I need to also buy an enclosure because the internal hdds could be damaged by being exposed to dust or other things? Are there some brands or things I should look for? Thank you

4037
 
 
The original post: /r/datahoarder by /u/New-Negotiation7234 on 2025-01-30 22:31:14.

Buddy of mine who works at CDC just texted me. He said he just got out of a meeting where they received direction to delete all mentions of trans people from the website—all webpages dedicated to them, and even minor mentions on generic pages, even including "pregnant people." Reference citations to papers with "pregnant people" in the title need to be rewritten or removed. All data associated with trans people is to be deleted. This all needs to happen by tomorrow afternoon, which is an impossible timeline. I told him that's probably on purpose. They want him to fail so they can fire him.

We gotta screenshot pages like this while we still can: https://www.cdc.gov/std/treatment-guidelines/trans.htm

https://www.cdc.gov/hiv/data-research/facts-stats/transgender-people.html

As a fellow PH practitioner, I am fucking appalled. By next week, trans people won't exist according to the CDC.

4038
 
 
The original post: /r/datahoarder by /u/Anarcho_Christian on 2025-01-30 22:02:45.

I'm using a LSI sas9207-8e, a 4-way SAS breakout cable, and two 4TB SAS drives.

It works. Kinda.

  • I'm able to see drive 1 if it's the only drive plugged in.
  • I'm able to see drive 2 if it's the only drive plugged in.
  • I'm only able to see drive 1 when i boot up with both drives plugged in.

Anyone else ever had this issue?

4039
 
 
The original post: /r/datahoarder by /u/cowjuice11 on 2025-01-30 21:40:17.

Ive been using 2 segate 2tb drives for about 3 years now in a raid 1 setup on my linux server. Never had a problem once before. Then i saw louis rossman's new video talking about drives advertised as new being used and he mentioned backblaze's stats that show segate with a high failure rate. so like any normal person i used a free toll called scrutiny to check the health of my drives and got this: https://imgur.com/a/i6bmsGs How cooked am i? i still have good read and writes on them and ive never had any data loss. the most confusing part is the 8yrs life span.

rossman video https://www.youtube.com/watch?v=bFscU8JUohA

4040
 
 
The original post: /r/datahoarder by /u/Endeavour1988 on 2025-01-30 20:32:45.

I have a Windows box I use for Jellyfin, aka my datahoarder setup. I'm keen to get a new drive for the family photo's. How do you organise your pictures in a folder structure way?

Secondly due to the volume, its not loads maybe 300GB (not RAW either) all Jpegs, should I opt for an SSD? I'm just thinking a 2TB will suffice and also all those thumbnails loading surely a HDD will get bogged down?

Lastly manual organisation or should I use an application or maybe something where I can host them on my local network only?

4041
 
 
The original post: /r/datahoarder by /u/denierCZ on 2025-01-30 19:08:21.

I bought 2x HC560 20TB, put them in my Synology in RAID1 and this morning I wake up to a beeping Synology. The 04 AM rsync backup task caused some kind of failure, the Disk 1 is now degraded. I took it out and I am doing a read surface test, while rsync copying the data from the remaining functional drive.

How is this possible? The disks were brand new, never used, not recertified or anything. I got them for a few months. How can one of them (seemingly) fail after 5000 hours?

SMART is OK for both.

4042
 
 
The original post: /r/datahoarder by /u/Blood_Wraith7777 on 2025-01-30 17:44:40.
4043
 
 
The original post: /r/datahoarder by /u/erik1220 on 2025-01-30 17:12:05.

Home | Video Game History Foundation Library – Digital Archive

"-Guidebooks and ephemera from game events, including searchable directories and maps from the first 12 years of the Electronic Entertainment Expo (E3).

-An extensive international collection of From Software promotional materials, collected by citizen archivist Kris Urquhart, with a blessing from FromSoftware to donate them to the library.

-100 CDs of art and press releases from GamePro’s magazine archive.

-Over 100 hours of footage from the production of the Myst series, including never-before-seen interviews with the Cyan team.

-The Mark Flitman papers, a treasure trove of documents collected over the course of Flitman’s career at video game publishers like Konami, Acclaim, Atari, and more."

-Wario64 on X

4044
 
 
The original post: /r/datahoarder by /u/Little_Instance8623 on 2025-01-30 17:11:03.
4045
 
 
The original post: /r/datahoarder by /u/CYP446 on 2025-01-30 16:33:12.

Hi everyone,

I'm in a bit of a bind and could use some advice on expanding the storage of my ITX system cost-effectively, given some space constraints.

Here's the situation:

I had to downsize recently due to losing my job, and with it, I lost my home office space. This has forced me to rethink my PC setups. I have two systems: one e-ATX filled with SSDs and HDDs, and a smaller ITX SFF system that I'm focusing on now.

The ITX system is running on a ITX (B550I) motherboard. It's already got two M.2 slots filled, and the single PCIe slot is occupied by a GPU. There are four available SATA ports, but the internal space is so tight that I can maybe fit a single 2.5" drive, and that's about it.

I've considered external solutions. The motherboard does have a USB 3.2 Gen 2 Type-C port, which is fast, but I'm trying to keep my physical setup as compact as possible. External drives are an option, but I’d prefer something less cumbersome.

One idea was to shuck one of my 5 TB WD 2.5" Easystore USB drives, but I've heard that the newer WD external drives aren’t really shuckable anymore.

Does anyone have experience with compact, affordable storage solutions or any tips on how I might be able to add significant storage without expanding my system’s footprint too much? I'd love to try and avoid having cables and desk clutter as much as possible.

Any help would be greatly appreciated!

Thanks in advance!

4046
 
 
The original post: /r/datahoarder by /u/Mk23_DOA on 2025-01-30 15:55:29.

For a new project I am looking for an alternative for WD red M2 SATA SSD's. Initially I wanted to go with WD Red's (again) but they are either on backorder or have gone up in price 20% since Q3 last year.

So I am looking for 2TB M2 SATA, NAS rated drives with something like 540-560 W/R speeds. I haven't been able to find Seagate EXOS in M.2

4047
 
 
The original post: /r/datahoarder by /u/Refinery73 on 2025-01-30 15:16:23.

Hi everyone,

I’m sitting on a pile of a few hundred thousand PDFs from local government als city hall meetings from half the county.

I’m wondering what to do with it and like to discuss your opinions.

I was able to easily scrape them from the gov website and the files are public. I see archive value in them for city history and political studies. They are however created by a bunch of different cities and departments and lack any clear license. The robots.txt didn’t prohibit scraping but I don’t exactly own them. On the other hand it’s public government information. Not US-based so I don’t want to discuss about licensing of public documents but how you would approach this dataset.

I thought about ‘preservation first’ and ‘public interest’ so to create a torrent archive for each city and start seeding it. I’m not sure however if someone has a better idea.

There is no public archive for this and cities have been losing these left and right when changing platforms and not caring about migrating. For them the relevant file is some signed printout in some drawer. They just don’t care.

4048
 
 
The original post: /r/datahoarder by /u/game_stailer94 on 2025-01-30 12:16:27.

I created this tool after reading the recent Heise article (https://www.heise.de/en/news/Fraud-with-Seagate-hard-disks-Dozens-of-readers-report-suspected-cases-10259237.html) about potentially fraudulent Seagate drives being sold as new. The tool leverages smartmontools to compare two different power-on hour counters in Seagate drives:

  1. Standard SMART Power-On Hours attribute
  2. Seagate's proprietary FARM log Power-On Hours

In legitimate new drives, these values should match (or have minimal difference). A significant discrepancy could indicate tampering or misrepresented usage history.

The tool is available as both a shell script and Docker container: https://github.com/gamestailer94/farm-check

Technical details:

  • Requires smartmontools 7.4+ (Docker container recommended and includes this requirement)

  • Works with any Seagate drive (non-Seagate drives will be skipped as they lack FARM data)

  • Can check single drives or scan all connected drives

Docker is the recommended way to run this tool as:

  • It works regardless of your distribution's smartmontools version

  • Ensures consistent behavior across different systems

  • No need to install or manage dependencies

  • Pre-built container available and ready to use

For those who prefer direct installation, you'll need:

  • Linux system

  • Root privileges (needed for SMART access)

  • smartmontools 7.4+

  • Seagate drive(s) to check

Since Heise is a German tech news site and the reported cases are primarily from European sellers, this might be more relevant for the European market. However, given the global nature of hardware sales, I thought it might be useful for the broader homelab/selfhosted community.


Disclosure: This post was formatted and refined by Claude (AI) with my guidance, as I wanted to ensure the information was presented clearly and engagingly.

4049
 
 
The original post: /r/datahoarder by /u/seska999 on 2025-01-30 08:13:13.
4050
 
 
The original post: /r/datahoarder by /u/SchizophrenicScreams on 2025-01-30 20:30:11.
view more: ‹ prev next ›