It's A Digital Disease!


This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
2076
 
 
The original post: /r/datahoarder by /u/planksmomtho on 2025-04-14 00:00:53.

Hello there, longtime lurker and even longer data hoarder.

I’ve infrequently ripped my DVD and Blu-ray collection over the years, and very recently ramped up with my Criterion Collection Blu-ray Discs. My issue is that I rip them at full quality, as I take massive personal issue with artifacting, and now I have to figure out where to stick them. I currently have 10TB of HDD space on my PC (as I planned on doing this years ago), with only about 2 or 3TB free currently.

I’ve had my eyes on things like the Western Digital 24TB external drives, but the reviews on them are not comforting, so I’m hoping for better recommendations on how to proceed. My PC tower has the space available for a few more 6TB HDDs, but I feel like I’ll just circle back to the same problem within a few years. I don’t exactly understand NAS storage, but I’ll admit that I haven’t looked into it. Hopefully I’ll be steered in the right direction.

Many thanks in advance!

2077
 
 
The original post: /r/datahoarder by /u/Ali_cicek2 on 2025-04-13 23:14:46.
2078
 
 
The original post: /r/datahoarder by /u/DeForzo on 2025-04-13 21:58:22.

Hello guys,

I have a few TB of data I want to store long term (30+ years), but I feel uncertain about keeping it stored anywhere right now.

I have been to prison once, and the police took every piece of tech from my house (I got into a major fight in someone's house and the police thought it was drug related). I got all my tech back later, including my hard drive, but I basically don't trust myself with it anymore.

Keeping it stored with any company also feels a little unsafe, because the last time I went to prison I could not pay my server bill and all the data I had there got deleted.

I will probably never go to prison again, but the experience traumatized me, so wherever I put my data, it feels unsafe. It's a lot of family photos I want semi-regular access to (weekly/monthly).

To be honest, I just want to make a few hard drive copies and hand them out to my family members so everyone has a copy, but this seems like overkill.

Has anybody else experienced this irrational fear, and what have you done about it?

Are there any actual ways to store my data long term without fear of loss if I'm away again for a long time? (I don't care if it's publicly exposed to the internet, if that helps.)

TLDR: I have an irrational fear of losing my data, anyone else experience this? Any suggestions/solutions?

2079
 
 
The original post: /r/datahoarder by /u/likelinus01 on 2025-04-13 21:45:33.

Hello! I'm looking for an NVMe-based 8-12 bay enclosure that supports both direct-connect Thunderbolt 4 and Ethernet, preferably 10GbE, or 2.5GbE at the very minimum. This will be used for local storage to edit on and then upload to our NAS/DAM over the network.

Does anyone have recommendations or know of any solid units that fit this? I don't mind if it has a PCIe x16 card connected to a main editor, but I still need the Thunderbolt in case we need to download footage to a laptop or external NVMe drive to edit a project offline.

Any ideas or suggestions would be greatly appreciated!!!

2080
 
 
The original post: /r/datahoarder by /u/NatSpaghettiAgency on 2025-04-13 21:43:37.

Does anyone else use something like Advanced Intrusion Detection Environment (AIDE) to validate file checksums? I have some NTFS-formatted drives for which it'd be handy (so I could use it similarly to a ZFS/Btrfs bitrot checker).
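The underlying idea (record a checksum per file, then re-hash later and compare) can be sketched independently of AIDE. This is a minimal illustration, not AIDE's implementation; the function names `hash_file`, `build_manifest`, and `compare` are hypothetical, and SHA-256 is an assumed choice of hash.

```python
import hashlib
from pathlib import Path


def hash_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Map each file's path (relative to root) to its checksum."""
    return {
        p.relative_to(root).as_posix(): hash_file(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def compare(old: dict[str, str], new: dict[str, str]) -> dict[str, list[str]]:
    """Report files whose checksum changed, disappeared, or appeared."""
    return {
        "changed": sorted(k for k in old.keys() & new.keys() if old[k] != new[k]),
        "missing": sorted(old.keys() - new.keys()),
        "added": sorted(new.keys() - old.keys()),
    }
```

A changed checksum alone cannot tell bitrot from an intentional edit, so in practice the manifest would likely be kept outside the scanned drive and refreshed after deliberate changes.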

2081
 
 
The original post: /r/datahoarder by /u/Here_Be_Drag0ns on 2025-04-13 20:42:55.

I know approximately nothing about tech, so if this is a really stupid question please let me know. I've backed up my Tumblr blogs to my computer using tumblr-backup by cebtenzzre, so now the question is how to actually upload them to the Internet Archive. Tumblr-backup does not save the blog as one singular file, but as multiple folders holding [in the case of the blogs I'm archiving] many files each.
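One possible way to turn a multi-folder backup into a single uploadable file is to bundle it into a zip archive first. The sketch below uses only the Python standard library; `bundle_backup` and the directory names are hypothetical, and whether a single zip is the best form for an Internet Archive item is an assumption, not something stated in the post.

```python
import shutil
from pathlib import Path


def bundle_backup(backup_dir: Path, output_dir: Path) -> Path:
    """Zip the whole backup folder, preserving its sub-folder layout.

    output_dir should live outside backup_dir so the archive
    does not end up trying to include itself.
    """
    output_dir.mkdir(parents=True, exist_ok=True)
    archive = shutil.make_archive(
        base_name=str(output_dir / backup_dir.name),  # ".zip" is appended
        format="zip",
        root_dir=str(backup_dir),
    )
    return Path(archive)
```

For example, `bundle_backup(Path("my-blog"), Path("to-upload"))` would produce `to-upload/my-blog.zip`, which can then be uploaded as one file.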

2082
 
 
The original post: /r/datahoarder by /u/kevroy314 on 2025-04-13 20:03:03.

I'm interested in annotating some TV episodes and movies down to the individual scene (or even frame). For example, I might want to annotate Star Trek: TNG S01E03 or Star Trek II: The Wrath of Khan to indicate the presence of a character on screen. I could then use those annotations to ask questions like "what percent of the show is this character on screen?" or "how many total seconds of the show are these two characters in the same room together in a scene?", depending on how I structure the annotations.

As I see it there are two hard-ish problems I don't know the best solution to here:

  1. How do I ensure that if I annotate "+00:14:21.512 to +00:16:01.001 - Picard is on screen", those timestamps meaningfully map onto the most common or standardized timestamps, so that others who want to use them and map them to a video file are likely to get the same points in time? I've thought about referencing the title screen, which would work for files that weren't ripped from TV with commercials removed. Alternatively, I could standardize on the DVD rip or something. Anyone know good practices here?
  2. Are there any cool tools that people use to create these annotations while doing a watch through? Would love to avoid building it myself.
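Whatever tool produces them, annotations of the form "+HH:MM:SS.mmm to +HH:MM:SS.mmm - label" reduce to labelled time intervals, and the questions above become interval arithmetic. The following is a rough sketch under that assumption; the `Span` type and all function names are hypothetical, and times are seconds from whichever reference point ends up being chosen.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    label: str    # e.g. a character name
    start: float  # seconds from the chosen reference point
    end: float


def parse_timestamp(text: str) -> float:
    """Convert '+HH:MM:SS.mmm' into seconds."""
    hours, minutes, seconds = text.lstrip("+").split(":")
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds)


def merge(intervals):
    """Merge overlapping (start, end) pairs so time is not double counted."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def intervals_for(spans, label):
    return merge((s.start, s.end) for s in spans if s.label == label)


def seconds_on_screen(spans, label):
    return sum(end - start for start, end in intervals_for(spans, label))


def seconds_together(spans, first, second):
    """Total time during which both labels are on screen at once."""
    total = 0.0
    for s1, e1 in intervals_for(spans, first):
        for s2, e2 in intervals_for(spans, second):
            total += max(0.0, min(e1, e2) - max(s1, s2))
    return total


def percent_on_screen(spans, label, runtime_seconds):
    return 100.0 * seconds_on_screen(spans, label) / runtime_seconds
```

Storing offsets relative to a named reference event (rather than to the start of a particular file) would keep the same spans usable across rips, provided each rip's offset to that event is known.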

Thanks for any advice y'all can provide!

2083
 
 
The original post: /r/datahoarder by /u/DiogoAlmeida97 on 2025-04-13 20:00:56.
2084
 
 
The original post: /r/datahoarder by /u/TootSweetBeatMeat on 2025-04-13 19:40:55.

So these high capacity Seagate drives that are cheap on serverpartdeals and in the Best Buy external enclosures that are believed to be binned 30TB HAMR drives...are these safe to put in an enclosure with more than 4 bays?

It was my understanding that at least some Seagate HAMR drives should only be put in a Seagate disk shelf, which controls how many adjacent drives are spinning at the same time, because of their low vibration tolerance. Does anyone know if that's the case for these drives?

2085
 
 
The original post: /r/datahoarder by /u/LaundryMan2008 on 2025-04-13 19:12:01.

To begin: I’m posting this a day early, before I get home from a holiday in Spain, so that I can collect plenty of advice and start working on my roadblock with reprogramming these tape drives as soon as I’m back. It might be a few hours before I can put your help to good use and report what worked and what didn’t. Those replies will come later, unless it’s something I have already tried or I need to ask a question about it.

I have all of the Linux commands ready to go to transmit the hex data, which is shown in a picture and transcribed below. (I used a different command found on the internet, as I didn’t want to go to the length of learning how to make that file. It is also more convenient for the megapost I plan to release, which will include MUCH more detailed and easy-to-follow instructions for reprogramming your drive; the GitHub post is just terrible, and it took the help of many people to understand it and get to this point.) When I execute the command, the light on the CP2102 USB UART bridge lights up to say that data is being transmitted, but the tape drive isn’t receiving it, as the sled isn’t powering the tape drive or sending any data. I thought that I could power the tape drive externally with a SAS cable connected to the PC, but it still didn’t reprogram and reboot, and it still showed the error code “E”, which means it’s outside of the library and can’t communicate with it.

I also had the LTO-4 sled die on me: the fan stopped spinning, so I had to wire up the other SAS sled that I had, an LTO-5 sled, which was a little annoying. I thought maybe the first sled was on its way out and refused to power the tape drive, but the new sled did the same, and firing the reprogram command still didn’t work. I also noticed that the sled has a light on the back to indicate that it’s powered on, but it doesn’t light up when I plug the MOLEX cable in.

Are there any extra connections that I need to make for the sled from the tape library to power the tape drive (like a connection that shorts 2 contacts together or grounds a pin to let the sled know it’s inserted into a library successfully)? Is there a jumper somewhere on the circuit board that I need to connect to power the drive up? Or is it normal for the tape drive to have nothing on the screen and not be moving, meaning my command is just bad and I need a different one?

It’s a HUGE roadblock to getting these tape drives fixed: I can’t even begin to test or diagnose the drives, as they will not show up in Windows under the SAS controller card. I’m beginning to think about letting these LTO-5 tape drives go if I can’t reprogram them, as I have been bashing my head against a brick wall trying to do so, and the stupid sled is refusing to power the tape drive or relay my commands to it.

How I have it set up

Closer look at the connections. I’m using Blu-Tack to hold the pin headers onto the paperclips, but I have received data successfully, so it might not be a point of failure. I also held them in by hand at one point.

Out of library error code

The commands that I used. I hit enter so that it would fit on the screen, but that line break isn’t present in the actual command. Ignore the other command, which attaches the CP2102 USB to UART bridge in PowerShell.

2086
 
 
The original post: /r/datahoarder by /u/doodlebuuggg on 2025-04-13 19:05:25.

Sorry if this is too off topic. If it is feel free to delete.

A few months ago I was mailed 11 U-matic tapes from an anonymous source that have footage from the canceled Yellow Submarine sequel, Strawberry Fields. The tapes are moldy, and while they have been baked (albeit somewhat poorly), they are in need of a cleaning and, above all, digitization. The person I mailed them to had his machine break down the same day they arrived, and we have been struggling to find someone else who's willing to do this for free. I do not have steady income and cannot pay the extraordinary fees to have these tapes done by a company.

If anyone here has the ability and time to digitize these tapes for us, it would be an incredible help. I am producing a documentary on the studio the film was being produced in as well as building a digital archive of the material that's been recovered.

The tapes are currently in Delaware. Sorry, should've said that instead of Dallas (where I am.)

2087
 
 
The original post: /r/datahoarder by /u/Interesting-Rip-7599 on 2025-04-13 18:31:32.

Hi folks,

As my storage needs grow, I've been considering moving away from my Synology 2419+ (which is used only as NAS, no compute workloads) to a custom build. Ideally, I don't want to deal with old, large, and noisy rack-mounted units. Right now I'm sitting at ~120TB of usable storage, but due to certain limitations of this specific Synology unit (108TB volume size limit), it creates certain inconveniences that I'd like to avoid in the future. With that being said, here's the list of my requirements:

  1. 300 - 400TB usable capacity in the next 2-3 years.
  2. Hot swapping
  3. At least 2.5G networking, probably dual NICs, but that's not a hard requirement
  4. No need for redundant PSU, since it won't be running anything "mission critical" and I'd like to keep things relatively quiet and power efficient.

I'm not 100% sure if my requirements are throwing me into a more enterprise-ish category, but I've been considering one of the 2 routes:

  1. A regular full tower case, something like FD Meshify 2XL.
  2. 45Drives Storinator AV15.
  3. Other options?

I totally understand that I'm comparing apples to oranges with these 2 options (one being simply a case, while the other is a barebones, production-ready NAS), but I'm honestly not sure which path to take. On one hand, using consumer-grade hardware has its own appeal (cheap, not as power-hungry, widely available - I have lots of good components I could use without spending extra). However, it looks like it's pretty challenging to find high-capacity cases for needs similar to mine, so something like the second option - a purpose-built platform with redundancy and reliability built-in might be a better fit.

I'm curious if y'all have other recommendations/comments regarding my setup.

2088
 
 
The original post: /r/datahoarder by /u/threwusall on 2025-04-13 17:30:37.
2089
 
 
The original post: /r/datahoarder by /u/DV2FOX on 2025-04-13 16:19:59.

While I currently use a 1TB 860 EVO for my OS, my 500GB 850 EVO is starting to run low on space, so I thought of upgrading it to 2TB. That, and the economy is getting troublesome, so I'd rather get a new SSD before prices spike!

I heard a long time ago that Samsung's 870 EVO SSDs had a bad batch, but after some years I wanted to ask:

-Have they solved the issue right out of the box? (There's been no news from Samsung's side, which is why I'm asking.) If so, is there anything on the outside of the box I can check to see if I'll get a fixed version?

-Would a firmware update be needed?

-Is the 2TB model safe? I heard that models below 2TB are, but 2TB and above could be troublesome.

-How are the writing speeds compared to the 850 EVO and 860 EVO?

(I can't use an M.2 drive: I once tried to install one as an OS drive, seated it almost incorrectly in my mobo, and it made the slot smell, so I don't want to try putting anything there again. The rest of the PC runs OK on my 860, so I'd better avoid that slot until I get a new mobo and do it "right".)

An 870 2TB currently costs 158€ and the 1TB 109€, so I think the difference might be worth it, but I'm asking about the issue above first, just in case.

Thanks in advance!

2090
 
 
The original post: /r/datahoarder by /u/EspritFort on 2025-04-13 15:50:46.

I'm aware that this is preaching to the choir and that most of you will already have some automated yt-dlp setup running (or even stock your Jellyfin library directly with YouTube content via pinchflat or similar), but if you don't, then I'd like to give you another reason to start sooner rather than later:

I think I'm witnessing an increasing trend of channel owners retroactively putting old videos behind a channel-member paywall.

(Maybe it's just my own subscriptions, I'd rather be crazy than right in this regard)

So in addition to content violations, intellectual-property-related takedowns, georestrictions, IP bans and YouTube constantly doing their best to permanently break download tools, I now feel I'm also racing against the channel owners themselves in trying to ensure permanent access to my preferred media selection.

If you like it, download it now. At some point in the near future it may no longer be possible at all.
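For anyone without an automated setup yet, a minimal starting point might look like the sketch below: a small Python wrapper that calls the `yt-dlp` command line once per channel. It assumes `yt-dlp` is installed and on the PATH; the helper names, the library path, and the file-naming template are illustrative choices, not a recommended standard.

```python
import subprocess
from pathlib import Path


def build_command(channel_url: str, library: Path) -> list[str]:
    """Build a yt-dlp invocation that skips videos already downloaded."""
    return [
        "yt-dlp",
        # Record finished video IDs so repeated runs only fetch new uploads.
        "--download-archive", str(library / "archive.txt"),
        # Keep going if a single video fails (private, members-only, removed).
        "--ignore-errors",
        # One folder per uploader, with the video ID kept in the file name.
        "--output", str(library / "%(uploader)s" / "%(upload_date)s - %(title)s [%(id)s].%(ext)s"),
        channel_url,
    ]


def sync(channel_urls: list[str], library: Path) -> None:
    """Run yt-dlp once per channel; meant to be called from a scheduler."""
    for url in channel_urls:
        subprocess.run(build_command(url, library), check=False)
```

Running `sync` from cron or a systemd timer would turn this into a periodic job; the archive file is what makes repeated runs cheap.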

2091
 
 
The original post: /r/datahoarder by /u/nib1nt on 2025-04-13 15:47:02.

Hey everyone, I have been testing my web search scraper - it can run 10k+ searches per hour.

I need ideas to create demo projects. We could then load the search results into a vector db and build a RAG etc.

Maybe something like:

  • ${city} ${keyword} to build city profiles around a topic.
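Expanding a `${city} ${keyword}` template into concrete searches is a Cartesian product of the two lists. A small sketch follows; the function names and the sample cities and keywords are made up for illustration, and the only figure taken from the post is the 10k searches per hour rate.

```python
from itertools import product


def build_queries(cities: list[str], keywords: list[str]) -> list[str]:
    """Expand the '${city} ${keyword}' template over every combination."""
    return [f"{city} {keyword}" for city, keyword in product(cities, keywords)]


def hours_needed(query_count: int, searches_per_hour: int = 10_000) -> float:
    """Rough run time at a given scraping rate."""
    return query_count / searches_per_hour
```

For instance, 500 cities times 50 keywords gives 25,000 queries, or about 2.5 hours at the stated rate.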
2092
 
 
The original post: /r/datahoarder by /u/MorCJul on 2025-04-13 15:17:54.

Hi all,

I'm planning an offline + offsite long-term backup of (Edit: selected, ultra-important) family photos and would love a sanity check from the community.

I own an LG BH16NS40 (2013 model) internal Blu-ray writer with support for writing BDXL and M-DISC. However, according to the original manual (2013) and LG support (as of 2021), it officially supports M-DISC DVD+R SL only, not M-DISC BD.

I'm considering three M-DISC DVD options:

I'm leaning toward the Ritek discs, since they appear to be officially licensed and are cheaper.

With concerns over the long-term reliability of modern Verbatim BD M-DISCs (especially multi-layer ones), I’m thinking M-DISC DVDs still make the most sense. Around 4GB per disc is actually a good size for organizing photos, ideal for specific holidays or events, without overloading any single archive.

Edited for clarification: Do you consider RITEK M-DISC DVDs to be a good solution compared to the more expensive Verbatim or Millenniata M-DISC DVDs? I already follow a 3-2-1 strategy with NAS, external HDDs, and cloud. This is more about creating an additional ultra-long-term offline+offsite copy of a limited, curated set of JPEGs. Any insights or experiences would be greatly appreciated!

2093
 
 
The original post: /r/datahoarder by /u/vinznsk on 2025-04-13 14:15:59.

Hey guys, I found this on Amazon: https://www.amazon.com/dp/B0DW8ZW47C

It is 22TB for $249, which makes it $11.32 per TB. I think that is a good deal compared to the recent price increases from SPD and GHD on eBay.

I'd like to buy one of those, shuck it and put it into my NAS.

How do I know if this can be shucked? I've never done it before.
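The per-TB figure is simply price divided by capacity, which makes it easy to compare listings of different sizes. A tiny sketch, with a hypothetical helper name:

```python
def price_per_tb(price: float, capacity_tb: float) -> float:
    """Price per terabyte, rounded to cents."""
    return round(price / capacity_tb, 2)


# The listing from the post: $249 for 22TB.
print(price_per_tb(249, 22))  # 11.32
```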

2094
 
 
The original post: /r/datahoarder by /u/MudAffectionate361 on 2025-04-13 12:53:23.

Hi all, I am a premium/home customer.

My uploads are way below 10TB, but I linked my OpenDrive to rclone. I did not subscribe to OpenDrive to hoard data, just to keep my more valuable multimedia items and access them via rclone when needed.

Suddenly my downloads are being throttled to 500kb/s, which is causing severe buffering. This is not what I signed up for: the terms and conditions say that "OpenDrive does not throttle download speeds on any of its plans, including the free one". I've tested in multiple locations, with and without VPNs, and the speed is the same.

Can someone please advise?

If this is a limitation of OpenDrive, I'm going to have to migrate elsewhere, but the terms and conditions explicitly say that premium accounts are supposed to have unlimited download speeds.

Thanks

  • There are no clear terms or notices that premium users should expect throttling or speed limits.
  • While they mention "excessive usage" for storage or bandwidth on Unlimited plans, this mainly refers to uploaders and large-scale storage use, and my usage doesn’t come close to those limits.

https://preview.redd.it/8v7sei0wmlue1.png?width=741&format=png&auto=webp&s=3e567dd73210f4cae2dafab2965ee5eb387d4a7d

2095
 
 
The original post: /r/datahoarder by /u/angegowan on 2025-04-13 12:42:45.

I hooked a drive up to a really old laptop I had rebuilt, and it was missing drivers for a lot of my files. That got me thinking that I need to make sure my files are in the most universal formats possible: documents as PDF (with a non-Adobe PDF reader on all devices and drives), books as EPUB, sound files as MP3, pictures as JPG. What format would be best for my video files? Obviously, I am pursuing accessibility rather than lossless storage. I use Windows/Android devices and VLC media player and have a large codec library, but what if I need to connect my drives to a basic device?

2096
 
 
The original post: /r/datahoarder by /u/sweatydoodoo on 2025-04-13 11:42:18.
2097
2098
 
 
The original post: /r/datahoarder by /u/Maratocarde on 2025-04-13 11:12:15.

The Internet Archive's 'Great 78 Project' digitizes historical recordings to preserve musical heritage, but in 2023 the initiative led to major record labels filing a copyright lawsuit. The financial stakes soared last month when the labels proposed to update their claim to $693 million in statutory damages. A recent filing suggests that due to significant progress in settlement discussions, it may not come to that.

+++++++++++++

FULL ARTICLE:

https://torrentfreak.com/internet-archive-v-music-labels-500m-copyright-rift-edges-toward-settlement-250409/

Where to follow the lawsuit (and get updates):

https://www.courtlistener.com/docket/68101636/umg-recordings-inc-v-internet-archive/?order_by=desc

Read IA's response:

https://blog.archive.org/2023/08/14/internet-archive-responds-to-recording-industry-lawsuit-targeting-obsolete-media/

2099
 
 
The original post: /r/datahoarder by /u/GenericUser104 on 2025-04-13 09:40:40.

If you're from the UK, what price per TB would you generally pay?

2100
 
 
The original post: /r/datahoarder by /u/Eco-Libertarian on 2025-04-13 04:54:54.

After reading a lot of very contradictory posts about which drives are loud and which are quiet, I've come to the conclusion that people mean different things when they complain about noise.

I'm only concerned about the sound of the actuator moving, not the sound of the drive spinning.

So for those who have experience with more than a handful of drives, please chime in: which are the best refurbished 16TB drives to get?

Use case: Plex server 10 feet from my bed (no, I can't put it in another room).
