It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
3526
 
 
The original post: /r/datahoarder by /u/JLJFan9499 on 2025-02-13 05:05:41.

So I hoard older physical PC games and now Steam subreddit is saying how stupid I am, that Steam is reliable source for gaming needs and that physical media is stupid. My argument is that I don't need to worry about my account being revoked one day for whatever reason and that Steam is not a long term solution for game ownership/preservation. Am I wasting money by buying physical media? Should I focus on Steam for now on? Or should I keep buying old physical games before Steam activation was a thing? I've always gone left when others go right but now I'm questioning my choices.

3527
 
 
The original post: /r/datahoarder by /u/-ThatGingerKid- on 2025-02-13 05:02:18.

I have set up an Undraid NAS server at home. I can't afford to build a second NAS right now. I'm thinking about (for the time being) regularly backing up all my data both to a large personal external hard drive, and a Hetzner storage box. I'm still learning the ins and outs of secure backup, and avoiding all possible failures (drive failure, natural disaster, malware, etc), so I'm curious what you do.

3528
 
 
The original post: /r/datahoarder by /u/lordofcatan10 on 2025-02-13 04:30:28.

Title says it all. Looking to fill up my drives with useful stuff. OS works and I have a good Internet connection.

I’m a biological data scientist so interested in that type of field. Anywhere I should start with deciding what to back up and air gap?

3529
 
 
The original post: /r/datahoarder by /u/NathanDTWally on 2025-02-13 03:49:38.

Can this RAID mirror properly? 4TB NAS drive w/ 4TB Surveillance drive

I recently built a tinkering proxmox server with two identical SSDs mirrored with 4 HDDs

2 HDDs are identical 8TBs and will be mirrored likely. But I have two 4TB HDDs, one a NAS drive, and one a Surveillance drive (from different companies worth noting?). Am I able to raid these? Or am I better off not.

I really don't plan to use the one drive as surveillance I just had it available to me at the time of the build.

3530
 
 
The original post: /r/datahoarder by /u/True-Entrepreneur851 on 2025-02-13 03:44:09.

I need some help and guidance on setting up my backups as I am facing difficult choice and options. I have the following setup : 1 Synology NAS 423 where I store different things in 4 folders around 20 TB all data to backup. 1 HD 10 TB and 5 drives 4TB each.

I have Duplicacy on my pc that is connected to the NAS through wifi.

I would like to backup my NAS, first thing I did was to use Windows Storage Space to manage a RAID0 drive for backup, works great and now I have 10TB + 12TB for backup storage. Problem is backup from PC is very slow, reaching 50 MB/s.

I am thinking now about two options to make it faster :

Setup Duplicacy on my NAS and backup from NAS. The problem is that I have only 2GB of RAM, should I buy more ? Besides this I am not confident the RAID created by Windows storage space will be recognized as such by my NAS. I am also having big pain to setup duplicacy as they are not clear on which version should be used for my Synology, is it Duplicacy web ? I am very newbie and considering also BORG as I found the package for DSM but not sure it is easy to setup..

Other option : I keep using Duplicacy on my pc, I buy a long ethernet cable and plug to my NAS. My question there : will it be MUCH faster than 50 MB/s ?

Other points to consider : I want to avoid buying a 20TB drive because I see it as a waste of money given that my 4x4TB are in good conditions and I find it better for my bank account compared to price of 20TB disks. I do monthly backups for Home use, no need to have something too much elaborated.

Thanks for the help on this.

3531
 
 
The original post: /r/datahoarder by /u/R3UO on 2025-02-13 02:41:07.

Hey r/datahoarder!

I built a linux tool that helps organize/find/recommend related content in video libraries using machine learning (bayesian math) and VLC.

Key features:

  • Uses VLC for playback and user feedback (space/stop keys for classification)
  • Learns from your file naming patterns
  • Handles any language/character set
  • Saves as standard M3U playlists
  • Optional size-based classifications (prefer larger/smaller files, larger/smaller dirs)

Limitations:

  • Linux (for now)
  • Operates on video metadata (file name, path, size, etc) not content, so there should be some common information present video library across file names/paths.

Try it out!

Installation requires the rust package manager cargo: cargo install classi-cine

Basic usage:


Build a new playlist from your video directory
==============================================

classi-cine build playlist.m3u ~/Videos

List what you've liked/disliked
===============================

classi-cine list-positive playlist.m3u
classi-cine list-negative playlist.m3u

It's open source (MIT licensed) and written in Rust. Might be useful for anyone managing large video collections.

GitHub: https://github.com/mason-larobina/classi-cine

Let me know if you have any questions!

3532
 
 
The original post: /r/datahoarder by /u/Such-Bench-3199 on 2025-02-13 00:55:17.

My dad gave me a WD My Cloud Duo 16TB NAS (he refused to listen to me at the time, and was convinced every drive is the same) even though we had two Synology's at the time. He wanted it for his photos (he uses photoshop) and money doesn't matter to him. He eventually realised it need to be connected to the internet to be able to use it, that wasn't going to work for him, he didn't have the energy to return it and gave it to me.

Unfortunately, he took the time to set it up before he gave it to me, so now whenever it gets full (I think less than 5TB), or shuts off because its hot, it "phones home" (his email is associated with it) to tell him, then I get yelled at (he is 70)

I get it, he doesn't want his house to burn down, but still.

My strategy/current plan is.

I want to buy a normal 16TB, no NAS, no fans nothing, backup/clone the source drive, then turn it off, take the drives out, wipe them both, and have two free 8TB drives.

I see the pro's the only con is the price of a new 16TB drive. It's cheaper if I get it online, it will take a few days, but if I buy it today and get started on it, it's more expensive.

The difference would be about $200

I have tried in vain to stop it "phoning home" and I can't figure out a way to remove his email, I even tried getting onto his computer and blocking WD sending him emails, but either he reversed it, or they found another way.

Is there any other avenue I can consider? will this work?

3533
 
 
The original post: /r/datahoarder by /u/Chimetalhead92 on 2025-02-13 00:47:51.

Hi r/Datahoarder

I’m not really sure if this is the right place for this but I have zero experience archiving or backing up anything and I just kind of need to know where to start. What equipment to buy etc.

I’m very passionate about pro wrestling, and in an era of streaming (the WWE Netflix deal will be making decades of art inaccessible) and even more so small streaming services like IWTV that aren’t connected to a large corporation, or even just YouTube so much of the art I have come to love could be inaccessible.

Simply put, what kind of equipment or programs would I need to download and archive hundreds of hours of pro wrestling from online or streaming sources?

I’m such a noob I don’t even have a computer, just a barely used tablet and a phone.

Any help is greatly appreciated.

Thank you

3534
 
 
The original post: /r/datahoarder by /u/CRVDriver on 2025-02-13 00:31:30.

So I am an avid photographer and currently store my photos in my pCloud lifetime account as well as three drives (2 SSDs and one hard drive) which all have a copy of what is in my pCloud account. I really want an additional off-site backup, as I have been in a number of house fires and break ins and just want to be safe.

My YMCA has lockers that can be rented. I had the idea today of renting one and placing an SSD with an encrypted backup of my photos on it. Would this be a good idea? I figure the chance of it getting broken into would be less than that of a safe deposit box (who breaks in to a locker to steal underwear lol), and it would allow easier access because I can access it whenever I work out.

3535
 
 
The original post: /r/datahoarder by /u/Poisonslash on 2025-02-12 22:47:29.

Hello Data Horders,

I've been trying to check the health status of my drives because I was curious to see how they're doing, but I'm quite confused about the Wear Level Count in the S.M.A.R.T statistics.

Looking online I've found two totally opposite answers; the first being that a LOW wear level count indicates that the drive has barely any wear on it, but at the same time other's have said a low value indicates that the drive may fail soon.

First I checked using CrystalDiskInfo v7.6 which I already had installed, as well as on Samsung Magician. This came back with my SSD having a Wear Level Count of 1, and stated the drive is in good health:

https://preview.redd.it/mavs8g0gdsie1.png?width=1747&format=png&auto=webp&s=960e5918dbcb2ac4dc77141e29f79a2e238bdc61

I then realized that my CrystalDiskInfo was quite outdated, so I picked up the newest version and this is where the confusion spawned from. As you see below, it's stating that my drive health is at 1% and cautions about the Wear Level Count:

https://preview.redd.it/7aglzaoudsie1.png?width=999&format=png&auto=webp&s=67ad3606b3417408426c666a1fcddd121028142a

So I'm just wondering for those more familiar with these statistics, is this possibly just a false reading from the 9.5.0 version of CrystalDiskInfo, or does my drive actually have an issue? This is the main drive in my PC with the operating system, so it's not like I'm using it as storage, gaming or big file transfers. I would assume it shouldn't be dying this quickly compared to my other drives that I regularly write and delete from?

3536
 
 
The original post: /r/datahoarder by /u/ProfessionalSolid692 on 2025-02-12 20:25:18.

Do you keep spare drives around so that you can quickly replace a drive after a failure?

3537
 
 
The original post: /r/datahoarder by /u/heff66 on 2025-02-12 19:45:44.
3538
 
 
The original post: /r/datahoarder by /u/RFilms on 2025-02-12 19:25:43.
3539
 
 
The original post: /r/datahoarder by /u/marrthecreator on 2025-02-12 19:08:29.

Hello,

I come to you humbly! I'm not sure if this is the right place so please forgive me if this isn't the right place. I run a small company that’s hell-bent on making a difference in the lives of children who have or had an incarcerated parent. We’re working on a project to raise awareness of the challenges these children face through data-driven storytelling and visualizations.

I’m looking for reliable datasets related to:

  • The number of children with incarcerated parents (preferably broken down by state or region)
  • Demographic information (age, race, socioeconomic status)
  • Outcomes related to education, mental health, or other relevant indicators for these children

We’ve hit multiple roadblocks in our search so far. Many schools either aren’t capturing this data because it’s not seen as a priority, or they simply don’t have the capacity to track it. If anyone knows of publicly available data sources—government reports, research studies, or anything similar—I’d be incredibly grateful for your help. This data will help inform our advocacy efforts and inspire real change.

Thanks in advance for your time and suggestions!

3540
 
 
The original post: /r/datahoarder by /u/JoinHomefront on 2025-02-12 19:07:06.

Is anyone working on archiving the content at Deployed Medicine? I searched the subreddit and found no mention of it and don’t see it mentioned in the US Government ArchiveTeam wiki. The TCCC material is incredibly valuable and we could do with a backup. I don’t have personal device space to be able to fetch all of it.

3541
 
 
The original post: /r/datahoarder by /u/rbarr110 on 2025-02-12 19:01:11.

I have a boss that is insisting on a quarterly physical backup of the server data that can be stored off-site. Our server currently has daily cloud backup, but boss is paranoid that if the service we use shuts its doors, we wont have backups anymore.

Is there a NAS solution that will copy/backup the server data and say we pull a drive to store data offsite and install a drive in its place, then the data gets rebuilt on the new drive and continued backup until we swap a drive again quarterly. Does that make any sense?

3542
 
 
The original post: /r/datahoarder by /u/Low_Variety_4009 on 2025-02-12 18:35:02.

Hi everyone,

I wanted to contribute to the community of people who like to legally hoard backups of their movies and TV shows.

I scoured the internet to find information on how to create very high-quality recordings without the file size getting too large. Audials Movie 2025 is the best software I could find to achieve this. Yes, it's pretty buggy. And yes, anything above 1x recording speed seems to not work at all for most people, which could be false advertising. However, as far as I know, it's still the best option.

What settings should I use to achieve the best balance between quality and file size?

Base Profile: H.264 High Quality [GPU] - slow, large file

Container: MKV

Video Properties

  • Codec: H.264 (Yes, H.265 is more efficient, but it's pretty demanding on your system—impractical for background recording.)
  • Frame size: Original
  • Frame rate: Original
  • Bit rate: Exact 8544 kbit/s (Uncheck VBR)
  • GPU bit rate: Exact 8544 kbit/s (Uncheck VBR)

Audio Properties

  • Codec: AAC
  • Bit rate: Exact 320 kbit/s
  • Channels: Original

Recording Settings

  • Always use the internal Audials Movie 2025 web browser to record.
  • Use 1x speed.
  • Enable GPU encoding (if available).

If you have any questions regarding these settings, feel free to ask!

3543
 
 
The original post: /r/datahoarder by /u/West_Dickens on 2025-02-12 18:30:23.

https://youtube.com/user/ELPRESADOR

As per the title. Elpresador's YouTube channel was restored a few days ago by mere happenstance. According to his past livestream on his alt channel Quantrell Bishop, he accidentally logged into the old account with a prompt to appeal his original termination from 2018. He never got the chance back then, and decided to do it now on a whim not thinking anything would come of it -- but a few hours later -- it was back. It's remarkable and truly extraordinary.

For any ardent fans of Pres' videos in the past, or his career as one of the funniest entertainers online, now's your chance to dive through the treasure trove of his old videos that were seemingly lost to the sands of time and back them up to your hearts desires!

There's almost 5½K videos on there that are in desperate need of archival storage for the sapience of future generations. I felt like I needed to get this out there, because I know how gutted I was when I first saw the news of his channel being nuked...

Godspeed!

3544
 
 
The original post: /r/datahoarder by /u/idyllrain on 2025-02-12 18:29:46.
3545
 
 
The original post: /r/datahoarder by /u/Nandulal on 2025-02-12 18:20:09.
3546
 
 
The original post: /r/datahoarder by /u/ktktkt1 on 2025-02-12 17:37:12.

I have 300tb data and they are all in 12 to 24tb wd or seagate external usb hdds. Backup is 1:1 so same size drives but different brand or model.

I am considering LTO setup. It looks like lto6 drive (under $600) is much cheaper than lto7 (around $2500) used?

Should i go all in on lto 6 or bite the bullet and go with newer gen?

Can i bitlocker encrypt the tape? I use windows

3547
 
 
The original post: /r/datahoarder by /u/AshleyAshes1984 on 2025-02-12 16:33:43.
3548
 
 
The original post: /r/datahoarder by /u/thunderousqueef on 2025-02-12 16:17:26.

I just wanted to give you all a quick shout and relay how important you all are to data preservation during a time when evidence and history are being erased before our eyes.

Thank you. You will receive your flowers, if not tomorrow, the next day.

3549
 
 
The original post: /r/datahoarder by /u/nicsaweiner on 2025-02-12 15:33:53.

Just got an IT job replacing an old head who retired. His office is a dumpster fire, but as I clean it I keep finding more and more old software. There is seriously soooooo much of it. Hundreds and hundreds of burned CDs with sharpie labels. Tons of jewel cases and even binders filled with various software. It's random crap like OSHA spreadsheet software, about 50 different versions of Adobe products, or various Windows installs that go back to the early 2000s. I feel bad throwing it all out, but it's pretty much useless to me and it also might have sensitive company info on some of them, so I can't just dump them all on the Internet. I just wanted to share my find with some people who would appreciate it. In a better world I could dump a software mountain on you all right now.

3550
 
 
The original post: /r/datahoarder by /u/pepitamonster111 on 2025-02-12 13:30:11.

Federal data is disappearing. On Thursday, meet the teams working to rescue it and learn how you can help.

Join the Internet Archive and the Library Innovation Lab on Feb. 13, 3pm Eastern for a free webinar exploring the terabytes of data they have already saved and how to access it.

https://www.muckrock.com/news/archives/2025/feb/10/federal-data-is-disappearing-on-thursday-meet-the-teams-working-to-rescue-it-and-learn-how-you-can-help/

Register: https://us02web.zoom.us/webinar/register/WN_YEWblXS7Tge8ax_Io7WW8w#/registration

view more: ‹ prev next ›