It's A Digital Disease!


This is a sub that aims to bring data hoarders together to share their passion with like-minded people.

founded 2 years ago
3226
 
 
The original post: /r/datahoarder by /u/daronhudson on 2025-02-21 20:28:34.

Hi, I’m new to the whole storage game. I currently run a 32TB nvme system. I do however want to move away from storing everything on nvme just so I can prolong their lifespan a bit more. I’ll be doing general purpose storage and archiving.

I’m looking to get SATA HDDs on the cheap. I won’t need crazy amounts of storage, but ideally around 20TB across 7 disks with at least RAID5.

What would you recommend getting? If I can get more storage for less, that would be even better. I’m not looking to spend a crazy amount of money, but I would be willing to put down a few hundred bucks.
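
For sizing, the 20TB-across-7-disks target can be turned into a per-disk figure with a little arithmetic. A minimal sketch, assuming a single parity group where RAID5 gives up one disk's worth of space to parity (RAID6 gives up two) and ignoring filesystem overhead; the function names are illustrative only:

```python
def usable_tb(disks: int, disk_tb: float, parity_disks: int = 1) -> float:
    """Usable capacity of one parity group, ignoring filesystem overhead."""
    if disks <= parity_disks:
        raise ValueError("need more disks than parity disks")
    return (disks - parity_disks) * disk_tb


def min_disk_tb(target_tb: float, disks: int, parity_disks: int = 1) -> float:
    """Smallest per-disk size that reaches target_tb of usable space."""
    if disks <= parity_disks:
        raise ValueError("need more disks than parity disks")
    return target_tb / (disks - parity_disks)


if __name__ == "__main__":
    # RAID5 (one parity disk) and RAID6 (two parity disks) for the post's numbers.
    print(round(min_disk_tb(20, 7, parity_disks=1), 2), "TB per disk for RAID5")
    print(round(min_disk_tb(20, 7, parity_disks=2), 2), "TB per disk for RAID6")
    print(usable_tb(7, 4.0), "TB usable from seven 4TB disks in RAID5")
```

Under these assumptions, seven 4TB disks would clear the 20TB target in either layout, which is why fewer, larger disks are often the cheaper route.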

3227
 
 
The original post: /r/datahoarder by /u/exsuprhro on 2025-02-21 20:19:29.

Hopefully this is the right place. I'm wondering if anyone anywhere has tried to put together a comprehensive list of all the data sets under threat (that we know of), or already deleted?

I can't believe this is a conversation I'm having in the United States.

3228
 
 
The original post: /r/datahoarder by /u/Yacht_Taxing_Unit on 2025-02-21 19:46:36.
3229
 
 
The original post: /r/datahoarder by /u/Spiritual_Bar_9000 on 2025-02-21 19:36:04.

Hi guys, first time posting so please be gentle.

Looking to build a NAS for the first time after binge watching YouTube for 2 weeks.

Price is not an issue, but I do want to be cost-efficient (I don't want it underpowered, but there's no point in a 14900, right?)

Goals in decreasing importance:
1- data storage/backup (have 2 10tb and can shuck another 3 8tb externals if needed)
2- plex
3- pi-hole
4- vpn
5- experimentation (arrrr?)

This is just to get my feet wet, probably will end up building a second one if needed. So looking for best bang for the buck so to speak.

Also, any software or app recommendations? Still on the fence about unraid vs truenas. Heard containers or docker are nice? Definitely looking to remote in and automate pc/phone backups. Maybe sailing the seas?

If anyone has asked this before this year, I apologize first and would greatly appreciate a redirect.

3230
 
 
The original post: /r/datahoarder by /u/Insergence on 2025-02-21 19:22:06.

Just recently bought two $280 BestBuy 24TB Seagate Expansion drives and opened them up to find Barracuda labels. The drives are ST24000DM001, the specific model of the Expansion is STKP24000400, and the PN is 3JSAP4-570.

3231
 
 
The original post: /r/datahoarder by /u/Crafty_Split_1 on 2025-02-21 19:20:38.

I tried videodownloadhelper, but the video is out of sync with the audio in some places.

3232
 
 
The original post: /r/datahoarder by /u/PricePerGig on 2025-02-21 18:39:02.
3233
 
 
The original post: /r/datahoarder by /u/Celcius_87 on 2025-02-21 16:20:49.

I've only been hoarding data for a few years and so far I have about 675GB which is over 100k files. I know many here have MUCH more data though, and as my data grows I'm thinking about protecting the data. I have multiple offline backups but next I want to learn more about preventing corruption.

I use Windows 11 24H2 and currently just copy my data to external WD HDDs using Windows File Explorer, no 3rd party apps. I have DDR5 non-ECC memory. So far I've never had one of my files later become corrupted in my entire life (at least, that I'm aware of).

How can I verify the integrity of all my files after every time I do a copy to backups? How long does verification normally take? Also, is there anything I can do to further prevent corruption in the first place in case restoring the original file may not be possible?

Is it possible to do this while staying on Windows, or would you eventually have to switch to a different OS with a filesystem like ZFS? Is macOS any better than Windows in this regard?

Any resources for learning more about file verification and preventing corruption? Thanks
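
One common way to check a copy is to hash every file on the source and on the backup, then compare the two lists. A minimal sketch using only the Python standard library, which runs on Windows as well; the function names and the choice of SHA-256 are assumptions for illustration, not a recommendation of a specific tool:

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large files never have to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Map each file's path (relative to root) to its SHA-256 digest."""
    return {
        path.relative_to(root).as_posix(): sha256_of(path)
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }


def compare_manifests(source: dict[str, str], backup: dict[str, str]) -> dict[str, list[str]]:
    """List files missing from the backup and files whose contents differ."""
    missing = sorted(name for name in source if name not in backup)
    changed = sorted(
        name for name in source if name in backup and source[name] != backup[name]
    )
    return {"missing": missing, "changed": changed}
```

Calling `compare_manifests(build_manifest(Path(src)), build_manifest(Path(dst)))` after a copy reports anything that did not arrive intact. Because every byte on both sides has to be read, the run time is governed mostly by drive read speed rather than by the hashing itself. Saving the source manifest to a file also allows the same backup to be re-checked later for silent changes.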

3234
 
 
The original post: /r/datahoarder by /u/MidnightOpposite4892 on 2025-02-21 15:42:40.

I want to purchase something to keep my important data off my PC, something that I wouldn't carry around so it will always be at home and also reliable in terms of long term storage. Which one is better for my needs: an external SSD or HDD?

3235
 
 
The original post: /r/datahoarder by /u/neurocrash_ on 2025-02-21 14:36:22.

I've tried this app, and while it seems to identify the needed cuts, it crashes when you try to process, and is perhaps abandoned.

https://github.com/pathartl/BananaSplit

3236
 
 
The original post: /r/datahoarder by /u/IaryBreko on 2025-02-21 14:16:47.
3237
 
 
The original post: /r/datahoarder by /u/Jaded_System_7400 on 2025-02-21 13:52:46.
3238
 
 
The original post: /r/datahoarder by /u/DiskBytes on 2025-02-21 10:46:27.

I've just put some stuff onto LTO tape using mbuffer, and it reported the following summary:

summary: 18.8 GiByte in 6min 11.7sec - average of 51.9 MiB/s, 5x full.

What does the 5x full mean?
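
As a sanity check on the other numbers in that summary, the average rate can be recomputed from the size and elapsed time. A minimal sketch, assuming the rate unit is MiB/s and that "GiByte" means 1024 MiB; the function name is illustrative:

```python
def average_mib_per_s(gib: float, minutes: int, seconds: float) -> float:
    """Average throughput in MiB/s for `gib` GiB moved in the given time."""
    total_seconds = minutes * 60 + seconds
    return gib * 1024 / total_seconds


if __name__ == "__main__":
    # Figures taken from the summary line in the post.
    print(round(average_mib_per_s(18.8, 6, 11.7), 1), "MiB/s")
```

The result lands within about 0.1 MiB/s of the reported 51.9, a gap that is plausibly down to the size being rounded to 18.8 in the summary.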

3239
 
 
The original post: /r/datahoarder by /u/FishSpoof on 2025-02-21 10:40:44.

Does anybody have a plan for their data long term? I have tens of terabytes and I imagine by the time I'm 70 I'll have hundreds of terabytes or more, hopefully! Then what?

My kids will probably trash my stuff or list it on eBay.

Has anyone thought about this?

3240
 
 
The original post: /r/datahoarder by /u/Traditional_Media889 on 2025-02-20 20:54:39.

I hate discs and love my NAS... so it's time to rip them all.

Before I relearn what's changed in the world of ripping discs (it's been a few years since my last and tech has probably improved)... what are my options on my:

i9-13900KS

4090 24GB

HEAPS of space, but would 'like' 1-1.5GB per episode, with subtitles.

I'll be watching back on my 86" TV and one day, a home projector (happy to get them out of storage and re-rip then potentially).

Is CPU still better/slower than GPU, or has that changed?

I've just reinstalled HandBrake and am ready to go when they arrive.

If there are any tips, tutorials or help that anyone could offer in 2025, I'd love to hear it!

TNG has a special place in my heart and I want to do it justice but protect my purchase by not getting the discs out very often.

3241
 
 
The original post: /r/datahoarder by /u/XlhHarley_Greg on 2025-02-20 19:59:11.

Hello and good afternoon. Is there an AI program out there that will browse all the websites for a given area and report back on the findings?

Example: I want to see a list of all the hotels in Berkeley County West Virginia.

Right now all I get, via Google, is the hotels that pay to be listed in search. Even ChatGPT gave me only 10, and there are 20. Thanks

3242
 
 
The original post: /r/datahoarder by /u/Relevant-Team on 2025-02-20 14:54:09.

Europe's largest computer magazine, c't from Germany, has published new insights into this scandal. Unfortunately, I'm not able to provide the text of the article at the moment...

https://www.heise.de/select/ct/2025/5/2503107483634061867

At least the tools for identifying fake HDDs are available here:

https://www.heise.de/select/ct/2025/5/softlinks/ydqp

Please check your Seagate HDDs to see whether the ones you bought have deleted SMART values but high FARM values.

3243
 
 
The original post: /r/datahoarder by /u/Royal_Ad_9196 on 2025-02-20 12:11:27.

I have found the Seagate ST20000NM007D Exos X20 20TB HDD at 350€ in my country, but I also found this thread: https://forums.unraid.net/topic/146490-things-i-learned-about-the-seagate-exos-drives-and-how-to-fix-them-if-you-encounter-random-shutdowns-or-read-errors/

So I am also considering the Seagate IronWolf Pro 20TB, but those are 380€. Would I have the same problem with an enterprise drive? Is the IronWolf more trouble-free and quieter for the 30€ difference? I'm considering buying 3 for a raidz. Most importantly, I would rather not mess with the drive in case I void the warranty.

Thanks in advance

3244
 
 
The original post: /r/datahoarder by /u/Original-Climate-971 on 2025-02-20 05:27:15.

Hi, so I accidentally deleted everything on my external hard drives. I downloaded a restore program and restored many of the video files (without file titles, sigh). I noticed that some of the restored video files would play for some time and then go to a black screen until the video's running time ended. Furthermore, all of these files were restored at their original size, so files that play only partially still have the unplayable part tacked on. I wish I could keep the good parts of each video and delete the corrupted black-screen part. Is there a program that would help me sort the video files that have this corruption from the ones that don’t? Is there a way to somehow restore the video titles for easier sorting? Also, I’ve heard of video file repair programs; could those work in this situation?
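
One rough way to triage files like these is to measure how much of each file's tail is empty. A minimal sketch, resting on the loud assumption that the unrecoverable part was restored as zero bytes (it may instead hold unrelated leftover data, in which case this check finds nothing); the function names are hypothetical:

```python
from pathlib import Path


def last_nonzero_offset(path: Path, chunk_size: int = 1 << 20) -> int:
    """Return the offset just past the last non-zero byte (0 if the file is all zeros)."""
    end = path.stat().st_size
    with path.open("rb") as handle:
        # Walk backwards in chunks so the whole file is never held in memory.
        while end > 0:
            start = max(0, end - chunk_size)
            handle.seek(start)
            chunk = handle.read(end - start)
            stripped = chunk.rstrip(b"\x00")
            if stripped:
                return start + len(stripped)
            end = start
    return 0


def zero_tail_fraction(path: Path, chunk_size: int = 1 << 20) -> float:
    """Fraction of the file taken up by trailing zero bytes."""
    size = path.stat().st_size
    if size == 0:
        return 0.0
    return (size - last_nonzero_offset(path, chunk_size)) / size
```

Sorting the restored files by `zero_tail_fraction` would put the likely-damaged ones first. This is a sorting aid only: cutting a file at that offset does not guarantee a playable video, since many container formats keep index data at the end of the file.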

3245
 
 
The original post: /r/datahoarder by /u/jobedois on 2025-02-21 09:22:01.

Hello, sorry if this was asked before; I'm not sure what to search for.

Does anybody know of a program that lets me subscribe to YouTube (or other video sites) and displays the feeds (e.g. FreeTube style), where I can then download/archive single videos of my choosing for offline viewing without downloading the whole channel? TubeArchivist/Pinchflat/TubeSync seem to only archive whole channels, and most of the yt-dlp GUIs I could find only download a URL you paste to some folder (lacking the channel subscription/viewing feature).

I'd be very thankful for any tips!

3246
 
 
The original post: /r/datahoarder by /u/Ostenblut1 on 2025-02-21 09:05:49.

Hi everyone, I need to buy a USB drive or another secure storage solution for my recovery codes. I'm a somewhat anxious person. I have 3 2FA keys and I want to store my recovery keys on something really reliable.

3247
 
 
The original post: /r/datahoarder by /u/RainOfPain125 on 2025-02-21 08:55:27.

Hello friends,

I'm trying to work with the hardware I have - sadly all consumer stuff that doesn't support ECC RAM.

However I understand there are other means of trying to detect and correct errors, like the data integrity features of the Btrfs filesystem.

I'm wondering how far Btrfs can go in terms of detecting & correcting errors, as well as wondering if there are any other solutions within RAID software, etc.

3248
 
 
The original post: /r/datahoarder by /u/Bag_of_DIcksss on 2025-02-21 08:55:13.
3249
 
 
The original post: /r/datahoarder by /u/RainOfPain125 on 2025-02-21 08:44:39.

Hello friends,

If I understand things right, then it would seem that RAID6 would have poor scaling in terms of safeguarding against data loss via drive failure. No matter how many drives, only 3 drives have to fail to cause data loss.

If you're only using the minimum of 4 drives, then this might not be an issue.

But if you were using 12 drives, 20 drives, and so on, the odds that a drive failure will occur increase. If I ran 1,000 drives on RAID6, it would still only require 3 of them to fail to lose the data. This is what I mean by poor scaling.

So it begs the question: what setup do enthusiasts and/or datacenters use in their larger arrays to mitigate data loss? What is a setup that can scale better as more drives are added to the array?
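
The scaling concern can be put into numbers with a simple binomial model. A minimal sketch, assuming every drive fails independently with the same probability `p` over some window of interest and ignoring rebuilds entirely; the value of `p` below is made up for illustration, and splitting the drives into several smaller RAID6 groups is shown only as one commonly used layout, at the cost of two parity drives per group:

```python
from math import comb


def p_at_least(k: int, n: int, p: float) -> float:
    """Probability that at least k of n independent drives fail."""
    return 1.0 - sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k))


def p_loss_single_raid6(n: int, p: float) -> float:
    """One RAID6 group of n drives loses data once 3 or more drives fail."""
    return p_at_least(3, n, p)


def p_loss_split_raid6(n: int, group_size: int, p: float) -> float:
    """n drives split into equal RAID6 groups; any group with 3+ failures loses data."""
    if n % group_size != 0:
        raise ValueError("n must be a multiple of group_size")
    groups = n // group_size
    per_group = p_at_least(3, group_size, p)
    return 1.0 - (1.0 - per_group) ** groups


if __name__ == "__main__":
    p = 0.02  # hypothetical per-drive failure probability, not a measured figure
    for n in (4, 12, 24, 1000):
        print(n, "drives, one group:", p_loss_single_raid6(n, p))
    print("24 drives as three groups of 8:", p_loss_split_raid6(24, 8, p))
```

Under these assumptions the loss probability of a single group climbs steadily with drive count, while the split layout stays lower for the same number of drives because three failures only matter when they land in the same group.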

3250
 
 
The original post: /r/datahoarder by /u/Andre1661 on 2025-02-21 08:19:06.

I have concerns about a huge online information source that, even just a few months ago, I would have thought was a secure and publicly available database. I’m referring to the National Weather Service.

No matter which weather forecasting app you use on your phone or desktop they are all based on information that comes largely from a single source: the National Weather Service. For over a hundred years they have been collecting hourly weather data from thousands of weather stations, radiosonde balloons, aircraft, etc. It is a priceless trove of data and one which cannot be replicated for any money.

Recently I heard rumblings that corporations were hoping a Republican government would privatize that government agency, meaning one corporation would control all weather data. So, if your local TV station or phone app wanted to provide you with a forecast for the coming week it would have to shell out big money for the data. Every single week.

Given what has happened since the Musk-Trump Shredder of Lunacy has been let loose inside the federal government, I am worried that, once they have hit the high value targets (Accounting Office, the IRS, the OMB, etc.), they will turn their attention to the science agencies.

I would suggest that someone who has good database and data hoarding skills take a look at the weather service and NOAA websites and seriously consider starting to archive as much data as they can. Or provide us concerned citizens with some guidance about how we can help preserve it.

As I mentioned before, this is a priceless database and once it is sealed into a corporation’s server farm, will it ever be given back to the people who paid for it: the public?
