It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
276
 
 
The original post: /r/datahoarder by /u/Putrid_Draft378 on 2025-07-24 13:20:46.
277
 
 
The original post: /r/datahoarder by /u/ActuallyApathy on 2025-07-24 12:57:52.
278
 
 
The original post: /r/datahoarder by /u/Duck_Dur on 2025-07-24 12:33:49.

Hello all,

How should I go about upgrading my NAS drives, I currently have 4 8TB drives in a RAID 10 config, how would I upgrade my drives to say 12TB drives without losing any of my data during the upgrade while still keeping RAID 10?

279
 
 
The original post: /r/datahoarder by /u/SweetRefrigeratr3012 on 2025-07-24 09:49:52.

Hi everyone,

I’m using dupeGuru to find duplicate photos, but I’m running into two big problems, and I’m wondering if I’m missing a setting or doing something wrong.

  1. It doesn’t find all duplicates in one scan! I have a large photo collection (over 100,000 files). When I run dupeGuru, it only finds some hundred duplicates. Then, after deleting them and scanning again, it finds more. I have to repeat this process many times. Is there a way to make it find all duplicates in one go?
  2. Sometimes when I find duplicates, I can’t select them to delete right inside the app (the checkbox is greyed out). Instead, I have to click the result, open the folder, and manually delete the file in Explorer.

Any help or tips would be really appreciated. Thanks in advance!

280
 
 
The original post: /r/datahoarder by /u/hiroo916 on 2025-07-24 09:21:53.
281
 
 
The original post: /r/datahoarder by /u/maxtrix7 on 2025-07-24 09:03:37.

Hi, I'm looking for an affordable way to fill my new NAS without breaking the bank.

HDD Toshiba MG07ACA14TE 14 TB 3,5" 8,89 cm 6G SATA 7,2K P/N: HDEPW10CGA51

The listing says it comes with 1 year of warranty and costs 160 euros

Worth to buy? I want to populate a Synology DS418play.

282
 
 
The original post: /r/datahoarder by /u/mikeage on 2025-07-24 09:02:49.

Hi, I have about 1800 journal articles archived and I'm looking for an easy way to query them. All have full text (no weird OCR limitations), but they're in different languages with a lot of transliteration (and often inconsistently so), so I'm thinking that a simple keyword search is probably not sufficient.

I use paperless-ngx to index documents, and I looked at adding paperless-ai to it, but when I tried with my current archives, I was very underwhelmed (and frustrated; it tagged a lot of my stuff with nonsense and the Reset option, which I understood from the documentation would remove the changes it made, didn't, so I'm a bit bitter about having to manually undo a lot). But in any case, the way it organizes by correspondent and type is probably not really what I want.

Any suggestions for something that might be more suited for this type of indexing?

283
 
 
The original post: /r/datahoarder by /u/Linuxdr0ptips on 2025-07-24 08:37:16.

Do you think it is reasonable to pay 878USD for a WD SN850X 8TB nvme ssd ? Or there are any other SSD to recommend?

284
 
 
The original post: /r/datahoarder by /u/manzurfahim on 2025-07-24 06:20:00.
285
 
 
The original post: /r/datahoarder by /u/wooper91 on 2025-07-24 02:30:51.

Hey all,

In short I’m going to be visiting my family back in their home country it’s been a while since I’ve gone almost a decade. Part of what I wanted to do was digitize the family photos my mom has in our house over there. These photos go back all the way to my great grand parents at least.

I’m curious what sort of scanner/ digitizer I can use to scan them and put them in my laptop. Only real requirement is that it has to be portable since I’ll be flying

If you need any more details plz let me know I’ve never purchased something like this so I’m not sure what else I’d need to consider

286
 
 
The original post: /r/datahoarder by /u/x23_wolverine on 2025-07-23 23:05:37.

Hi all, I want to build/buy a nas system. I want two things out of it, and am having a hard time understanding how the plex system interacts with works with storage of non-plex things. I have a fairly large collection of 3D files that I want to back up, and be accessible from all of my computers, as well as some music, pictures, videos, files etc. I also want to build a plex server and rip my large dvd collection to it eventually and have that streaming to the tv's throughout the house. Do I need two different NAS to do this? one that has plex running, and one that is more of just a storage system? Can Plex run on the Ugreen OS, do I need to install a third party NAS OS to get Plex to run?

287
 
 
The original post: /r/datahoarder by /u/CreepyLui on 2025-07-23 22:42:48.

Anyone know of an IG downloader similar to myFaveTT? More specifically, I'm looking for a downloader that makes an HTML of followed users, similar to what myFaveTT does.

288
 
 
The original post: /r/datahoarder by /u/Broad_Side on 2025-07-23 21:41:50.

I know this cuts against the grain a little, but any had expericance with this card and windows storage spsaces? oir even windows in general?

289
 
 
The original post: /r/datahoarder by /u/BDTrey8 on 2025-07-23 19:38:33.

I have a lot of random data stored on a bunch of different devices(about 5-8tb of photos, videos, 3d files, etc.).I want to get everything centralized, and backup the important stuff. Easily accessible long term storage essentially. I was just going to get a nas, but im guessing that’s overkill. I landed on a Raid enclosure, but everyone says raid software is better. So would it make sense if I just did both raid software and the enclosure? What should I look out for if I did got the Raid Sw and HW route?

290
 
 
The original post: /r/datahoarder by /u/Pessimistic_Gemini on 2025-07-23 19:02:49.

So I was looking around the web, all the while backing up one of my drives into another 4TB drive and I was looking through DiskPart to see what the Default Allocation Size was for my 22TB drive. Apparently it was around 8192K while its Offset was at 1024 KB.

I was looking to format it again to a different Allocation size to see if it would not result in such a bloated file size for one of my copied drives but wasn't sure which one would work for it. As it is currently, the options are 2048K, 4096K, 8192K (Default), 16384K, 32768K.

If anyone with this said drive has any idea which is recommended, I would like to know whenever you have the time. Thanks.

291
 
 
The original post: /r/datahoarder by /u/ToneThugsNHarmony on 2025-07-23 18:42:45.

I have a hard drive that is about 5 years old that I used when I primarily used a MacBook Air. I believe that it was formatted in EXFAT, but I’m not certain.

Haven’t used the hard drive in years, but now I need to and don’t have a computer at home.

Rather than purchasing a new Apple product, I would prefer to just get a cheap Chromebook, but like I said I’m not 100% certain of the formatting, so…

  1. Is there a way to figure out the formatting of the hard drive without buying a new computer?

And

  1. If I buy a Chromebook is it likely to be able to read the hard drive?
292
 
 
The original post: /r/datahoarder by /u/SwingDingeling on 2025-07-23 18:34:01.

What's the difference? And what decides if it ends up LQ or not?

293
 
 
The original post: /r/datahoarder by /u/jermaine13 on 2025-07-23 18:29:37.

Hello guys!

Ok, so I am backing up DATA from my old HDD to my new HDD (external 26TB for backup)

Everything seemed fine, but when I took the external HDD to another computer, all data I backed up was WIPED. :)

I was able to recover some from using chkdsk x: /f command

Problems I did (my fault for sure):

  1. I put a fan in the same surge protector near the hard drive and it disconnected the hard drive (I think this is what caused the files to be wiped afterwards)

  2. I tipped over the hard drive like a tard. Although, this didn't seem to cause issues somehow.

  3. I moved the hard drive's position while it was transferring files just playing/fidgeting with it like an idiot.

  4. Fan speed may have been too high blowing air on the hard drive causing vibrations PERHAPS (or fan was placed too close).

I think #1 is the main issue, because I just plugged the fan back in the same outlet (like a tard)

Ok, so:

What I want to know is:

What can I do that this doesn't happen again.

It could be so many things.

The hard drive that I'm transferring from is also failing (yellow condition in Crystal Info) hence the backup, which I didn't do properly because I did cut+paste (like a moron).

Any help or advice for me?

Thank you!!!

294
 
 
The original post: /r/datahoarder by /u/sasukefan780 on 2025-07-23 17:20:37.

Hello all!

I am student working at an archive at the moment and have been tasked with finding a new external hard drive solution for our data. I am a bit of a noob datahoarder, so I would really appreciate some advice. Don't worry, we also use a cloud based solution to store our data, these hard drives are for redundancy.

Currently we are using over 20 G-DRIVE 4TB drives (model number: 0G02537), and while these drives are still quite functional, they are getting quite old.

I have looked into both external HDD and external SSD options, but thanks to some of the posts on this subreddit about SSD cold storage issues, I believe that an external HDD will serve my interests best.

Based on reviews from PCMAG, wired, and many others that I should have written down, I have been leaning towards recommending we use either the "Western Digital My Book 24TB" or "Western Digital Elements Desktop HDD Storage 24TB".

Please let me know if you have any positive/negative experiences with these hard drives, or if you can recommend an alternate hard drive for me to consider.

Thank you all!

295
 
 
The original post: /r/datahoarder by /u/HAFSIX on 2025-07-23 16:35:57.

new to this space.......get it, space....data ;)

Anyways enough with the dad jokes, my main question is about a LSI 9300 16i unraid card, are they just plug and play with windows 11?

I ask because I got 12x 250gb SSD and I was looking to make it into one large array for a steam library, if there is a better way to do that then a LSI 9300 16i im all ears, never connected more then a couple of drives in one PC at any given time so im personally in untested waters and looking for advice from you guys.

296
 
 
The original post: /r/datahoarder by /u/suralya on 2025-07-23 15:55:03.

Hey everyone, so I could really use some support. been a long time lurker but havent really dipped my toes into any serious hoarding. but I had a pool of 7 drives ( a mix of internal and external )

|| || |ST2000DM008-2FR102|HDD|2.00 TB|1.82 TiB|Healthy|OK| |ST8000DM004-2U9188|HDD|8.00 TB|7.28 TiB|Healthy|OK| |SAMSUNG HD103UJ|HDD|1.00 TB|0.93 TiB|Healthy|OK| |WDC WD4001FAEX-0|HDD|4.00 TB|3.64 TiB|Healthy|OK| |OCZ-VERTEX3|SSD|120 GB|112 GiB|Healthy|OK| |CT2000T500SSD5|SSD|2.00 TB|1.82 TiB|Healthy|OK| |WDC WD20EZRX-22D8PB0|HDD|2.00 TB|1.82 TiB|Healthy|OK|

All of these were a mixture of ancient drives to new drives a few months old in the process of transferring data and consolidating. Most of my essential data is backed up on separate backup but I am absolutely gutted right now. Was tinkering around with the spaces, ended up duplicating the space names, corrupting the pool and losing the space. My dumb fault and ill mourn in time.

but how can I prevent that from happening again? how can I learn from my mistakes? I dont want to touch Windows Storage Spaces again. Ill invest in newer drives if I have to, and research DAS and raids and all that other stuff. im a sponge willing to absorb all the information I can. I am assuming all my data is gone and ill have to spend the next few weeks trying to recover what I can from the drives (im not formatting them or using them)

I know my thoughts are everywhere and I apologize but my dad taught me almost 40 years ago when you mess up, there is no shame asking for help. so please..help. im in Canada, im a broke disabled dad. but im down to learn.

297
 
 
The original post: /r/datahoarder by /u/MelodicRecognition7 on 2025-07-23 15:16:25.

This is a follow-up to my previous thread https://old.reddit.com/r/DataHoarder/comments/1hytjia/transcend_ssd230s_4gb_teardown_and_cooling_upgrade/m6k5ifi/

There are just too many posts so I've decided to start a new thread.

TLDR of the old story: Transcend SATA SSD model 230S has faulty firmware and bad cooling desing which leads to severe throttling, complete drive hangs and SATA link resets, and buildup of reallocated sectors. Highly likely the core issue is the buggy firmware rather than a bad cooling, so if you have firmware older than 22Z4X4IA then you should update as soon as possible. WARNING: firmware update will wipe the drive so you will lose all your data, make a full backup prior to updating. Use the official Transcend software to make a bootable USB drive, if you will not succeed then you could try the extracted firmware updater (Linux only): https://old.reddit.com/r/DataHoarder/comments/1hytjia/transcend_ssd230s_4gb_teardown_and_cooling_upgrade/mysc64p/ If your warranty is expired already then you should also modify the cooling by connecting the chips with the aluminum drive casing with a thermal pads. If your warranty is still valid then you might try to RMA the drives, especially if you have many reallocated sectors.

And now about the new model: I have returned 4 out of 8 drives (the 4 modified drives are obviously not eligible for RMA) and Transcend have sent me a new drives for replacement. The old drives were manufactured in the summer 2023, the new drives are manufactured in the summer 2025, just 1 month ago. The old drives have serial numbers starting with letter H (H690......), the new drives have serial numbers starting with letter J (J455......).

There is one difference in the SMART: the old drives have attribute "Offline data collection status: (0x80)" Auto Offline Data Collection: Enabled. The new drives have it disabled: "Offline data collection status: (0x00)" Auto Offline Data Collection: Disabled. Possibly this is a workaround of the bug in the firmware, you might want to disable it on your drives too.

There are no more differences in the SMART. Also there are no differences in the chips, the smi_flash_id tool by Vadim Ochkin ( http://vlo.name:3000/ssdtool/ ) report this:

Controller : SM2259AB
Bank00: 0x45,0x48,0x98,0x3,0x76,0x6c,0x0,0x0 - Sandisk 112L BiCS5 TLC 16k 1024Gb/CE 1024Gb/die 2Plane/die
...
DRAM Size,MB          : 2*512
DRAM Vendor           : Samsung

— SSD230S 4TB has just 1GB DRAM but it is still a TLC drive while other manufacturers have started to put QLC chips into their large capacity drives.

The old drives had a big hole between the SATA connector and the drive casing and you could see that the chips inside do not touch the aluminum casing: https://files.catbox.moe/ihgi89.jpg https://files.catbox.moe/ev1ptn.jpg

The new drives have a slightly modified casing with a different SATA connector which does not allow to see what's inside the case: https://files.catbox.moe/fi0x09.jpg but there are four small holes and if you shine a light into the left ones you will be able to see the chip through the right holes, the chip is still not connected to the aluminum casing: https://files.catbox.moe/6drot0.jpg https://files.catbox.moe/uwtktg.jpg

This makes me think that the source of the original problem was a bug in the firmware rather than a bad cooling, as Transcend still does not put thermal pads inside the SSD.

And a few comments: first of all I want to give a shout out to Transcend support team, the communication and RMA procedure was smooth and much better than I've experienced with some other brands, even "enterprise" ones (looking at you HPE)

As I've wrote in the previous thread,

Well, these drives are cheap for a reason, I guess no more Transcends for me too.

but then I've recalled that Samsung manufactured 870 Evo's using broken chips and shipped 990 Pro's with faulty firmware that was quickly killing the chips over time (same as Transcend SSD230S lol), that HP and Sandisk shipped their enterprise drives with a killswitch in the firmware that wiped the customers data after 40'000 power on hours, and that WD from being the best drives manufacturer turned into Aliexpress-level joke brand, and therefore I've decided to give Transcend drives another chance. I plan to build a storage array with 4x PCIe v4 NVMe drives model 250S, if you know any issues or nuances about these drives then please tell in the comments.

298
 
 
The original post: /r/datahoarder by /u/mtlynch on 2025-07-23 14:40:49.
299
 
 
The original post: /r/datahoarder by /u/thinvanilla on 2025-07-23 13:18:55.

Got my NAS a year ago and put 3x8TB drives in it set to SHR1 (Synology's RAID5), and recently started running out of storage so got 2 more 8TB drives and a plan to buy an 8 bay unit so I could make use of SHR2 (RAID6) and do more upgrades later on.

But I found out people try to stagger their drive purchases so it's less likely that two will fail at the same time. Given there are 3 drives which are from the same batch and age, should I replace one drive with one of the new drives I bought, put the old one on the shelf, let the new drive get some age (I could probably only give it 1 month of use though). And then once I've got the 8 bay I can add the old drive back into the array?

And by "replace" I mean put a drive in the empty bay, click on replace drive, it transfers the data across from one drive and starts using the new drive; it doesn't need to rebuild the database.

That way two drives (06/2024) are the same age and same wear, one drive is the same age (06/2024) but a bit less wear, and two drives are the same age (06/2025) but different wear. And yes I have backups so if I had 3 drives fail I could restore, but obviously want to avoid that. They're all WD Red Plus drives so I think they're pretty reliable.

300
 
 
The original post: /r/datahoarder by /u/iavine on 2025-07-23 11:02:34.

I am an old techie who hasn't kept up with anything recent and I am currently planning a new system* to keep my family's data safe-ish.

There are 4 laptops; aside from mine, they are used by non-tech kids. I plan to have an external ssd for each that will get differential backups made to them. Software tbd; they run windows, I run linux. I want to have a solution to backup those ssds that is minimum hassle/action required.

Key point: we will be travelling in a caravan so we have power and space limits and whatever it is will spend most of it's time powered down and carefully packed up to minimise risk. I am going for ssds for the first backup line as they do not need an external power source. Also, there won't be a reliable internet connection so cloud anything is right out.

Currently, I am thinking a mini PC with a appropriate size das that auto-runs a sync whenever one of the ssds (or a memory stick/card) is plugged into it. Is there an enclosure that already does this? Or any suggestions for a better method? I am wanting to have a setup where a) the kids just plug their ssd into their laptops semi-regularly (and their backup software does it's differential thing) and then b) I semi-regularly boot up this system and plug in their ssds (and it does it's sync thing). If the mini PC is the go, I am fairly sure I can navigate finding the right distro and automating the sync but any pointers are welcome.

* right now, backups are haphazard for all of us as the risk of losing or damaging our computers is fairly low. When we move into the caravan, I reckon the risk profile changes a lot though! I have a stack of old hdd with close to two decades of data-hoarding that need to be consolidated but most of that will be into offline mode and left with family for the next year. I probably don't neeeeeeed my stash of early noughties miscellanea when travelling, right? Right?

view more: ‹ prev next ›