It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
2501
 
 
The original post: /r/datahoarder by /u/jbwhite99 on 2025-03-28 17:10:29.

https://www.tomshardware.com/software/windows/new-advanced-filesystem-format-option-found-in-windows-11-preview-build-refs-supports-up-to-35-petabytes

I figured this forum probably has some users that could take advantage of this...

2502
 
 
The original post: /r/datahoarder by /u/elkinm on 2025-03-28 17:00:02.

I have an external 2TB plater drive that is starting to fail. I found that new 2TB drives cost about $70 while a 2TB SSD only costs $100 or not much more so why not get an SSD.

So what is the cheapest 2TB external SSD? Here are some I found for $100.

https://www.amazon.com/gp/product/B08FSNKNSV

https://www.amazon.com/gp/product/B0DWHV2QB6

I know I can get faster drives by spending more, but I don't care about speed as this is a platter drive replacement, I mostly care about price and reliability. Which of these drivers would be best or is there another one I can find for even less money. And I also don't need a bigger drive as I have other larger drivers for that. Thank you.

2503
 
 
The original post: /r/datahoarder by /u/MrSneaki on 2025-03-28 16:31:37.

Hi, TIA for any insight!

Want to find a good external SSD for file storage. Won't be using it for any active tasks, only for storage, so it doesn't need to be fast. 0.5 TB ought to be more than enough room for the things I need to store.

Any suggestions, either brands or models? Hoping I can find something budget-friendly, since speed isn't a factor.

Edit: Would an HDD potentially be a better / cheaper solution for long term storage? Is there any definitive idea whether HDD or SSD would last longer?

2504
 
 
The original post: /r/datahoarder by /u/Daniel_Delgado on 2025-03-28 15:48:10.

Hello, I want to encrypt my 4TB and 18TB HDDs, Seagate Iron Wolf and Exos, Windows 10 as my OS,

I saw video on youtube that encryption could sugnificantly affect the write performance of encrypted HDD,

and want to know whether its true or not before i encrypt my disks.

I want to encrypt the entire drives.

I am planning to use Vera Crypt but I am also open to suggestion of encryption software.

I need to transfer relatively large amounts of data (100s GBs / TBs) across those disks

Thanks for all the answers

2505
 
 
The original post: /r/datahoarder by /u/Keeper-Name_2271 on 2025-03-28 15:46:21.

how do i even know what i will need? i plan to download all bollywood songs. but what next? i've few days before internet lockdown occurs so...

2506
 
 
The original post: /r/datahoarder by /u/tehDude67 on 2025-03-28 15:36:54.

Newbie I just bought my 1st 2 hard drives for my Nas and I read you could run a test on the hard drives and if the did not pass server part deals would exchange them. Could someone please help as which program I am supposed to have thank in advance

2507
 
 
The original post: /r/datahoarder by /u/User_3614 on 2025-03-28 14:29:04.

I used to think that default default allocation unit size was 4 kB (4096 B).

I have a recent drive that was formatted with 64 kB allocation unit size, I would currently like to reformat it with the smallest unit size.

In current menu in Win 10 Pro, the smallest option I select is 8 kB (so 8192 B), and if I select the "Default " size it also results in 8 kB.

Is it something that changed in OS? Is it something that depends on the drive? (It's a 26 TB drive by the way, file system is NTFS.)

Also, why can I selected smaller unit sizes?


Please stay focused: I want to know if default size changes, what does the unit size choice that is offered depend on (OS? Drive?), why I can't select smaller. I'm not interested in yet another debate on the pros and cons of various sizes.

2508
 
 
The original post: /r/datahoarder by /u/luxfc on 2025-03-28 14:10:36.
2509
 
 
The original post: /r/datahoarder by /u/thebigscorp1 on 2025-03-28 13:23:22.

One thing I always feel bad about is altering something that has been a part of my NAS for years because it's sorta erasing history. I also like the idea of seeing the evolution of something over time (think like a Minecraft build timelapse). It's just naturally satisfying.

2510
 
 
The original post: /r/datahoarder by /u/Fulcro97 on 2025-03-28 11:59:45.

Hi guys, looking for a bigger storage solution to store my videos. Right now I have three WD 2.5” portables drives (2,4,5TB) but it’s not enough anymore. I found this Seagate for 340€ and it seems one of the best deal out there right now (all my WD will become a backup solution) and I wanted to ask if it’s a good drive or it’s better to invest in WD (the same storage is 100€ more expensive here in Italy) or something else (I’m planning a NAS Synology later this year if money will allow)

Thank you in advance guys

2511
 
 
The original post: /r/datahoarder by /u/Eisenstein on 2025-03-28 10:04:00.

Where did it come from?

A little while ago I went looking for a tool to help organize images. I had some specific requirements: nothing that will tie me to a specific image organizing program or some kind of database that would break if the files were moved or altered. It also had to do everything automatically, using a vision capable AI to view the pictures and create all of the information without help.

The problem is that nothing existed that would do this. So I had to make something myself.

LLMII runs a visual language model directly on a local machine to generate descriptive captions and keywords for images. These are then embedded directly into the image metadata, making entire collections searchable without any external database.

What does it have?

  • 100% Local Processing: All AI inference runs on local hardware, no internet connection needed after initial model download
  • GPU Acceleration: Supports NVIDIA CUDA, Vulkan, and Apple Metal
  • Simple Setup: No need to worry about prompting, metadata fields, directory traversal, python dependencies, or model downloading
  • Light Touch: Writes directly to standard metadata fields, so files remain compatible with all photo management software
  • Cross-Platform Capability: Works on Windows, macOS ARM, and Linux
  • Incremental Processing: Can stop/resume without reprocessing files, and only processes new images when rerun
  • Multi-Format Support: Handles all major image formats including RAW camera files
  • Model Flexibility: Compatible with all GGUF vision models, including uncensored community fine-tunes
  • Configurability: Nothing is hidden

How does it work?

Now, there isn't anything terribly novel about any particular feature that this tool does. Anyone with enough technical proficiency and time can manually do it. All that is going on is chaining a few already existing tools together to create the end result. It uses tried-and-true programs that are reliable and open source and ties them together with a somewhat complex script and GUI.

The backend uses KoboldCpp for inference, a one-executable inference engine that runs locally and has no dependencies or installers. For metadata manipulation exiftool is used -- a command line metadata editor that handles all the complexity of which fields to edit and how.

The tool offers full control over the processing pipeline and full transparency, with comprehensive configuration options and completely readable and exposed code.

It can be run straight from the command line or in a full-featured interface as needed for different workflows.

Who is benefiting from this?

Only people who use it. The entire software chain is free and open source; no data is collected and no account is required.

Screenshot


GitHub Link

2512
 
 
The original post: /r/datahoarder by /u/DoubleDickWilly on 2025-03-28 09:47:34.

I have a pc with an SSD for OS/games/programs and a 2TB HDD for documents/media. I have a 4TB HDD in an external enclosure which I want to use as backup for the 2TB HDD. I intend to manually make a backup once per month.

I'm lost on what free software to use to keep a good backup. Any recommendations on what I should use?

I'm also confused on the difference between image backups and loose files. Is there any benefit to an image backup if the drive I want to backup doesn't have my OS on it?

2513
 
 
The original post: /r/datahoarder by /u/DearPlankton on 2025-03-28 05:56:38.

I have 3 hard drives in my PC case and I want a cold storage backup solution for them. Is a 2-4 bay DAS overkill for my use case? It's nearly 2-3x the price of a docking station. Are there any reliability concerns from one to another?

2514
 
 
The original post: /r/datahoarder by /u/OmegaAOL on 2025-03-28 04:46:38.

I have a Seagate laptop hard drive. Relatively new, power on about 500 hours. I am planning to use this drive to store old backups of my C drive (over 2 years old).

These backups would be nice to have but aren't the most valuable thing in the world. I access them maybe once in 3 months. I want to move them off my primary data drive because they take too much space in the first place and they take doubly as much space on the backup of that drive.

I am planning to just leave these in cold storage and not touch them for a long time. Basically like a time capsule of sorts. Can anything happen to a unused hard drive?

My current backup schedule is once a day incremental to primary data/backup drive, and once in 2 days incremental to secondary backup drive. Once a month full for both. My old backups are taking up a lot of space I can use for other more valuable things so I would like to offload them out of my backup chain.

2515
 
 
The original post: /r/datahoarder by /u/Chi90504 on 2025-03-28 04:32:03.

When it comes to Duplicate file finding programs I can't seem to find one that does quite what I want

Duplicate Cleaner Pro comes pretty close and it's what I'm using right now but I'm hoping someone here might know one that does what I want properly

basically I want it to treat a 'hard linked' file for what it is a single file in multiple locations on the hard drive

Duplicate Cleaner Pro can file duplicate files and make them into a single hard linked file and that's not uncommon but where it falls down and where every other program I can remember trying falls down is what happens in future searches ... Duplicate Cleaner Pro has 2 settings 1. Ignore Hard linked files meaning if I've previously hardlinked 3 copies of the same file into a single file in 3 places but then for some reason download that same file again into a 4th location it won't detect the duplicate file because one of the two files is hardlinked and thus with that setting the program completely ignores it setting 2. Don't Ignore Hard linked files meaning it 'finds' 4 duplicate files when there's only 2 with one of the 2 being in 3 locations it will also even before the new copy of the file keep finding the hardlinked file as duplicates against itself twice over pretending the single file is 3 separate identical files

2516
 
 
The original post: /r/datahoarder by /u/Few_Frosting_5343 on 2025-03-28 03:40:33.

Preferably from social sciences. Please help!

2517
 
 
The original post: /r/datahoarder by /u/WarmFinding662 on 2025-03-28 02:15:19.

just curious! Maybe this is information I could find somewhere but I am curious.

2518
 
 
The original post: /r/datahoarder by /u/Dead_Cowboy_ on 2025-03-28 01:36:45.

Hello there,

I am currently in the process of reworking my network storage. As it happens I am now in the need for some new drives. As an avid used buyer I was also looking into some older HDDs to do the job. However I am now wondering if a new drive would not give more bang for my bug.

My main problem however is that I don't actually need That much storage of high quality. I have a bunch of data I would not loose any sleep over missing, so I will just slap that on some drives and call it a day. For the data I actually care about 4TB would be plenty and for those small sizes new drives are pretty expensive per TB. So I wanted to run some older 3x2TB drives in RAID 5. I even found an offer for 6x2TB drives for 100 bugs, giving me plenty of spares at around the price for a new 4TB drive. However these drives are around 10 years old. Similar things hold true for some other platters I found with a bunch more than 50k hours, so those spares will likely be needed.

The usage however will be pretty light. This is not stuff I need to read or write to often, so most of the time the drives will be off. On average I access it maybe once a week or there about.

Eventually I will also setup a true, independent backup for the data, so I am not terrified of either the Raid nor single platter failing, but I want to delay playing that card for as long as possible.

With that said, the issue I am optimizing for is reliability per cost. I don't really care about speed. What could be exprected to last longer, redundancy with the life left in three crap HDDs with some spares or one fresh new drive? Or am I just too cheap and sould bite the bullet to fork over the necessary cash for a RAID of new disks?

Thank you for any advice :)

Tl;dr

What is more cost-effective as reasonably safe storage, bad drives in Raid or one single new drive?

2519
 
 
The original post: /r/datahoarder by /u/TTVRaptor on 2025-03-28 01:21:33.
2520
 
 
The original post: /r/datahoarder by /u/Chygoda955 on 2025-03-27 03:59:37.

Hey so I just had someone selling me an lto7 drive supposedly external use ready. Just arrived and it's very clearly not and external drive and found it's mean to be put into a quantum robot library. Any chance there is an enclosed I can get or rig up to make this useable separate from the library?

2521
 
 
The original post: /r/datahoarder by /u/artemis73 on 2025-03-28 01:18:47.

Hello,

Do any of you fine folks know of an app that I can use to merge similar folders I've got littered all over my NAS?

At one point I had 3 external drives that have similar files and folders backed up over the years with maybe some sublte differences. I imported all the data from these drives with the hope that I can curb the mess on the NAS and then use these drives as an external backup medium again.

Do you all know of an app that I can use to help me comb through about 8 TB of data and maybe merge similar files and folders? Any help would be appreciated.

Thanks!

2522
 
 
The original post: /r/datahoarder by /u/skynetarray on 2025-03-28 01:16:54.

I want to collect some knowledge from YouTube with Pinchflat.

I’m searching for YouTube channels that provide knowledge and information, whose target is to educate, especially those who go into deep detail and wrap it up nicely.

Kurzgesagt or Veritasium are probably good examples for what I mean, also melodysheep.

Do you have some more suggestions?

2523
 
 
The original post: /r/datahoarder by /u/RepentHarlequin73 on 2025-03-27 23:53:38.

What ocr app can reliably process pages featuring a mix of roman letters, japanese kana and kanji, korean hangeul and hanja, and chinese hanzi? I'd prefer Macos apps or scripts. Thanks !

2524
 
 
The original post: /r/datahoarder by /u/J00433996 on 2025-03-27 22:12:16.

What’s the difference between verbatim m discs and such. I read a lot about verbatim being bad but nothing on the mdisc website linked on Wikipedia. Is it better or the same or what? I just want a disc I can store a bit of data on that’ll last a long time in reasonable conditions hands off. Ideally like 100-200 some years. Cold storage.

2525
 
 
The original post: /r/datahoarder by /u/drupadoo on 2025-03-27 21:26:35.

Run a VPN server and VPN in? Open the ports and mount directly? Just use a webui like owncloud instead of mounting?

Also want to prevent it from constantly trying to reconnect when I am remote, but connect automatically when home

view more: ‹ prev next ›