It's A Digital Disease!

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

2751
 
 
The original post: /r/datahoarder by /u/KongoOtto on 2025-03-09 11:28:56.

I'm thinking about buying some 20 or 22 TB drives from the Toshiba MG10F series.

Any thoughts or experience with those drives?

2752
 
 
The original post: /r/datahoarder by /u/GlaciarWish on 2025-03-09 09:21:29.

Hello everyone,

I deleted one of my media folders by mistake.

Thankfully there was no impact, as I perform a weekly SnapRAID sync and scrub.

While restoring the data, I noticed that inodes are not being restored for hardlinks, which creates duplicate remuxes in my case. Unfortunately, SnapRAID does not seem to restore the inodes.

Going forward, I will probably start using symlinks.

My only concern is that I have many files that match torrents at 99.9% and then download slightly different media. I had no issues with that in my hardlink setup.

Will this work with symlinks when a torrent downloads extra media at 99.9%?

I am worried that another drive will crash or be upgraded (in progress), and I will then end up with many hardlinks that no longer link, creating dupes, which is already stressful for me.

I know there are apps like jdupes, but I am not sure how accurate they are.

FYI, I am talking about 6000+ hardlinks between cross-seed and Plex.
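
For anyone checking a restore like this, one option is to compare which files still share an inode with which files have become separate copies. The following is a minimal Python sketch, not a SnapRAID or jdupes feature: it groups files by device and inode number to list hardlink sets, and separately groups files by size and SHA-256 to find byte-identical copies that no longer share an inode. The function names are hypothetical.

```python
import hashlib
import os
from collections import defaultdict


def hardlink_groups(root):
    """Return {(device, inode): [paths]} for inodes with more than one path under root."""
    groups = defaultdict(list)
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue  # symlinks are not hardlinks; skip them
            st = os.stat(path)
            groups[(st.st_dev, st.st_ino)].append(path)
    return {key: sorted(paths) for key, paths in groups.items() if len(paths) > 1}


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large media files are not loaded into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def unlinked_duplicates(root):
    """Return lists of byte-identical files that live on different inodes."""
    by_size = defaultdict(dict)  # size -> {(device, inode): first path seen}
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue
            st = os.stat(path)
            by_size[st.st_size].setdefault((st.st_dev, st.st_ino), path)
    result = []
    for inodes in by_size.values():
        if len(inodes) < 2:
            continue  # only one inode of this size, so no candidate duplicates
        by_hash = defaultdict(list)
        for path in inodes.values():
            by_hash[sha256_of(path)].append(path)
        result.extend(sorted(paths) for paths in by_hash.values() if len(paths) > 1)
    return result
```

Files listed by `unlinked_duplicates` are candidates for re-linking; files listed by `hardlink_groups` are already sharing storage.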

2753
 
 
The original post: /r/datahoarder by /u/ViperSteele on 2025-03-09 04:44:32.

Data Hoarding Setup Review

Hi, I'm considering my first serious data hoarding setup. I'm planning on using a DAS because I really only use my computer in my home office. I don't watch our TVs much, and if I do, it's something my wife and I want to watch together. I have a Philips 34" that I enjoy watching shows and reading on. So I don't think a NAS is something for me, and I don't have the knowledge or time to jump into a NAS setup.

Equipment

  • MacBook Air M2 1TB
  • DAS: Terramaster D2-320. Reasons: Easy to use for beginners, Thunderbolt connection for fast speeds with my MacBook Air, compact and aesthetically pleasing on my desk
  • 2 × 8TB Western Digital Blue HDD. Reasons: Good price point, 8TB seems like a reasonable starting capacity, and I will use Backblaze for redundancy/backup

Backup Strategy

  • Backblaze Personal at $100 a year to back up both my MacBook and external drives
  • RAID 0
  • Considering an additional external drive for Time Machine backups
  • This provides redundancy if any of my 8TB drives fail

What do you think of this strategy? Any critiques or personal anecdotes about similar setups would be appreciated. I haven't purchased anything yet, so I'm open to advice or scenarios I might not have considered. Thanks in advance!
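
As a quick sanity check on the RAID choice: with two drives, RAID 0 stripes data across both and survives no drive failures, while RAID 1 mirrors them and survives one. A minimal Python sketch of that arithmetic follows; the function name is hypothetical and the figures ignore filesystem overhead.

```python
def two_drive_array(level, drive_tb):
    """Return (usable_tb, drive_failures_survived) for a 2-drive RAID 0 or RAID 1."""
    if level == 0:
        # Striping: capacity adds up, but losing either drive loses the array.
        return 2 * drive_tb, 0
    if level == 1:
        # Mirroring: capacity of one drive, and one drive may fail.
        return drive_tb, 1
    raise ValueError("only RAID 0 and RAID 1 are modeled here")


for level in (0, 1):
    usable, survived = two_drive_array(level, 8)
    print(f"RAID {level}: {usable} TB usable, survives {survived} drive failure(s)")
```

With RAID 0, any redundancy in this plan would have to come from the Backblaze and Time Machine copies rather than from the array itself.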

2754
 
 
The original post: /r/datahoarder by /u/artesons on 2025-03-09 04:26:46.
2755
 
 
The original post: /r/datahoarder by /u/Empty_Use6095 on 2025-03-09 03:36:40.

Hello to all, I thought this would be the best place for me to ask this kind of question. I have just picked up a Buffalo TeraStation Pro NAS. It has 4 bays and is pretty old. I know it will not work with Windows 11, but is there a way I could get it to work with a Linux distro? Any help will be greatly appreciated.

2756
 
 
The original post: /r/datahoarder by /u/OptionSuspicious3428 on 2025-03-09 02:25:38.

Looking to create my DAS on the cheap. Is this housing a good deal? What other options should I be looking for?

https://preview.redd.it/ejpcaf5arkne1.png?width=1765&format=png&auto=webp&s=b28fae9a27ca59d9c451132854f04e0680c627a4

2757
 
 
The original post: /r/datahoarder by /u/FruitLong8561 on 2025-03-08 14:34:05.

Hi! I was wondering about the best methods used currently to fully digitize a scanned book rather than adding an OCR layer to a scanned image.

I was thinking of a tool that first does a quick scan of the file to OCR the text and preserve images, then flags low-confidence OCR results so a human can review them and make quick corrections, and finally outputs a structured digital text file (like an EPUB) instead of a searchable bitmap image with a text layer.

I’d prefer an open-source solution, or at the very least one with a reasonably priced option for individuals who want to use it occasionally without paying for an expensive business subscription.

If no such tool exists what is used nowadays for cleaning up/preprocessing scanned images and applying OCR while keeping the final file as light and compressed as possible? The solution I've tried (ilovepdf ocr) ends up turning a 100MB file into a 600MB one and the text isn't even that accurate.

I know that there's software for adding OCR (like Tesseract, OCRmyPDF, Acrobat, and FineReader) and programs to compress the PDF, but I wanted to hear some opinions from people who have already done this kind of thing before wasting time trying every option available to know what will give me the best results in 2025.

2758
 
 
The original post: /r/datahoarder by /u/uboofs on 2025-03-09 03:17:40.
2759
 
 
The original post: /r/datahoarder by /u/throwaway69xx420 on 2025-03-09 03:06:10.

Howdy peeps

I've begun my foray into data hoarding. I'm at the point where I need to upgrade from 12TB! I recently bought the Seagate Expansion 24TB external drive from Best Buy for $279. I currently only have a Dell Optiplex acting as a server for the usual stuff.

Curious what the pros and cons of shucking are. Should I shuck before or after the warranty on the Seagate drive expires? One concern (not sure how accurate it is) is that I will void the warranty once I shuck, which raises the question of whether I should shuck now or wait until the warranty expires. Another concern is that I might break my drive in the process. Any advice and tips would be appreciated! Thank you friends!

2760
 
 
The original post: /r/datahoarder by /u/Forward-Inflation-77 on 2025-03-09 02:02:30.

I am just starting the process of digitizing my family photo albums. I realize this will be a project that will probably take me months if not years. Not really sure how many photos I have to do, guessing easily in the thousands.

I started out using ScanSpeeder, doing 4, 5 or 6 pictures at a time on a Brother 2900 flatbed scanner and saving them as TIFF files, but I didn't realize the free edition only allows a few scans. I don't mind spending the $30 for a 1-year license, but I realize it's possible this may not get done in a year's time. Even doing multiple at a time, it's still time consuming. I know there are photo scanners made specifically for projects like this, but they cost several hundred dollars, and I'm not sure I want to invest that much in a one-time project. I need to look into a service that does this. For those who have used a service, what did it cost? Do places like Walmart do this, or will it take a specialized service? I have also used AutoSplitter, but I liked ScanSpeeder better. Of course, I would have to pay for AutoSplitter as well, and that is a 2-year license vs. the 1 year on ScanSpeeder.

On buying a photo scanner: I have read that it is not a good idea to use the ADF on regular printers to scan photos, because there is a chance it could damage them. Isn't that how photo scanners scan pictures, by feeding them through the machine? Or are photo scanners more delicate than a typical AIO printer?

For the pictures that have writing on the back, how does one go about preserving that? I know I could scan both front and back, but that would make 2 different files. How do you keep track of which one goes with which picture? Would naming the picture with what it says on the back be a good way to go about that? One time-consuming thing about this is that most of the pictures are in sleeves instead of just boxes.
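
One possible approach to the front/back question is a shared filename stem with a suffix for each side, so the two scans sort next to each other and can be matched up later. A minimal Python sketch, assuming a hypothetical naming scheme of `<album>_<number>_front.tif` and `<album>_<number>_back.tif`; the helper names are made up for illustration.

```python
from pathlib import Path

SIDES = ("front", "back")


def scan_name(album, number, side, ext=".tif"):
    """Build a filename such as 'album1_0007_front.tif'."""
    if side not in SIDES:
        raise ValueError("side must be 'front' or 'back'")
    return f"{album}_{number:04d}_{side}{ext}"


def pair_scans(filenames):
    """Group filenames into {stem: {'front': name, 'back': name}}."""
    pairs = {}
    for name in filenames:
        # Split 'album1_0007_front' into the shared stem and the side suffix.
        stem, _, side = Path(name).stem.rpartition("_")
        if side in SIDES:
            pairs.setdefault(stem, {})[side] = name
    return pairs
```

The handwritten text itself could then go into a separate notes file or the image metadata, keyed by the same stem, rather than into the filename.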

2761
 
 
The original post: /r/datahoarder by /u/DougPedersen on 2025-03-08 23:50:48.

I pulled the black plastic case apart and found the 12TB drive. I disconnected the USB interface board, then connected the drive to my Dell desktop, and it worked GREAT!! Any thoughts on whether the USB interface board could have been the only issue? Or maybe the drive is ready to fail again? I tried plugging it in via USB before pulling it apart, and the computer could not even recognize it.

2762
 
 
The original post: /r/datahoarder by /u/jku2017 on 2025-03-08 23:18:23.

Out of 4 drives I got from Amazon, one makes a lot of repeated noises and never initializes. Are IronWolf drives good quality?

2763
 
 
The original post: /r/datahoarder by /u/Alpha_Datura on 2025-03-08 22:03:49.

It doesn't matter if it is Acronis True Image 2015, 2016, or 2021, it seems to sporadically use sector by sector, or it doesn't do it at all. Can anyone help me figure out how to specify it reliably? I usually use the 2016 version, but instructions for any version would be great.

Thank you for your time!

edit: If there is a version that has a checkbox for sector by sector copy, I would like to know which one to get!

2764
 
 
The original post: /r/datahoarder by /u/Warcraft_Fan on 2025-03-08 21:51:53.

Title should have been reworded. Sorry, the title sounded too much like a tech support request and not a valuable information post.

I have 2 of these drives and I noticed some odd behavior with them. If they are connected to my motherboard's SATA ports, they stop working after sleep. They still spin up, but any attempt to access the drive gets a "can't find file at specified location" error.

The board is an Asus Prime X570 Pro. I've tried updating the SATA driver, changing the ports to AHCI, hot swap, etc., and nothing works after I sleep the PC. I'm using Windows 11, up to date.

But when I moved the drives to a Dell H310 (cross-flashed to LSI IT firmware), they always worked fine after sleep. I searched Google and got a few results on MG drives; they seem to not like Asus SATA ports for some reason.

Just passing info if anyone else had issues with MG drives (or any other drives) with Asus motherboard, and you have trouble accessing them after sleeping, get a HBA and use that instead of onboard SATA.

2765
 
 
The original post: /r/datahoarder by /u/PsiNexus on 2025-03-08 21:48:46.

Today was UPS battery swap-out day, and when I powered my system back up, one of my 3 shucked WD drives was no longer detected by my server through my 4-bay USB enclosure. I pulled all 3 drives and put them back into their original WD Easystore enclosures, and again only 2 of the 3 drives were detected (this time by my laptop, not the server). In lsblk, the problem drive reads as 0 GB, and it was not spinning up on connection, whereas the other two drives immediately spun up when I plugged them in via USB to my laptop.

The strange thing is that when I installed the non-functioning drive in my gaming tower it spun up and was immediately detected, with data accessible. A few power cycles confirmed it would keep spinning up. Smartctl does not show any red flags and the short test passes without issue. However, it still does not spin up over USB.

Does anyone have any ideas about what might be going on? The drive is 5 years old, so failure isn't unlikely, but it's confusing that it works in the desktop. I'm not concerned about data loss at this point, I have backups and it's a parity drive for SnapRAID anyways. And to get ahead of things, I agree that USB is not the way, but sometimes it's what you have, can afford, and has been reliable for 7 years.

Thanks for your time!

2766
 
 
The original post: /r/datahoarder by /u/againstmachinations on 2025-03-08 20:35:39.

I downloaded a website using SiteSucker and so it has created a folder with an index html and I can view the website offline just how it is originally.

I'm now wondering if there's a way to search the posts (it's an old blog) for certain keywords that I need?

I tried to install YaCy and DocFetcher, but unfortunately neither works on my iMac (I have an M1). I tried all the configuration and installed Java and other things, but it's simply not working and I've hit a dead end.

I don't want to use grep. Ideally I want the search results to be viewable in the browser as well, or something close to it if at all possible.

I am not a developer and have limited understanding of this - I am just going by chatGPT's help at this point. It suggested I download Recoll but the download instructions seem too complicated.

Wondering if anyone has a suggestion? The threads I've read are from way back (that's where I found out about YaCy and DocFetcher).

Thank you.
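
If a small script is acceptable, a keyword search over the downloaded folder can write its results to an HTML page that opens in the browser. This is a minimal sketch using only the Python standard library (Python 3 is assumed to be installed); the function names and the output filename are hypothetical, and the tag stripping is deliberately crude.

```python
import html
import re
from pathlib import Path


def page_text(path):
    """Read an HTML file and return its visible text, roughly."""
    raw = path.read_text(encoding="utf-8", errors="ignore")
    raw = re.sub(r"(?is)<(script|style).*?</\1>", " ", raw)  # drop scripts and styles
    text = re.sub(r"(?s)<[^>]+>", " ", raw)  # drop remaining tags
    return re.sub(r"\s+", " ", html.unescape(text)).strip()


def search_site(root, keyword, context=80):
    """Return (path, snippet) for every HTML file under root containing keyword."""
    pattern = re.compile(re.escape(keyword), re.IGNORECASE)
    hits = []
    for path in sorted(Path(root).rglob("*.htm*")):
        text = page_text(path)
        match = pattern.search(text)
        if match:
            start = max(0, match.start() - context)
            hits.append((path, text[start:match.end() + context]))
    return hits


def write_results(hits, keyword, out_file="search_results.html"):
    """Write a clickable results page and return its filename."""
    items = []
    for path, snippet in hits:
        link = html.escape(path.resolve().as_uri())
        label = html.escape(path.name)
        items.append(f'<li><a href="{link}">{label}</a>: {html.escape(snippet)}</li>')
    page = (
        f"<html><body><h1>Results for {html.escape(keyword)}</h1>\n<ul>\n"
        + "\n".join(items)
        + "\n</ul></body></html>"
    )
    Path(out_file).write_text(page, encoding="utf-8")
    return out_file
```

Calling `write_results(search_site("path/to/site_folder", "keyword"), "keyword")` would produce `search_results.html`, which can be opened in any browser; each result links back to the saved page.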

2767
 
 
The original post: /r/datahoarder by /u/Lord_Kronos_ on 2025-03-08 19:11:17.

I've recently decided to get another external hard drive, and I've chosen WD (so far). However, I saw that a 2TB My Passport drive is $76, but their 2TB drive for Chromebook is $62. Does anyone know why the one for Chromebook is cheaper? It has good reviews so far, albeit not as many as the My Passport one.

If I can save the $14 and get the 2TB drive for Chromebook then I'd love to, unless there's a reason why it's cheaper.

2768
 
 
The original post: /r/datahoarder by /u/mike12ophone on 2025-03-08 18:47:55.

I'm working towards a 3-2-1 solution, starting with cloud storage. I'm not sure what service(s) to look for to accomplish my goals, or even if I'm on the right path with my goals. I am currently paying for 2x 2TB Google plans that I want to eliminate.

GOALS:

  1. Cloud storage sync for my new PC
  2. Cloud storage (one-time archive) for my old devices (laptop, a couple of SSDs, old phones)
  3. Offload long-term storage from Google Drive (one-time archive) and use Drive only for stuff I need to access or share, allowing me to downgrade my plan.
  4. Periodic snapshots of Google Photos or photos on my phone. Doesn't need to be automated if that requires an additional service and cost. Low priority since I plan to continue using Google Photos until my free space is full.

I'd be grateful for any advice on where I should be looking. Thanks so much!

2769
 
 
The original post: /r/datahoarder by /u/_MMCXII on 2025-03-08 19:06:37.

Hello experts,

I have a DS718+ that I use as a media server. I want to maximize the amount of storage I can cram into this thing before looking at an expansion unit. However, on the compatibility list the largest supported drive is only 16TB.

What are the risks of using larger drives, for example the Seagate IronWolf Pro 24TB, with this unit, since they are not on the compatibility list? Will these drives even work?

Thanks in advance for the advice!

2770
 
 
The original post: /r/datahoarder by /u/SootyFreak666 on 2025-03-08 18:51:31.

Hello, I am looking at storing some important files (likely a few GB). I have a few hard drives and am just wondering what the best solution would be. I saw that hard drives last 5 - 10 years, but I don’t know what that means in terms of actual storage (I occasionally plug them in to transfer stuff). Should I be looking at getting a few more to swap things over and use as backups, or is that pointless?

I am concerned about losing these files and don’t think a cloud-based solution would be right for me (due to the price).
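
Whatever drives end up being used, a checksum manifest makes it possible to detect when a stored copy has silently changed or lost a file. A minimal Python sketch that records SHA-256 checksums for a folder and later reports mismatches; the function names and the manifest layout are hypothetical.

```python
import hashlib
from pathlib import Path


def file_sha256(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files are not loaded into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root):
    """Return {relative_path: sha256} for every file under root."""
    root = Path(root)
    return {
        path.relative_to(root).as_posix(): file_sha256(path)
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }


def verify(root, manifest):
    """Return relative paths that are missing or whose checksum no longer matches."""
    current = build_manifest(root)
    return sorted(name for name, digest in manifest.items() if current.get(name) != digest)
```

The manifest is a plain dictionary, so it can be saved next to the files (for example as JSON) and checked each time a drive is plugged in; a file that fails on one drive can then be restored from another copy.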

2771
 
 
The original post: /r/datahoarder by /u/Sgt_JT_3 on 2025-03-08 17:55:56.

Why can some public key encryption standards, like RSA (Rivest-Shamir-Adleman), be easily compromised while other forms remain robust, even though they are based on the same principle of asymmetric encryption?
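
One concrete reason is key size: RSA is only as strong as the difficulty of factoring its modulus, so a modulus that is too small can be factored and the private key recomputed from the public one. The toy Python sketch below uses deliberately tiny textbook primes (61 and 53) and no padding, purely for illustration; the helper name is hypothetical and nothing here is suitable for real use.

```python
def factor(n):
    """Trial division: feasible only because n is tiny."""
    candidate = 2
    while candidate * candidate <= n:
        if n % candidate == 0:
            return candidate, n // candidate
        candidate += 1
    raise ValueError("n has no non-trivial factors")


# Key generation with toy primes (real keys use primes hundreds of digits long).
p, q = 61, 53
n = p * q                          # public modulus
e = 17                             # public exponent
d = pow(e, -1, (p - 1) * (q - 1))  # private exponent (Python 3.8+)

message = 65
ciphertext = pow(message, e, n)    # encrypt with the public key (n, e)

# An attacker who only knows (n, e) factors n and rebuilds the private exponent.
p_found, q_found = factor(n)
d_recovered = pow(e, -1, (p_found - 1) * (q_found - 1))
recovered = pow(ciphertext, d_recovered, n)
print(recovered)                   # prints 65, the original message
```

The same attack is computationally out of reach when the modulus is large enough, which is why two systems built on the same principle can differ so much in practice.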

2772
 
 
The original post: /r/datahoarder by /u/tomauswustrow on 2025-03-08 17:41:25.

The title says it all. I want to download an old homepage and use it as a template. Is that possible or not?

2773
 
 
The original post: /r/datahoarder by /u/fiftyfourseventeen on 2025-03-08 17:10:26.
2774
 
 
The original post: /r/datahoarder by /u/gerbilbear on 2025-03-08 16:52:06.
2775
 
 
The original post: /r/datahoarder by /u/unlucky-Luke on 2025-03-08 16:50:13.

Hear me out: I also come from the 90s era of HDDs holding a few gigs, and I can clearly remember how out of reach something like a 500 gig HDD was back then.

But it seems to me that it took less time for HDDs to grow in capacity once they reached the 2/4TB stage than it took them to get from megabytes to 1/2TB.

In contrast, SSDs have sat at the sweet spot of 2/3/4TB for quite some time now (at least 5 solid years), but for anything above that the prices don't make sense for regular consumers, and the availability of bigger sizes is scarce to say the least.

Is it the complexity of the technology? Weak demand? High cost of production? I'm genuinely interested to know why we don't have 6/8/10TB SSDs at a relatively affordable $ per gig.

(Not talking about NVMe drives, just SATA SSDs)

EDIT: Just to clarify, I'm not looking for SSDs to replace HDDs. HDDs will still be the "storage" option for sure (I have two 24TB parity drives in my Unraid array, and will go up to 26/30TB in the coming years when they become cheaper). I just want a parallel SSD market with high capacities (8/10/12TB...) at a good cost (I know flash $/TB looks nice right now, but that's deceiving because that price only applies to drives of 4TB and lower). Also, I gave SATA as an example; I don't really care about the connection (obviously it has to be fast).
