It's A Digital Disease!


This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

4101
 
 
The original post: /r/datahoarder by /u/noob404yt on 2025-01-29 05:17:44.

Hey everyone,

I would like to introduce you guys to my new Disk Price comparison website - https://diskprice.compardre.com/

This was inspired by the original disk price website (credited on the website) but was coded from scratch, with some additional features such as:

  • Search
  • Advanced filtering
  • Price history (including daily price trend)
  • Price alerts
  • and more..

You can read more about it at https://diskprice.compardre.com/faq.php

Upcoming features

  • If there is demand, I will add more regions. For now, the US and India are covered.
  • If there is demand, I will add LTO tapes and other media.
  • Please suggest.

Member suggestions

  • Add more e-commerce websites, by u/ykkl
  • Filter by data recording tech (CMR vs SMR) by u/Ben4425

I am looking to promote the website among you data hoarding experts. Kindly check the website out, and let me know if any improvements can be made, as it is still in beta. If you can, please share among friends as well.

Disclaimer: As mentioned in the FAQ, the product links are affiliate links, which means I will earn a small commission when you buy through them, without affecting the price you pay. For that reason, I got permission from the mods of this sub before posting about it.

4102
 
 
The original post: /r/datahoarder by /u/zejimmer on 2025-01-29 04:26:14.

I've seen another thread where one person had these drives in service for 12 years. I have a pair of the 2TB ones coming up on 14 years in the next few weeks. Best damn drives I've ever had: not the fastest, but I've never had others last as solidly in a RAID1 set as these two have. I have considered swapping them out for bigger disks, but I don't have anything on them that isn't backed up elsewhere, so it's now a test to see how many years these things last. They may yet outlive me or the NAS they are in.

Anyone else still rocking these pre Samsung peaks of Korean engineering?

https://preview.redd.it/lkemrhd51vfe1.jpg?width=3072&format=pjpg&auto=webp&s=cf76d004930b7cc6103bc09ec1522f7e70f6d82a

4103
 
 
The original post: /r/datahoarder by /u/Systemlord_FlaUsh on 2025-01-29 03:47:04.

I'm considering whether I can downsize my movie collection with AV1 to save storage space and avoid buying more HDDs. Has anyone done this, especially for 4K rips, and was it worth it for you?

I'm also considering using my 4700U laptop for this; otherwise I only have a 9700X/5700 XT system at the moment, but I guess it will take days, if not forever. The laptop draws just 15 W, so letting it encode 24/7 would be no problem, as it doesn't get hot or loud either. I would not mind if it took months or a year to complete; it would slowly free up storage space, as I would begin with the biggest movies.
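The "biggest movies first" plan can be sketched as a small Python helper that orders the library by file size and builds one ffmpeg command per file. This is only a sketch under assumptions: it presumes an ffmpeg build with the SVT-AV1 encoder (`libsvtav1`), and the CRF and preset values, the extension list, and the `.av1.mkv` output name are illustrative choices, not recommendations from the post.

```python
from pathlib import Path

# Extensions treated as video files (assumption; adjust to the collection).
VIDEO_EXTENSIONS = {".mkv", ".mp4", ".m2ts"}


def largest_first(root: Path) -> list[Path]:
    """Return the video files under root, biggest first."""
    videos = [
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() in VIDEO_EXTENSIONS
    ]
    return sorted(videos, key=lambda p: p.stat().st_size, reverse=True)


def av1_command(source: Path, crf: int = 30, preset: int = 6) -> list[str]:
    """Build an ffmpeg command that re-encodes the video to AV1.

    Audio and subtitle streams are copied unchanged. The crf and preset
    defaults are placeholders to tune against your own quality checks.
    """
    target = source.with_name(source.stem + ".av1.mkv")
    return [
        "ffmpeg", "-n",          # -n: never overwrite an existing output
        "-i", str(source),
        "-map", "0",             # keep every stream from the input
        "-c:v", "libsvtav1",
        "-crf", str(crf),
        "-preset", str(preset),
        "-c:a", "copy",
        "-c:s", "copy",
        str(target),
    ]
```

Each command can be passed to `subprocess.run(cmd, check=True)` in a loop. Keeping the original file until the output has been spot-checked is the safer order of operations.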

4104
 
 
The original post: /r/datahoarder by /u/cman_56 on 2025-01-29 03:23:25.

Hi All.

I am running Windows 10 on an old ASRock H97M with 8GB of DDR3. The PC is mainly used to run a Plex server.

Windows is installed on an SSD and I have 3 additional 4TB hard drives. I just bought a new 8TB hard drive.

After plugging it in, Windows will no longer boot and gives me a SYSTEM THREAD EXCEPTION NOT HANDLED error.

The new drive is an ST8000DM004. I had this same issue last year when I added a 4TB Seagate and took the PC to a nearby computer shop, which fixed it. They didn't tell me what the fix was at the time, but this is the exact same issue.

The drive shows up correctly in the BIOS, but Windows will not recognize it. I've tried turning on the Hot Swap option for that SATA port, booting Windows with the new drive unplugged, and then plugging in the SATA cable once Windows has booted, but it instantly gives me the Blue Screen of Death when I plug it in.

BIOS firmware is up to date.

I’ve also unplugged all drives except the SSD that runs Windows and this new drive: same issue. I had this exact same issue 1yr ago when I added a 4TB drive, which that shop ended up getting working.

BlueScreenView shows the SYSTEM THREAD EXCEPTION NOT HANDLED Bug Check String, Caused by Driver fileinfo.sys, with a Crash Address of ntoskrnl.exe+33bc10.

I'm no expert but at the same time not computer illiterate.

PLEASE HELP ME!!!! Thanks!!!

4105
 
 
The original post: /r/datahoarder by /u/staline123213 on 2025-01-29 02:57:49.

Saw a deal on my local Facebook Marketplace listing a bunch of ADATA SU800 SATA 960GB SSDs with health ranging from 80% to 99%. Would they be safer for storing data than my 2x1TB 2.5-inch HDDs? One is an old Toshiba model and the other is a Seagate OEM drive.

4106
 
 
The original post: /r/datahoarder by /u/Alarmed_Rabbit_494 on 2025-01-29 02:23:38.

Hi, so I'm trying to sort a very, very extensive fanfic library, but the files are all EPUB, and neither Windows nor Mac has an easy out-of-the-box way for me to go through and see the contents without opening them.

I don't even need to read the whole thing; I just need to see the front page so I know what the topic is and can put it in the right folder.

OR another thing that would work is something that reads all of the EPUBs so I can just search keywords and it'll pop up all the fics that mention a specific topic. Apple Finder does not do this out of the box, for my version at least.

I tried Calibre, and it can 100% be user error, but it doesn't easily let me find the file and move it in the source folder. Calibre also wants to duplicate any file I put in it to another folder somewhere on my computer, which is a minor gripe, I know, but I have very little storage for it to be doing that with.

Any help or suggestions would be thoroughly appreciated. Thank you for the read.
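One possible approach to the keyword search, sketched with only the Python standard library: an EPUB is a ZIP archive whose `META-INF/container.xml` points at a package file holding Dublin Core metadata (title, creator, description, subjects). The function names here are hypothetical, and the sketch searches metadata only, not the full story text. It reads the files in place and copies nothing.

```python
import zipfile
import xml.etree.ElementTree as ET
from pathlib import Path

CONTAINER_NS = {"c": "urn:oasis:names:tc:opendocument:xmlns:container"}
DC_NS = {"dc": "http://purl.org/dc/elements/1.1/"}


def epub_metadata(path) -> dict:
    """Read title, creator, description, and subjects without unpacking."""
    with zipfile.ZipFile(path) as book:
        container = ET.fromstring(book.read("META-INF/container.xml"))
        rootfile = container.find("c:rootfiles/c:rootfile", CONTAINER_NS)
        package = ET.fromstring(book.read(rootfile.attrib["full-path"]))
    fields = {}
    for name in ("title", "creator", "description"):
        node = package.find(f".//dc:{name}", DC_NS)
        fields[name] = node.text.strip() if node is not None and node.text else ""
    subjects = package.findall(".//dc:subject", DC_NS)
    fields["subjects"] = ", ".join(s.text.strip() for s in subjects if s.text)
    return fields


def search_library(folder, keyword: str) -> list:
    """Return EPUB paths whose metadata mentions keyword (case-insensitive)."""
    keyword = keyword.lower()
    hits = []
    for path in sorted(Path(folder).rglob("*.epub")):
        try:
            meta = epub_metadata(path)
        except (zipfile.BadZipFile, KeyError, ET.ParseError, AttributeError):
            continue  # skip files that are not well-formed EPUBs
        if keyword in " ".join(meta.values()).lower():
            hits.append(path)
    return hits
```

Calling `search_library("fics", "some topic")` returns the matching paths, which can then be moved into the right folder by hand or with `shutil.move`.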

4107
 
 
The original post: /r/datahoarder by /u/redcorerobot on 2025-01-29 01:55:49.

So far I keep the standard kind of thing: AI models, Linux ISOs, music, TV, books, that sort of thing. But I'm starting to consider keeping an actual database, which I would fill with stuff like statistics, material properties, or interesting numerical data. So I was wondering if anyone here has done something like that: just collecting and storing data in raw format.

4108
 
 
The original post: /r/datahoarder by /u/Gatecrasher3 on 2025-01-29 01:48:34.

Hi all, I'm looking for suggestions on cases that have 6+ drive bays. This will be used for my NAS. It doesn't need to be hot-swappable, nothing fancy; I just need the space for the ATX CPU/mobo and the six or more drives.

The only one I can think of is the Corsair 900d but they don't make that anymore.

4109
 
 
The original post: /r/datahoarder by /u/iEatAppIes3465 on 2025-01-29 01:24:59.
4110
 
 
The original post: /r/datahoarder by /u/Pasta-hobo on 2025-01-28 23:19:02.

For anyone not in the know, about a week ago a small Chinese startup released some fully open source AI models that are just as good as ChatGPT's high end stuff, completely FOSS, and able to run on lower end hardware, not needing hundreds of high end GPUs for the big kahuna. They also did it for an astonishingly low price, or... so I'm told, at least.

So, yeah, the AI bubble might have popped. And there's a decent chance that the US government is going to try and protect its private business interests.

I'd highly recommend that everyone interested in the FOSS movement archive the DeepSeek models as fast as possible, especially the 671B parameter model, which is about 400 GB. That way, even if the US bans the company, there will still be copies and forks going around, and AI will no longer be a trade secret.

Edit: adding links to get you guys started. But I'm sure there's more.

https://github.com/deepseek-ai

https://huggingface.co/deepseek-ai
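For archives of this size, one way to confirm that copies stay intact over time is a checksum manifest. The following is a minimal sketch, assuming the model files have already been downloaded into a local folder; the function names are illustrative and are not part of any DeepSeek or Hugging Face tooling.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so multi-gigabyte weights never sit in RAM."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> dict[str, str]:
    """Map each file's path (relative to root) to its SHA-256 digest."""
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def changed_files(root: Path, manifest: dict[str, str]) -> list[str]:
    """List manifest entries that are now missing or hash differently."""
    current = build_manifest(root)
    return sorted(
        name for name, digest in manifest.items()
        if current.get(name) != digest
    )
```

Saving the manifest with `json.dump` next to the archive lets anyone holding a copy re-run `changed_files` later and spot corrupted or incomplete files.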

4111
 
 
The original post: /r/datahoarder by /u/Imaginary_Condition3 on 2025-01-28 23:16:55.
4112
 
 
The original post: /r/datahoarder by /u/7and7is on 2025-01-28 20:31:01.

Does anyone know how to save magazine issues from the Issuu site so that they aren't at the mercy of that site going down?

4113
 
 
The original post: /r/datahoarder by /u/WitherBoss on 2025-01-28 19:59:41.

As someone who probably has a LOT more bookmarks saved than I should have, a lot of the solutions I've come across for extracting Twitter bookmarks are either outdated or don't overcome my main issue. While I am able to save some of my bookmarks, after about 15k bookmarked tweets the page artificially stops you from seeing more. A few more tweets are loaded into the HTML but they aren't displayed.

However, if you unbookmark a few tweets or even just manually delete some earlier ones from the HTML, then the page will continue to load more of your bookmarks, so it's not a hard wall. I've tested some JavaScript to try and "delete" tweets to push up some of my older bookmarks, but I've consistently been getting memory-related errors at around the 1500 mark and have been having trouble getting around them. The optimal solution would be to save and unbookmark them at the same time, but that runs the risk of the code messing up and me losing some chunks of the bookmarks.

The main point is, does anyone have a solution to this? Either a working bookmark scraper post-API change or a way that I can get around the memory issues.

4114
 
 
The original post: /r/datahoarder by /u/almondicecream on 2025-01-28 19:36:20.

Yah, SPD/LTT f*'d the pooch and so I'm going to give these guys a try. I know this was a SPD lovenest and I have half a PB from them but we gotta find and support new vendors. Seeing prices of $11-$12/TB.

4115
 
 
The original post: /r/datahoarder by /u/emperornorton415 on 2025-01-28 19:26:12.

I have a WD Passport that has decided to not work anymore. I have had this issue before with a Seagate external HD and used a Wavlink USB 3.0 Single Bay Docking Station to transfer data from my old external to my new one through my laptop.

The WD Passport HD does not fit in the docking station that I used before. What sort of docking station will I need to extract data from the Passport?

4116
 
 
The original post: /r/datahoarder by /u/Charming_Mix2937 on 2025-01-28 19:24:26.

Hi all,

I'm looking for a bit of advice on my planned home server build. My use case is primarily as a media storage server running Jellyfin and the arr stack (Radarr, Sonarr, etc.). I’d also like to future-proof the setup a bit and have the flexibility to spin up game servers for whatever is popular at the time.

Proposed Build Spec

  • CPU: Intel i7-12700K
  • RAM: 64GB DDR5
  • Motherboard: ASRock Z790 Pro RS ATX LGA1700
  • Cache Drive: 970 Evo 1TB NVMe x2
  • Storage: 4 x 12TB Seagate IronWolf Pro NAS drives
  • PSU: Corsair (not finalised yet but plenty of wattage headroom)
  • Case: Fractal Design Define R5

I’m planning to run Ubuntu, which I’m comfortable with, and manage the filesystem with SnapRAID + MergerFS. The idea is to cache as much as I can to reduce random reads and increase performance. This build has room for expansion in the future, and with SnapRAID, I’ll have 1-disk redundancy to protect against data loss.
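As a quick check on what this layout yields, the usable space under SnapRAID is the sum of the data disks, since each parity disk is set aside whole and must be at least as large as the largest data disk. A minimal sketch of that arithmetic follows; the function names are illustrative, and the figures ignore filesystem overhead.

```python
def usable_capacity_tb(drive_sizes_tb, parity_drives=1):
    """Sum of the data drives after reserving the largest drives for parity."""
    sizes = sorted(drive_sizes_tb, reverse=True)
    if parity_drives < 0 or parity_drives >= len(sizes):
        raise ValueError("need at least one data drive besides parity")
    return sum(sizes[parity_drives:])


def tb_to_tib(terabytes):
    """Convert decimal terabytes (10**12 bytes) to tebibytes (2**40 bytes)."""
    return terabytes * 10**12 / 2**40


# The proposed build: four 12TB drives, one of them used for parity.
data_tb = usable_capacity_tb([12, 12, 12, 12], parity_drives=1)
print(data_tb)                       # 36
print(round(tb_to_tib(data_tb), 1))  # 32.7
```

The second figure is closer to what the operating system will report, because drive vendors quote decimal terabytes.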

The Question

As I’m already spending a decent amount of money, I’m wondering if it would be worth investing a little more and going for a Xeon + ECC RAM setup with ZFS as the file system instead? Or would my current planned build be sufficient for my needs?

I like the flexibility and simplicity of SnapRAID + MergerFS, but I also recognise that ZFS offers strong data integrity features with built-in redundancy. My concern is whether the performance and reliability benefits of a Xeon/ZFS setup would justify the extra cost, given that this is for a home server.

TL;DR

Home media + game server with Jellyfin, arr stack, SnapRAID + MergerFS on Ubuntu. Proposed i7-12700K build looks solid, but should I instead consider Xeon + ECC RAM + ZFS for better reliability? Looking for advice on whether the upgrade is worth it!

Thanks in advance for any input!

4117
 
 
The original post: /r/datahoarder by /u/Mortimer452 on 2025-01-28 18:50:20.

So, in my barn/workshop I have a guesstimate of 5,000-10,000 photos from me & my family's childhood. These have been collected over the past decade or so from family members passing away. 50ish years worth of photos ranging from 1950-2000 or so.

The vast majority are typical 4x6 prints still inside the 1-hour photo envelopes, and most have the negatives in there as well. There are also maybe a dozen or so large, well-organized 3-ring photo albums. For some of those I might just pull the photos out; others have captions and such written underneath each picture, so I might try to set up a little photo booth and take a high-quality photo of each page.

I think there might also be a couple boxes of 35mm slides.

I've heard good things about the Epson FastFoto for scanning, is this still a pretty good option?

Also looking for advice on a photo management solution. Ideally I'd like something 100% self-hosted that can do object and face recognition and tag the photos for future searching. I give it a pile of photos, it tags objects like car/dog/bike/baby and gives me a list of detected faces, which I can assign names to, and then easily pull up all photos of that person.

I've used PhotoPrism for this in the past and it's decent, but wonder if there's anything better?

4118
 
 
The original post: /r/datahoarder by /u/LibraryComplex on 2025-01-28 18:50:02.

So I work with LLMs and ML models, and my machine will run out of space in a while, so I was thinking of buying external storage. My main aim is to store my ML models there and use them when I need to (basically daily) by plugging in the flash drive.

I've read these can be unreliable and are meant only for data transfer.

I likely won't be storing anything irreplaceable; if I do have something like that, I would likely store a copy on Google Drive as well. Would you say this is fine for me, or should I look into some other form of storage?

4119
 
 
The original post: /r/datahoarder by /u/52-61-64-75 on 2025-01-28 18:39:15.

On the topic of US gov sites removing data, does anyone know if there are backups of these? I've looked a bit and haven't found anything. I'd consider scraping them myself, but I don't really have the infrastructure to do it and I don't have much experience with web scraping, nor would I be able to get to it for a while as I have exams.
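Before scraping anything, it may be worth checking whether the Internet Archive already holds a snapshot. The sketch below uses the Wayback Machine availability endpoint (`https://archive.org/wayback/available`); the function names are hypothetical, and the response fields read here (`archived_snapshots`, `closest`, `available`, `url`) reflect the JSON shape that endpoint is understood to return, so treat the parsing as an assumption to verify.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_url(page_url: str) -> str:
    """Build the query URL for one page."""
    return f"{AVAILABILITY_ENDPOINT}?{urlencode({'url': page_url})}"


def closest_snapshot(payload: dict):
    """Return the archived URL from a response payload, or None."""
    closest = payload.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest["url"]
    return None


def lookup(page_url: str, timeout: float = 10.0):
    """Ask the Wayback Machine whether it holds a snapshot of page_url."""
    with urlopen(availability_url(page_url), timeout=timeout) as response:
        return closest_snapshot(json.load(response))
```

Running `lookup` over a list of page addresses gives a rough picture of which ones still lack a snapshot and would be worth saving first.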

4120
 
 
The original post: /r/datahoarder by /u/Legitimate_Pea_143 on 2025-01-28 18:23:34.

I'm using JDownloader to download files from bunkr. I thought all you had to do was copy the link into LinkGrabber and select download, and it would automatically start downloading the files, but it's not doing that for me. It seems to be trying to download all the files at once, so I'm getting a "too many connections" error. I have to stop the download process, then right-click each file individually and select force download/start, and that works. I'm guessing people here have a lot more experience with JDownloader than I do, so can someone help me out? I am mostly using it for downloading bunkr albums. I have also tried a dedicated Python bunkr downloader script, specifically Cyberdrop.dl, but it errors out 99.999% of the time and just doesn't seem to work at all.

4121
 
 
The original post: /r/datahoarder by /u/Rocas21 on 2025-01-28 16:56:04.

Like most people, I’ve been using Google Drive for years to store my files. It’s convenient, but recently, I’ve started to feel uneasy about it. With AI becoming more powerful and data being such a valuable commodity, it’s hard not to wonder:

• Who really owns my files?

• How is my data being used behind the scenes?

• What happens if there’s a breach, or the platform decides to change its terms?

The more I think about it, the more I realize that centralized platforms like Google Drive and Dropbox have too much control over something that should belong to me—and only me.

That’s why I’m building Vaulted. It’s a simple, decentralized cloud storage platform designed to give you:

• Full ownership of your data leveraging Cere Network —no tracking, no ads, and no third-party access.

• True privacy—everything is encrypted and stored on a decentralized network.

• A clean, easy-to-use interface—just upload your files and know they’re safe.

Right now, Vaulted is still in its early stages. The initial version is simple: you can upload and store files privately, securely, and completely under your control. It’s not fancy, but it solves the problem of data misuse and centralization.

https://tally.so/r/mKW17g

I’d love to know what you think. If you’ve ever felt frustrated with how your data is handled by big tech,

As a thank-you, you’ll get $10 in free storage credit when we launch if you fill out the form.

4122
 
 
The original post: /r/datahoarder by /u/Practicing_Stoic_28 on 2025-01-28 14:38:20.

There have been multiple answers in the past, but none seems to work. Please help me with the best possible alternatives.

4123
 
 
The original post: /r/datahoarder by /u/Harmacist88 on 2025-01-28 13:48:39.

I've been using a 6TB WD Black 7200 RPM HDD for the past 5 years. I mainly use it for long term media storage (file downloads, documents, music, movies, photos, gameplay recordings, etc.) as well as temp storage for games I'm not actively playing. I recently put together a new rig and threw the old drive in there and the thunking noise of the head parking every few seconds is driving me crazy. I don't think anything is wrong with it; I ran the WD SMART diagnostic thing with no issues and this drive has always been known to be noisy. Maybe it's because I finally have the PC at ear level again, maybe it's because this new chassis doesn't dampen noise as well as my old one, maybe it's because the rest of my components are now pretty much silent, maybe I just tolerate it less as I've grown older, I don't know, but I can hear the thing from across the living room and it's very grating.

I'd like to replace it with something comparable that isn't quite as noisy. I understand that it is a performance drive so a quieter replacement may be less performant, but I'm willing to make that tradeoff. Prior to this drive, I used a 2TB 7200RPM Seagate Barracuda and I found the noise profile on that one to be tolerable; is there something similar in the 6-8 TB capacity range? I'm not too familiar with NAS drives or whether they fit my use case. Also not really sure whether I truly benefit from a performance drive--it might be a holdover from older builds where it was more common to actually run games off the HDD.

I've looked into SATA SSDs like the 8TB Samsung QVO but they're still really expensive, and--maybe an outdated view--I still worry about their longevity vs. HDDs.

Any recommendations?

4124
 
 
The original post: /r/datahoarder by /u/Dramatic-Pepper-8332 on 2025-01-28 13:38:10.

Hello!

I need help locating Windows XP drivers, or really any drivers I can get my hands on, for an Exabyte 8200 tape drive. Currently, I have the drive hooked up via SCSI to a 32-bit Windows XP machine. The machine sees the drive but shows a question mark next to it. Any help would be greatly appreciated.

4125
 
 
The original post: /r/datahoarder by /u/LibOverlord on 2025-01-28 07:28:46.

Does anyone have any suggestions on the best way to get a screenshot of the full YouTube page where it shows the Creative Commons license (for those videos that are CC licensed)? You have to click "more" before it shows up. There are some CC-licensed channels that I want to download, but I want to save an image that shows the license as well.
