It's A Digital Disease!

This is a sub that aims to bring data hoarders together to share their passion with like-minded people.

4751
 
 
The original post: /r/datahoarder by /u/clav1970 on 2025-01-07 01:52:59.

I'm a PhD student and digitize textbooks to use during classes. I make PDFs so I can read and use them electronically. I also like to spiral-bind them so they lay flat, etc.

I'm trying to use ScanTailor Advanced, but am bogged down trying to install it. Apparently, on Windows I have to build the program myself, and I don't really have time. Does anyone have or know of a good tutorial on how to build this program, or an alternative free or low-cost program?

My goal is to have a PDF with a small file size but good enough quality to read and use. My current process is to scan at 300 dpi, then compress, OCR, and use. File sizes are decent but not great, and quality does get much worse.

Any suggestions?

Dennis

4752
 
 
The original post: /r/datahoarder by /u/GTRacer1972 on 2025-01-07 01:47:36.

I have a symmetrical 1 Gbps up/down connection. With iDrive running, it drops to about 700 down and 800 up. I just paused it, waited a few minutes, and ran the speed test again, and it's now showing 500 down and 700 up. Is this thing just holding that much network bandwidth in reserve while it's paused? If it is, there's no point in using pause.

4753
 
 
The original post: /r/datahoarder by /u/DepartmentFun5626 on 2025-01-07 01:15:27.

I have an old Seagate GoFlex Home NAS drive that got factory reset. Is there any way I can access the data, or am I out of luck? I contacted Seagate support and they said that, as remote access is not supported on the device, it cannot be accessed anymore??. I tried to ping it, but the device does not show up in the arp -a IP list in the DOS CMD window. I tried to connect via USB, but it does not show up in Windows Explorer.

Please offer advice if any of you have gone through this before....

4754
 
 
The original post: /r/datahoarder by /u/Comfortable_Toe606 on 2025-01-07 01:04:19.

I'm partially complaining (thanks for listening) and partially asking for advice.

My goal is to have a stable data server that can also be used for Plex and various miscellaneous temporary projects (things that catch my interest and probably would be isolated in Docker).

  1. Must be Linux.
  2. I don't care about graphics; the server will be headless most of the time.
  3. Must serve data via wifi.
  4. Must have data protection (ie. RAID - probably R5:4+1).
  5. Must be able to host at least 40 TiB.
  6. Must support Plex and/or Jellyfin.
  7. Must be stable.
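
Requirements 4 and 5 interact, so here is a minimal sketch of the sizing arithmetic, assuming a single-parity RAID 5 layout (4 data + 1 parity) where one drive's worth of space goes to parity. The function names are illustrative and not taken from any RAID tool, and filesystem overhead is ignored.

```python
TIB = 2**40   # bytes in one tebibyte (binary unit)
TB = 10**12   # bytes in one terabyte (the unit on drive labels)


def raid5_usable(drive_count: int, drive_size: float) -> float:
    """Usable capacity of a RAID 5 array, in the same unit as drive_size."""
    if drive_count < 3:
        raise ValueError("RAID 5 needs at least 3 drives")
    # One drive's worth of capacity is consumed by parity.
    return (drive_count - 1) * drive_size


def min_drive_size(target: float, drive_count: int) -> float:
    """Smallest per-drive size that reaches the target usable capacity."""
    if drive_count < 3:
        raise ValueError("RAID 5 needs at least 3 drives")
    return target / (drive_count - 1)


def tib_to_tb(tib: float) -> float:
    """Convert tebibytes to the decimal terabytes used on drive labels."""
    return tib * TIB / TB


per_drive_tib = min_drive_size(40, 5)      # 10.0 TiB per drive
per_drive_tb = tib_to_tb(per_drive_tib)    # just under 11 TB per drive
print(per_drive_tib, round(per_drive_tb, 2))
```

Under these assumptions, five drives labeled 10 TB would fall slightly short of 40 TiB usable, so the next common size up (12 TB) would be the practical minimum.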

I bought a TerraMaster D5-300 (USB 3.1) and stuck some miscellaneous drives I had laying around in it. I didn't try to do RAID or anything. I've just been trying to use it as JBOD. I connected it to an M1 MacBook Pro via Thunderbolt/USB4, but the TerraMaster had a bad habit of disconnecting, requiring a power reset to get it to reconnect. After much troubleshooting, Googling, and hair-pulling, I wiped my rather nice work Dell laptop of its Windows OS and put Linux on it. The TerraMaster, still connected via USB, has the same issue of dropping offline from the Linux box as it did with the macOS box. Sometimes it will go two or three days without issue and sometimes it will happen every few minutes. I have tried everything I can find online to fix it, but to no avail. I've fiddled with power settings, esoteric device driver settings, and everything else I can find online. I've swapped out the USB cable as well. When it's working, it works great.

So I've decided to just buy a better solution. I've been thinking about a NUC or Beelink along with some sort of external 5-bay enclosure, using Linux and software RAID. About 32 GB of RAM to give me plenty of headroom (32, 33... whatever it takes (it's a movie reference, people, please don't educate me on memory increments)). At first I was thinking of using eSATA for connectivity but now I'm hearing that eSATA is a bit archaic. I'm a little gun shy about trying to use USB because of my current problems. A NAS doesn't fit the bill because serving data is only part of the use case. I am not a Plex guru, and have no idea (and frankly, don't care enough to learn) whether I'm transcoding, transferring, or Trans-Am'ing; I just figure an Intel i7 or better CPU should do whatever Plex needs to do. I've got a long way to go and a short time to get there.

The whole process is getting to be too complicated and frustrating for what I'm trying to accomplish. I just want a stable, Linux-based Plex server that also has enough oomph to let me fiddle about with non-Plex projects as well. I feel like I'm missing something obvious here. The Linux and Plex part should be simple enough, but before I spend $1K+ on something I want to make sure it's going to work. I already feel like I've wasted my money on the TerraMaster.

Can somebody please throw me a lifeline here and give me a nudge in the right direction? Then I can take my commandeered work laptop and return it to its rightful duty of crushing my soul.

Thank you!

4755
 
 
The original post: /r/datahoarder by /u/RealityOk9823 on 2025-01-07 00:53:57.

Hi all,

I guess the real answer to this question is "plug it in and see", but wanted to see if anyone had any experiences with a Dell Perc H310 board in IT mode paired with an X99 board, like a Qiyida or Machinist branded one.

I tried using it in an old Dell just to see if the SAS drives I picked up worked. It worked, but took 5+ minutes to boot and the drives didn't show up in CrystalDiskInfo (but did show up in Explorer). Thinking the X99 board would probably handle things better. Also planning to strap a fan to that heatsink since it gets so hot.

4756
 
 
The original post: /r/datahoarder by /u/g76n on 2025-01-07 00:13:08.

Hi guys, I just got my new PC case recently and soon realized it couldn't fit the 2 HDDs I use on a daily basis. I have others as backups and don't mind keeping those out, but I really like to use these two, and I'm trying to find a way to fit the extra one. The case is the Mars Gaming MC-3TCORE.

My currently installed drive is behind that white tray. There's a little more space behind it where the cables are, but the extra drive would also have to be mounted horizontally like that one.

Thanks

https://preview.redd.it/mrev17p5sgbe1.jpg?width=1695&format=pjpg&auto=webp&s=d267eac2b21b35e1e29b022ba0d34ba2d45ea031

4757
 
 
The original post: /r/datahoarder by /u/Colbybrickwell on 2025-01-06 20:31:13.

I currently have a ThunderBay 8 with five 20TB Seagate Exos drives inside running RAID 5. Every time I start the RAID back up, SoftRAID gives me a message saying that one or more disks are missing, that the volume is out of sync, and that it needs to rebuild. It always rebuilds and is totally fine after, but it definitely feels like that shouldn’t happen every time lol. Anyone know if it’s because not all 8 of the slots are filled up?

4758
 
 
The original post: /r/datahoarder by /u/Monfitis on 2025-01-06 19:31:33.

I'm very interested in archiving certain Instagram accounts through scripts, for example using gallery-dl, but I have not been able to find good scripts for it, especially because none of them keep highlights or organize the output.

I'm looking for a script that downloads all posts, reels, tagged posts, and highlights from specific Instagram accounts and keeps them organized in folders.

I'm not asking for someone to make a script for me, just wondering if anyone has one to share with me, as this is a datahoarder subreddit.

Thanks for listening!

4759
 
 
The original post: /r/datahoarder by /u/toxicfae on 2025-01-06 18:21:14.

I have quite a few photos I want to digitize. I'm a student so I have access to a couple of scanners via my school:

Epson Perfection v30 & v37

Epson Expression 10000XL

Epson WorkForce DS-50000

HP Scanjet 5590

Epson V600 Photo Scanner

Which of these would be best to use? They're regular-size photos printed on glossy paper. I'm inclined to think the photo scanner is best just because it's called a photo scanner, but I don't know anything. Please don't tell me to buy a scanner haha I don't have the money and there's not enough photos to justify that. Thank you in advance, happy hoarding :)

4760
 
 
The original post: /r/datahoarder by /u/Ms4sman on 2025-01-06 17:28:57.

UPDATE: I did hear back from them! They did issue me a return label and will take the two drives and replace them! So that's good. Hopefully the replacements are better!

Hi all,

After seeing a lot of buzz lately about recertified drives from places like ServerPartDeals and GoHardDrive, I decided to give GoHardDrive a shot (mostly just because they had the type of drive and size I wanted in stock and ServerPartDeals didn't at the moment). I read a lot on Reddit about both beforehand, and while I do see some negativity about GoHardDrive, it seemed like most people who had actually shopped with them in the past were very satisfied, and even when they had issues, their support/warranty took good care of them.

So my drives arrived on Friday the 3rd. I got 2 Seagate Enterprise 12TB drives. I open them up and look them over. They had a couple of cosmetic scratches but overall very good. Connectors didn't show any real visible wear.

I put them into my unRAID server. Now, I got in a hurry and I should have definitely run the extended SMART tests on them very first thing but I didn't. That's on me. But anyway, I put the first one into the parity spot and let unRAID start rebuilding the parity onto it. This took about a day and no errors in the process. Seems good so far.

The second one I had planned to put into the array. So I add it to the array and unRAID starts its clearing process of zeroing the entire thing. By the end of zeroing, I had 2 UDMA CRC errors (which could be unrelated; my understanding is that these are more an issue with a bad cable or connector) and 8 reallocated sectors, 8 pending sectors, and 8 offline uncorrectable sectors. Not a good sign less than 48 hours in. So I run the extended SMART test at this point and run it on the other one too, in the parity slot. The test finished on the array drive without finding any new issues on top of the ones already listed, but that's concerning enough. The parity disk, though, ended up with 32 reallocated sectors at the end of the test.
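
The distinction drawn above (CRC errors point at the cable or connector, while reallocated, pending, and uncorrectable sectors point at the disk itself) can be sketched as a small triage helper. This is an illustration only: the attribute names are the ones smartctl commonly prints, the counts are the ones reported in this post, and the rule that any non-zero raw count deserves attention is an assumption rather than a vendor threshold.

```python
# Attributes that usually indicate problems with the disk surface itself.
MEDIA_ATTRS = (
    "Reallocated_Sector_Ct",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
)
# Attributes that usually indicate problems on the link (cable, connector, backplane).
LINK_ATTRS = ("UDMA_CRC_Error_Count",)


def triage(raw: dict[str, int]) -> list[str]:
    """Return one finding per non-zero attribute, grouped by likely cause."""
    findings = []
    for name in MEDIA_ATTRS:
        if raw.get(name, 0) > 0:
            findings.append(f"{name}={raw[name]}: media problem, suspect the drive")
    for name in LINK_ATTRS:
        if raw.get(name, 0) > 0:
            findings.append(f"{name}={raw[name]}: link problem, check the cable first")
    return findings


# Raw counts as described in the post (hypothetical dict layout).
array_drive = {
    "UDMA_CRC_Error_Count": 2,
    "Reallocated_Sector_Ct": 8,
    "Current_Pending_Sector": 8,
    "Offline_Uncorrectable": 8,
}
parity_drive = {"Reallocated_Sector_Ct": 32}

for label, counters in (("array", array_drive), ("parity", parity_drive)):
    for finding in triage(counters):
        print(label, finding)
```

Both drives produce at least one media finding here, which matches the decision in the post to send both back rather than only swapping the cable.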

Now, I will grant you, I should have run the SMART tests first thing. But luckily, I hadn't loaded any data onto the array drive yet so nothing to be lost there. And the parity drive can also be removed and I can go back to my old one in the meantime while I get this sorted out. So no data lost thankfully. The drives aren't outright failed either. But to be getting reallocated sectors almost immediately after installing doesn't bode well. I have drives in this server that are 7 years old without a single reallocated sector or any other SMART errors either for that matter.

I know others on here have had good luck with their RMA/Warranty service taking good care of them. Seems like most people have been given a shipping label and everything. I followed the info on their website though and filled out the RMA form and just got an RMA document to print and put in the package. Also, I filled out the RMA when only one of them was getting errors (should have waited, I know). So now I need to probably send both back, and my RMA only accounts for one. I tried to reach out to the RMA email address, but have not heard back yet (admittedly I emailed on a Sunday and it's only Monday morning, but some people here seemed to even get a response back on a Sunday!).

I'm a bit gun-shy now about trying more drives from them. Seems like most have had overwhelmingly good results with them, but it seems odd that I'd get TWO duds in my first order. I don't think I did anything to them to cause issues...like I said I have other drives in the same server over 7 years old without any problems.

4761
 
 
The original post: /r/datahoarder by /u/CherubimHD on 2025-01-06 16:21:04.

Just saw a post here showing that the cost per TB has been rapidly decreasing, and several comments pointed out that one can get drives for as low as $6/TB. I’m wondering where you actually get drives that cheap? Here in the UK you pay £163 for an IronWolf 8TB. That’s ~£20/TB, or about $25/TB.

Am I just looking in the wrong places?
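
For comparing listings like this, a minimal sketch of the cost-per-TB arithmetic. The exchange rate of 1.25 USD per GBP is an assumption inferred from the figures in the post, not a quoted rate, and the function name is illustrative.

```python
USD_PER_GBP = 1.25  # assumed rate, implied by "~20 GBP/TB = 25 USD/TB" in the post


def cost_per_tb(price: float, capacity_tb: float) -> float:
    """Price divided by capacity, in the currency of `price`."""
    if capacity_tb <= 0:
        raise ValueError("capacity must be positive")
    return price / capacity_tb


gbp_per_tb = cost_per_tb(163, 8)          # 20.375 GBP/TB for the 8TB IronWolf
usd_per_tb = gbp_per_tb * USD_PER_GBP     # about 25 USD/TB
ratio = usd_per_tb / 6                    # how far above the quoted 6 USD/TB deals

print(f"{gbp_per_tb:.2f} GBP/TB, {usd_per_tb:.2f} USD/TB, {ratio:.1f}x the 6 USD/TB figure")
```

On these numbers the UK listing is roughly four times the per-TB price mentioned in the comments, so the gap is too large to be explained by the exchange rate alone.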

4762
 
 
The original post: /r/datahoarder by /u/TheMaxClyde on 2025-01-06 16:12:17.

I have a bunch of external hard disk drives lying around that are getting filled up. Maybe I'm a data hoarder at heart.

I need 2 TB of storage to store a bunch of files long-term. I also own a PS5 (seems to be something some people mention here and there)

Anyway, I was going to buy a SanDisk Extreme Portable SSD but I found a few posts advising against it:

So I figured I'd get an "SSD with an enclosure" as a lot of people here suggest. I know "all digital media can fail" but I don't feel like paying more for Google/OneDrive storage, which would be more of a subscription than a buy-it-for-life thing.

I've found a bunch of SSDs "with heat sink" for PS5 and thought I'd get one and put it in an enclosure, if that's possible. I don't know.

Here they are:

I also found this but it's kinda out of budget: https://www.amazon.ae/Portable-Resistant-Photographers-Compatible-MU-PE4T0S/dp/B0BXBSYTZR

My budget is around $280.

4763
 
 
The original post: /r/datahoarder by /u/Demony83 on 2025-01-06 22:45:02.

4764
 
 
The original post: /r/datahoarder by /u/Darth_Stig on 2025-01-06 19:30:54.

I need raw storage, like a LOT of raw storage; possibly over 100 TB for all the videos I have. Right now, my current build is a custom Corsair 900D (look up the size) with a bunch of drives (likely 15+ HDDs) underneath my computer, but it gets flipping hot in the summertime and I'm kind of over it. I plan on consolidating onto a few large-capacity HDDs to reduce the count, and I'll likely need around 5 (could be fine at 4). When my wife wants to bring up videos of our kids, or I grab my laptop to work instead of going up to my office, pulling data off my rig is super slow. This might be caused by a slower router or a distance issue, since the router is fairly far away from my office. Regardless, putting something closer, either wired into the router or at least more central and wireless, is probably a better way to access all these HDDs.

I saw an old thread on here where a guy just built his own "mini server" and I'm thinking of doing the same if there are benefits beyond just having another computer in the house. Apart from the brand name recognition and their software being pretty good, does anything extra come from getting a NAS-specific device like a Synology? If I build a computer, do I just run Windows and use the kind of junky network stuff built into Windows Explorer? Is it just as reliable/fast? Can I get away with lowish RAM and a mediocre processor?

4765
 
 
The original post: /r/datahoarder by /u/tiandongchaser on 2025-01-06 18:49:39.

Looking to pull the trigger soon on a storage build at home so it can't be super loud. It seems to be the general consensus that WD Red Plus is the favourite over IronWolf Pros in terms of reliability and noise, but the Red Plus looks to have a significant premium attached. I am looking to build a 4-5 drive system.

For example, Red Plus 12TB cheapest available is £244:

https://www.idealo.co.uk/compare/201101209/western-digital-red-sata-iii-12tb-wd120efbx.html

The same site has IronWolf Pro 12TB for £144:

https://robertelectronics.co.uk/products/seagate-ironwolf-12tb-st12000vn0008-nas-hard-drive-3-5-7200rpm-256mb-cache-cmr?_pos=7&_fid=90950176a&_ss=c

Meanwhile, and even on Amazon, you can get a 12TB IronWolf Pro for £200 or a 14TB for £199:

https://www.amazon.co.uk/Seagate-IronWolf-12TB-SATA-Drive/dp/B07LH55GC2

https://www.amazon.co.uk/Seagate-IronWolf-7200RPM-256MB-3-5-Inch/dp/B07H7CKYGT

Long story short, given it seems to be £50-100+ difference per drive, is there any reason to still go with the WD Red Plus, or should I just hope the noise isn't too bad? Lots of people seem to complain about it, but would you be able to hear it over, say, a TV?

4766
 
 
The original post: /r/datahoarder by /u/FlavioLikesToDrum on 2025-01-06 16:40:17.

NAS or similar? What type of storage to pair with a mini pc N100?

Following up on this thread https://www.reddit.com/r/homelab/s/88Z7AO9fpj

What would be the best solution for someone who has an N100 mini homelab with only USB ports and wants an expandable storage pool? There seems to be agreement that USB DAS units are not great (or at least some animosity towards them; the point here is that I want to ask about alternatives where USB 3 and network are the only connections), so what is the alternative? A NAS? If yes, what would be a good choice for someone whose N100 runs Plex and other media services and barely breaks a sweat, so the NAS would just be serving files? Or maybe some other solution? What would you suggest?

4767
 
 
The original post: /r/datahoarder by /u/thirteenthtryataname on 2025-01-06 15:50:44.

Not sure if this is the right place to ask but figured with enough users here sitting on stacks of drives, at some point, one would want to dispose of them with minimal risk. I'm aware I can just go the old fashioned/direct route and zap a few holes in each drive with a drill but ideally would want to find a service to do the destruction and material collection in one stop. Casual searching online always seems to turn up services geared toward commercial customers dealing in bulk quantities and/or wanting to quote for a specific request.

Does anyone know of a consumer-friendly service that offers secure drive recycling, even if it comes at a cost? I used to live in an area where the county would hold an organized semiannual event for residents to bring their materials to be safely shredded, and it included hard drive destruction, which was really nice. But I don't recall what the service was, and it was only announced by mail, so I can't even approach them.

As drives begin to die on me or rack up failed sectors and go to rest on a shelf if they're out of warranty, I'm realizing that I'll need to make room at some point.

4768
 
 
The original post: /r/datahoarder by /u/laktakk on 2025-01-06 15:07:35.

4769
 
 
The original post: /r/datahoarder by /u/jonmppa on 2025-01-06 12:43:25.

dates and filetypes go crazy

Found an old dashcam in my glovebox and decided to check what's inside. Some of the files are recognized as the correct AVI file type, but none of them can be opened.

The data inside was not important anyway, but I decided to share what looks like data loss on a flash memory card. The SD card has been sitting in a glovebox for 5+ years at temperatures from -30 °C to +30 °C.

Feel free to correct me, and let me know if some of the files could be restored.

https://preview.redd.it/r8mu6vgfddbe1.png?width=659&format=png&auto=webp&s=d671548724ce45a1226db79b2e9fc360e3fdd52d

4770
 
 
The original post: /r/datahoarder by /u/fjodor89 on 2025-01-06 12:41:07.

Hey guys. I need to convert my tapes and don't understand the output cables from the camera. There are usually 3. Any recommendations on what converter I need to buy?

Thanks

4771
 
 
The original post: /r/datahoarder by /u/Halfblood200 on 2025-01-06 12:38:54.

4772
 
 
The original post: /r/datahoarder by /u/deadb3 on 2025-01-06 12:02:14.

So, all outbound internet traffic is soon going to be banned by GeoIP, and I need to build a setup for programming and for keeping my sanity with the help of content. Do you know what else I should self-host?

I've already built a beefy home server on a Ryzen 5 3600 with 4 TB of disk space (the 2 hard drives cost more than the whole server lol)

Requirements

  • Python development with local dependency management. Pip builds local packages offline only with a hack. SciPy/NumPy docs
  • g++/clang toolchain and access to popular libraries; local Linux mirrors are hopefully going to work. Sadly, keeping a local copy of GitHub would require an arctic bunker
  • I'd like to learn GNU Radio and Reticulum for wrapping TCP over CW, but I'm not 100% sure which libraries/docs I would need

What's been already done

  • local wiki (Kiwix) and full Stack Exchange archive
  • Jellyfin server with some shows & anime
  • Qwen 2.5 14B & 35B on my main rig for compressed internet knowledge
  • lots of development libraries scattered over my PCs

TODO

  • figure out how to deploy the Stack Exchange archive
  • download some manga (perhaps using Tachiyomi)

So, what else should I do?

4773
 
 
The original post: /r/datahoarder by /u/Samuel__Vimes on 2025-01-06 11:39:09.

So, hardware has always been my weak spot, which now proves to be a problem. I recently made a pretty costly mistake when configuring my AI PC, and I want to avoid that kind of error with the NAS I plan to build this year. When looking up consumer mainboards, a lot of them claim to support ECC RAM, but when you look in the manual, it says stuff like "runs in non-ECC mode", which, of course, is exactly what I'm NOT looking for.

I have researched quite a bit, but I'm struggling to juggle the multitude of information, especially regarding the mainboards.

I have the following use case:

  • Mainboard needs to be able to support an HDD controller AND a graphics card (for a full 16 HDDs)
  • Full support of DDR5 ECC RAM (mainboard and processor)
  • If possible, the processor should have an integrated graphics unit.
  • 2+ M.2 would be optimal
  • Consumer hardware

I need both a graphics unit on the processor and a graphics card so that the graphics card can handle media encoding for a Jellyfin server I'm planning.

Any help or pointing in the right direction would be appreciated.

4774
 
 
The original post: /r/datahoarder by /u/shagbag on 2025-01-06 11:25:41.

Hi,

I have a colo server and I’m looking for things to do with it.

Would anyone be interested in a native NBD file server?

You would be able to mount a file on my server as if it was a drive on your computer.

Then you can store your files easily on the drive.

This is already possible with things like S3 fuse, but these solutions are extremely glitchy since the underlying API is not made for a block device.

My solution would be rock solid and hosted at a professional data center.

I recommend encrypting your files before uploading them if they are at all sensitive.

Other than that it should be extremely easy to use.

Please send me a chat message if you are interested.

Thanks!

4775
 
 
The original post: /r/datahoarder by /u/Lucretius00 on 2025-01-06 11:05:21.

Hello, I use some websites to download stuff (videos; the sites don't host anything themselves, they point at streaming services and serve as a database for the links). How do I save a complete copy of one so that it still points to the online links when I use it offline? Thanks in advance.
