It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
4551
 
 
The original post: /r/datahoarder by /u/Temaktor on 2025-01-17 22:35:42.

Hi there.

I'm looking to build a NAS for my homelab.

I am not looking to get a complete build list, rather I need some input to start my own research as I have no idea about server hardware and no experience with the long term needs of the software involved.

I have a Homeassistant Yellow, some Raspberry Pis for light services.

I have virtualization servers running Proxmox that I only start when needed to save energy.

Now I'm looking to build a NAS running TrueNAS Scale to complete my initial Homelab plans.

My plans for this NAS besides normal filestorage and backuptarget include mediastreaming via Plex or Jellyfin, hosting Nextcloud and running some services that I think would overwhelm a Raspi at peak but don't need a real server most of the time like PaperlessNGX.

(I got into some light datahoarding recently, which has ballooned my storage capacity target a bit... a lot actually xD)

Before I get to planning the actual hardware I would love some input on how to spec it in general...

I could especially use some guidance on the usefullness of read/write cache for my usecase?

I'm thinking of going with factory recertified drives and using RAIDz2, is this a good idea?

While I have built the occasional gaming PC and work in IT, I am unsure about the power consumption of consumer hardware in idle and server hardware is beyond me and I don't really want to get into it too deep.as I have no further usecase.

(I don't have any applicable hardware lying around).

My biggest concern is the power consumption while idle (which would be most of the time).

Some more thoughts / wishes of mine:

  • I plan to go for some 10s of TB, depends mostly on whats left of my budget after the hardware xD

  • I'm thinking of using RAIDz2 with about 7 HDDs.

  • Atleast 10gig, would love 25gig networking (I have a USW-Pro aggregation)

  • Should be rackmountable

The NAS itself will only be used by me for the time being, although some of the intended services will have a few more users, max about 4 total.

(Absolute max budget without storage if absolutely necessary would be 2000€)

Thanks :)

4552
 
 
The original post: /r/datahoarder by /u/snaffleton on 2025-01-17 22:06:02.

I'm trying to continue downloading my TikTok likes and favorites, but when I try, it gives me an error message.

"Error reading local files in your folder. Make sure you can still open "Archive.html".

If you think you may have done something that corrupted your local files, try to restore them by going to [yout folder > data > .app data > backups] and follow instructions. (note: ".appdata" may be hidden on MacOS)"

I'm using windows 10. When I try to open the archive file, it takes me to what looks like a loading screen. It says "Initializing..." and underneath says "Having an issue?"

I've followed all of the instructions given in the folder. I've white-listed the files that it told me to. I've taken the files out of the backups folder and copied them back in. I've tried opening the Archive.html file in Internet Explorer, Chrome, and Firefox. Nothing is working, and I'm extremely frustrated. I need to save everything I can, but it's kinda impossible when the extension doesn't want to work.

4553
 
 
The original post: /r/datahoarder by /u/mason2techie on 2025-01-17 21:47:14.

Hi, I am looking to extend the storage for my servers and am trying to find an inexpensive way to go about this. I saw recommendations for the NetApp DS4246 and DS200, but the 4246 is a bit outside my price range, especially after factoring in shipping cost :/

Does anyone here have any recommendations? My servers are Intel R2208WFTZ and I am located in Northern VA.

4554
 
 
The original post: /r/datahoarder by /u/misteryseeker16 on 2025-01-17 21:32:40.

Hi, I’m looking for a program to convert files from mp4 to hevc, I don’t really care about quality or how it turns out, I just need to convert a couple videos to use them into an app that apparently can only read that type of format (yeah I know it sound stupid) possibly free, I don’t really plan to convert many videos or use it too much, so it would be wasted money.

Thank you in advance :)

4555
 
 
The original post: /r/datahoarder by /u/Vidar4k on 2025-01-17 20:12:58.

Hi Guys,

I have been running this das for a couple of years and its been perfect.

I use drivepool. It has 2 x 16tb, 1 x 10tb and 2 x 8tb.

I only ever got about 40mb/s transfer.. so I upgraded the cable (duh) recently and get 130ish.. happy me.

But drivepool keeps losing the drives. I bought 5 more cables cause I'm a psycho and still have the same issue.

I bought a 3.1 cable at first. same issue. now im onto 3.2 cables.

Any idea where I look to fix this? drivepool? usb drivers?!

thanks!

4556
 
 
The original post: /r/datahoarder by /u/LaundryMan2008 on 2025-01-17 20:10:15.

I haven’t been posting many high quality posts lately on Reddit on any subreddits due to college exams (Unit 1 networking and Unit 2 Access Databases) and other stuff in life happening but I have built up quite a few data storage media in the meantime to post in the upcoming weeks in the month that I was away which I’ll be posting about, also managed to get a job which pays me consistent money which means I’ll be able to buy some more media to hang up on my wall more frequently.

Today I got a LTO cleaning cartridge and a damaged LTO-4 cartridge to hang up (also have a LTO-5 cartridge but I haven‘t opened it yet (sealed) because I haven’t reprogrammed my LTO-5 tape drives yet so it’s not ready for my wall until I have the tape drives reprogrammed and ready to test) on my wall, the cleaning tape I bought to see if it did anything to fix the tape drive from work experience but it was unsuccessful (prior to that, I did a manual clean and after the cleaning tape, I took out the heads and gave them a very good clean with a Q-Tip which was also unsuccessful), the damaged LTO-4 cartridge came from a broken HP LTO-4 tape drive which I managed to fix, the tape had ripped off and the leader came with it so I took the tape out manually and rethreaded the leader into the mechanism which fixed it.

About the 3D printed bezels, the same reason as with the data storage mediums posts, I had to take a pause on that to focus on my exams but since they are done I’ll be getting back to prototyping which is almost done, just move the button down a little and it should be ready to put up on eBay and on a .STL site for 50p.

As for the 4 tape drives that I bought off eBay, I still had some trouble figuring out the pinout of the library edge connector on the sled to give it some UART signals and my USB to UART adapter had gotten lost in shipping so I couldn’t actually try reprogramming it, I know where the power lines are but not the data ones, I do know what pins they are on but since I don’t have the USB to UART converter handy, I can’t actually check which pins are Rx and Tx until I get my adapter, I can give you a small spoiler, I did a head swap between a library drive and the work experience one but I won’t reveal the result until I get all 4 (plus the one from work experience) fully tested and reprogrammed.

Thank you for reading this Friday‘s post and I hope you have a great day, if you have any queries, thoughts about the format, additional information or to point out a mistake, please put them in the comments :)

Link to previous post, post 11 (25th week): https://www.reddit.com/r/DataHoarder/comments/1h8dk0b/my_data_storage_mediums_post_11_25th_week/

All of my tapes so far, I also have a LTO-5 cartridge but I’m not opening it (sealed) until I have my tape drives reprogrammed so it’s not present there

IBM LTO Universal Cleaning Cartridge, ideally I would have wanted an HP one as LTO-6 will be black in my order of all generations and no color duplicates but I’ll probably collect HP and other brand color tapes (including WORM making it nearly double the amount of tapes) anyways just because I can and because LTO is a cool data storage medium format, I’ll then proceed to show off my tape collection like the person with the Windows OS collection when I do

LTO-4 cartridge (holds 800GB uncompressed and 1.6TB compressed, also is the last generation before LTFS is introduced for LTO-5)

Sneak peek at one of the 4 tape drives that I fixed successfully

The reason it was damaged was because the tape drive somehow ripped through the tape and made it get stuck in the tape drive, I extracted the tape manually and pulled out all of the remaining tape in the drive and rethreaded the leader hook back through the mechanism

4557
 
 
The original post: /r/datahoarder by /u/AshleyAshes1984 on 2025-01-17 19:40:09.
4558
 
 
The original post: /r/datahoarder by /u/km14 on 2025-01-17 19:18:45.

I'm an artist/amateur researcher who has 100+ collections of important research material (stupidly) saved in the TikTok app collections feature. I cobbled together a working solution to get them out, WITH METADATA (the one or two semi working guides online so far don't seem to include this).

The gist of the process is that I download the HTML content of the collections on desktop, parse them into a collection of links/lots of other metadata using BeautifulSoup, and then put that data into a script that combines yt-dlp and a custom fork of gallery-dl made by github user CasualYT31 to download all the posts. I also rename the files to be their post ID so it's easy to cross reference metadata, and generally make all the data fairly neat and tidy.

It produces a JSON and CSV of all the relevant metadata I could access via yt-dlp/the HTML of the page.

It also (currently) downloads all the videos without watermarks at full HD.

This has worked 10,000+ times.

Check out the full process/code on Github:

https://github.com/kevin-mead/Collections-Scraper/

Things I wish I'd been able to get working:

  • photo slideshows don't have metadata that can be accessed by yt-dlp or gallery-dl. Most regrettably, I can't figure out how to scrape the names of the sounds used on them.

  • There isn't any meaningful safeguards here to prevent getting IP banned from tiktok for scraping, besides the safeguards in yt-dlp itself. I made it possible to delay each download by a random 1-5 sec but it occasionally broke the metadata file at the end of the run for some reason, so I removed it and called it a day.

  • I want srt caption files of each post so badly. This seems to be one of those features only closed-source downloaders have (like this one)

I am not a talented programmer and this code has been edited to hell by every LLM out there. This is low stakes, non production code. Proceed at your own risk.

4559
 
 
The original post: /r/datahoarder by /u/Puzzled_Student7940 on 2025-01-17 19:17:37.

I am organizing my data and started with a "raw file dump" and copied it and organized the copy. I did it peacemeal over time and want to know if during my organizing I missed any files for transfer. Basically one is organized into subfolders, and the other is organized into different subfolders. Can I compare the actual Jpegs and such to see if there are any files present in one but not the other?

4560
 
 
The original post: /r/datahoarder by /u/kiltannen on 2025-01-17 18:36:11.

I have a case for a Synology 8 bay without a motherboard.

I'm kind of keen to put a controller of some kind in & run a 2nd NAS (I do have running DS1813+) does anybody have advice?

I'm thinking a raspberry pi - maybe a 4 or 5 and then some kind of NAS drive system. Ideally I'd like one that allows mismatched drive sizes like the Synology does, but I don't know what works.

Just beginning to think about this, and looking for advice. Sorry of hoping somebody on here might have done this before & can offer some sage words...

4561
 
 
The original post: /r/datahoarder by /u/RaleighElectroQuest on 2025-01-17 16:54:38.

Am currently trying to archive my collection of Chaotic Trading cards as the game is dead and all available images online are very low quality. Anyone have experience with scanning foil and improving the scan quality of the art? Here's an image reference for comparison. You'll notice some foils look very good while some are coming out almost too dark to see. Am using a Canon LIDE 400 flatbed scanner

https://i.imgur.com/irDo2rN.png

4562
 
 
The original post: /r/datahoarder by /u/FantasticlyWarmLogs on 2025-01-17 16:51:14.
4563
 
 
The original post: /r/datahoarder by /u/parkercodes on 2025-01-17 15:47:46.

Hi everyone,

I'm reaching out to this amazing community for some assistance with a tricky situation. I’m trying to help a client retrieve data from a Samsung 1.8-inch hard drive, Model: HS061HA. This particular drive uses a ribbon connector, which I’ve included photos of for reference (will attach).

From my research, I understand this type of drive was commonly found in older laptops and portable devices. However, finding a compatible adapter to interface it with modern systems (e.g., USB or SATA) has proven to be quite challenging. I’m hoping someone here might be able to point me in the right direction.

Here are the details:

  • Drive Type: Samsung 1.8-inch HDD
  • Model: HS061HA
  • Connector: Ribbon-style connector

What I need is an adapter or interface that would allow me to connect this drive to a more modern system (ideally via USB) so I can access the data. If anyone has experience with this type of drive or knows where I might source a suitable adapter, I’d be incredibly grateful for your help!

Thank you all in advance for taking the time to read this and share your insights. This community has been a lifesaver for me in the past, and I’m hoping you can work your magic once again! 😊

Cheers,

Parker

/parkercodes

https://preview.redd.it/w4hqrzivrkde1.jpg?width=3024&format=pjpg&auto=webp&s=24a1a3ca9cac2341fdd175c09a9285ee71fea483

https://preview.redd.it/y3fj4bhvrkde1.jpg?width=4032&format=pjpg&auto=webp&s=240b88186db16eb511ee980f537fb0589f29a6a7

4564
 
 
The original post: /r/datahoarder by /u/StoutSeaman on 2025-01-17 13:38:53.

I recently found my dad's iPod and laptop. It was thrown into a box of stuff after he died pre-pandemic. I remember how hard he worked at building his iTunes library; whenever he traveled, he would raid the local libraries, friends and family, etc and rip choice CDs. He had very good taste and he also kept things very well organized, even downloading correct artwork, etc. and clearly he was a bit of a pirate. Here's the issue: it's all opera and classical. I don't really care for opera at all and I grew up listening to classicial around him and it honestly bores me.

I know we all have these fantasies of our kids (if we have them) enjoying our music catalog after we're gone, lovingly handing it down to them. I have kids but as it is, they grew up with my old iPod nanos so they literally already know my digital catalog and someday they might be interested in my 300+ LP collection. My dad's catalog is around 5k tracks.

I'm at a loss for what to do with his catalog of music. I don't really want to sell it since it does contain some pirated stuff but has anyone else been in a similar situation and what did you do? In the greater philosophical sense, it does make one realize how unprecious most of what we have is or will eventually be.

4565
 
 
The original post: /r/datahoarder by /u/Fish_Fellatio on 2025-01-17 13:07:34.
4566
 
 
The original post: /r/datahoarder by /u/robinbrinton on 2025-01-17 09:43:36.

There is a website called timinvermont.com which hosted possibly the biggest collection of vintage gay erotica online. Hundreds of magazines and books were purchased, scanned in and uploaded - seemingly by one person - over 20 years. There is a list of all the magazines here (one listing I found was originally published in 1866), and the list for everything that was on the site is here.

It was a paid subscription service with a login which I’ve been told no longer works. It is reported in other forums that the owner has died and the site is now inaccessible. 

It is a real shame that this persons life’s work and passion for buying and scanning all of this printed material has now completely gone from the internet. The waybackmachine has its first snapshot of this site back in 2003. Also it is likely some of these scans are the last traces of these historical works existing.

Not sure if this is the correct place to ask, but I was wondering if anyone can see if all this data still exists online and if it can be saved. I’d be happy to look in to downloading it all if so and potentially finding somewhere else for it to go. Grateful for any advice!

4567
 
 
The original post: /r/datahoarder by /u/ReagentX on 2025-01-16 22:28:44.
4568
 
 
The original post: /r/datahoarder by /u/4619 on 2025-01-16 21:16:42.

The 4k version of a movie is NOT the superior version by default. Movies or series recorded on (analogue) film, which in general is anything before 2000, 9 out of 10 times it's just an upscaled version of the 1080p rescan. From 2000-2010 digital cinematography gained pace and has to be looked into case by case. Only few films get a proper 4k rescan (which then can look marvelous indeed); some film can not be scanned in 4k or wouldn't see any benifit due to the type of film used. Upscaling almost always fcks up something; contrast, fine details, introduce artifacts and more. A very popular thing to do is degraining or cleaning the picture of noise which is a universally hated process by videophiles. The difference in picture quality becomes even more apparent when you look into cel animation. Some of you prefer the shaved look knowingly, i know, but i fear most people just don't know anything about this.

Anyways, instead of shelling out money for always bigger and better drives, hoard the proper rescans in 1080p. I feel 4k torrents have (unjustifiably) better traffic as the years go by and god forbid the og FHD versions disappear at some point.

4569
 
 
The original post: /r/datahoarder by /u/SonicLeaksTwitter on 2025-01-16 20:38:22.
4570
 
 
The original post: /r/datahoarder by /u/Responsible_Pin4589 on 2025-01-16 20:13:18.

I want to view old questions I asked on Yahoo Answers from 2010-2016, but the site was shut down in 2021. I tried accessing the archive at https://archive.org/details/archiveteam_yahooanswers but I’m confused on how to access the data. The Wayback Machine doesn’t allow me to use the search function, I don’t know which files to download, and there’s 35 TB of data which would be impossible to sort through. How would I be able to find my old posts? Thank you!

4571
 
 
The original post: /r/datahoarder by /u/Ballin_Like_Curry on 2025-01-16 20:05:52.
4572
 
 
The original post: /r/datahoarder by /u/megaladon44 on 2025-01-16 19:53:56.
4573
 
 
The original post: /r/datahoarder by /u/MorningLiteMountain on 2025-01-16 18:47:31.

I bought some used drives from Server Part Deals, did a full verify test with Victoria and got no errors. Those drives as well as another one I already had are 12tb and I wanted to copy a nearly full old one onto one from SPD. I only have a laptop so I’m limited to USB speed when copying the nearly full one on to an empty used one.

The first time I tried copying I just used windows explorer and it failed after about 200GB. Transfer speed was 0 and I ended up having to cut power to the drive. The second time I used Tera Copy and it failed after about 600GB because the drive was no longer visible/accessible to the PC.

I removed the drive from the Orico enclosure and put it into an Ugreen one. Although it hasn’t finished copying, it’s about 2/3s of the way through without apparent problem. So should I just assume the Orico enclosure is the problem and throw it away?

4574
 
 
The original post: /r/datahoarder by /u/decipher90 on 2025-01-16 18:32:40.

Recently purchased an external drive, the Seagate expansion 14TB external hdd, it's a 7200 rpm drive and comes with an 18W power adapter. On idle it's 54°C(130F) and while reading/writing it reaches 64°C(148F), should I be worried? I'm afraid if put a small table fan near it it, the constant vibrations will reduce it's life span in the long run, any other cooling solutions?

4575
 
 
The original post: /r/datahoarder by /u/Dirphia on 2025-01-16 17:29:55.

I’ve got an absurd number of photos sitting on my drives, and it’s become a nightmare to sort through them manually. I’m looking for AI software that can automatically categorize them into groups like landscapes, animals, people, documents, etc. Bonus points if it’s smart enough to recognize pets vs. wildlife or separate types of documents!

I’m using Windows, and I’m open to both free and paid tools. Any go-to recommendations for something that works well for large photo collections? Appreciate the help!

view more: ‹ prev next ›