It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
2626
 
 
The original post: /r/datahoarder by /u/Unusual_Poem_9864 on 2025-03-14 04:28:54.

Okay, captured minidv taped with WinDV and set it to split into clips instead of one big file so I can see the time and date each clip was taken, and now I want to join them in virtual dub without re encoding using direct stream copy and append clip. Problem is, I can only figure out how to do one at a time. There's like a hundred clips per tape, and I have tried highlighting all of them and dragging them into virtualdub while holding control but it puts them out of order. How can I combine all of them at once and keep them in the right order by file name. Or do I need some software besides VD. I do not want to just throw them into an editor and end up re encoding them. Thanks.

2627
 
 
The original post: /r/datahoarder by /u/Forsaken_Pea3464 on 2025-03-14 04:02:55.

I used an extension called myfavett on chrome but that only grabbed about a 1000 videos and refuses to download any further. Anyone know any workarounds?

2628
 
 
The original post: /r/datahoarder by /u/SummerWhiteyFisk on 2025-03-14 03:52:54.

Right now my set up is an M4 desktop Mac + 2tb external hard drive (for now). I’ve saved a handful of movies and shows on it and have been watching them through infuse on my Apple tv. Have been very satisfied with how it’s all worked out so now I would like to begin the process of going full hoarder mode and really start loading up on shows and movies.

My immediate first use case is that I want to add all my favorite shows - mainly 30 min sitcoms like Seinfeld, trailer park boys, it’s always sunny, etc. to the drive. Using Seinfeld as an example, each episode is roughly between 800mb and 1gb as it stands now.

I own Apple compressor and would like to run all these shows through it to save on space. Any recommendations for format/audio/visual settings? HEVC? h264? h265? MP4? Other? Really don’t need super high quality here, certainly not 4k, but was thinking 1080.

Also would be curious to hear streaming platform recommendations. Infuse has been terrific so far but didn’t know if plex, jellyfin, kodi were worth a look or better in any way. Thanks in advance

2629
 
 
The original post: /r/datahoarder by /u/Specific-Judgment410 on 2025-03-14 03:49:51.

I've essentially archived a website and want to be able to view it in say Kiwix but that takes ZIM files, so I want to know how I can compress all the html files and folder structure into a zim file that I can view offline or maybe a WARC (i'm not sure how this would work).

The alternative is that I create an app that has a browser that can open html files by decompressing on the fly into ram for example but I feel like this is what a ZIM is. Can anyone help? Thanks.

The reason I'm not using a tool like ZimIT is because I have to edit the html code to eliminate cookie popups, so now it's nice and clean ready to be archived/zimmed up.

2630
 
 
The original post: /r/datahoarder by /u/Titan_91 on 2025-03-14 01:26:22.

Back at the Barnyard was a low/medium budget Nickelodeon spinoff TV series following the 2006 film with 2 seasons airing from 2007 to 2011 in 4:3 (fullscreen) 480i standard definition picture format. Reportedly no episodes were aired in widescreen although widescreen clips do exist. It appears to be framed for fullscreen, as virtually all broadcast content in the US was either SD fullscreen or HD widescreen.

From the little research I've done there were 2 DVD box sets, one for each of the 2 seasons with 6 discs each. Each season had 26 episodes. The discs were burned DVD-Rs manufactured on-demand through Amazon CreateSpace around 2016. CreateSpace no longer exists and the discs are obviously out of print now. And we all know what happens to DVD-Rs after just 10 years.

In the land of peg legs and eye patches, I see most (if not all) episodes captured and encoded to x264. I also see both seasons muxed individually as 2 really long videos encoded to h.265. There is a single episode in DVD ISO form on Archive.org from some kind of Soongebob pack-in DVD which I suppose can be used as a quality reference against the others.

https://archive.org/details/newnickdoublehitdvdiso/SPONGEBOB_SQUAREPANTS.ISO

I'm sure both of these sources are decent quality, but is anyone aware of full DVD ISOs or MPEG-2 muxes of this show that haven't been re-encoded? Regarding the show itself, I find it more akin to Cow and Chicken with actual funny writing and plenty of adult humor.

There is currently a single listing on eBay for the season 1 box set but it's $55 and missing a disc and no possibility to simply to rip and return:

https://www.ebay.com/itm/167320422091

Theres also this site but it's most likely low quality bootleg stuff:

https://www.backtothe80sdvds.com/product_info.php?products_id=3104

It was, or is, available for streaming on Paramount+.

2631
 
 
The original post: /r/datahoarder by /u/Frosty_City_4809 on 2025-03-13 23:42:29.

so looking for ways to expand my nas and was thinking of doing a external sas to sata and was wondering if this is a good idea to power them since i have a unused gpu cable

Amazon.com: Nuhikap ATX 6/8pin 12v to 8 Ways 5v/12v 3A Power Adapter for ATX PSU and 2.5'/3.5' SATA HDD Power Supply Breakout Board Adapter : Electronics

has anyone tried this or think its a good deal?

2632
 
 
The original post: /r/datahoarder by /u/dozer00 on 2025-03-13 22:16:15.

Hi, I'm planning to buy an HDD to use as external backup and I noticed that many users recommend WD Ultrastar DC HC550 or Seagate Exos X18 because they have 5 years warranty but someone told me that some brand puts constraints on these extended warranties for example if the HDD isn't purchased from an official distributor or on some enterprise level HDD.

What about those model of WD and Seagate?

Is the 5 years warranty available for any users and any type of use of the drive?

Thanks

2633
 
 
The original post: /r/datahoarder by /u/byteme113 on 2025-03-13 21:42:20.

Hello! Originally posted on another sub but this ones seems more appropriate.

I'm working on birthday gift for my best friend and wondering if what I want to do is feasible.

Context: Her favorite show is Daria, but for the dvd release they replaced all the music due to licensing constraints. There's already been a huge effort done in the Daria Restoration Project that puts the original music back into the episodes.

I have those files in an MKV format, I could stick them on a USB and be done--But I want to go the extra mile.

I'd like to get a copy of the dvd boxset, rip it--probably encode it based off of some light reading in this sub--and replace the official audio (maybe video files if necessary) with the ones from the DRP, all while hopefully maintaining all of the existing menus and special features etc

It's a couple months till her birthday so I'm going to be researching and figuring it out till then. Any advice or guidance is appreciated!

2634
 
 
The original post: /r/datahoarder by /u/crazyhubble on 2025-03-13 20:57:17.

I am not sure if this is a r/PleX question or a r/Datahoarder question but being it's Plex related, I thought I'd start here first.

I am trying to find a way to automagically sync files to an external drive for travel.

I have Plex automated to download new episodes and I am aware I can just have it make an optimized version to the external drive but I cannot seem to get my optimized versions to work without a ridiculous amount of user input in the most recent version. Also, I use an iPad Pro (2020) for travel and it will not use the external drive as a source for Plex.

I am wondering if anybody knows of a way to have my server look at what is on my external drive, look at a folder (Random Series Folder), compare the 2 and move episodes that are non-existent on external drive but exist on server, to the external drive.

I want next to zero user input. My job entails getting randomly called in at 2 in the morning, and driving 6+ hours to random locations, and sometimes spending multiple nights in a hotel. I would like to plug it in and forget it until I need to go somewhere.

I do realize remote access exists but I am often in areas with little to no internet access. Downloads also exist but I have the 128GB model and that fills pretty quick. I would like to be able to unplug from server, leave, and transfer from external drive (or watch from it).

Synctoys used to exist and seems like it would work rather well but it is pretty non-existent at this point.

I am open to options and if you have any other suggestions, they'd also be appreciated but from what I have found, syncing a folder with an external drive and watching via VLC seems to be the best option. I am more than capable of "marking watched" when I get home to my Plex server.

2635
 
 
The original post: /r/datahoarder by /u/waldesnachtbrahms on 2025-03-13 18:10:59.
2636
 
 
The original post: /r/datahoarder by /u/Extension-Skill8469 on 2025-03-13 18:08:38.

I have a G-Technology RAID external HD 10TB for back ups ( similar to this: https://a.co/d/iQ6bNo6 ). What is the best way to protect/store my external HD long term? I live in Colorado so I wanted to an ESD bag but this HD is a box shape and I don't think will fit in the usual flat esd bags they sell. I was looking at things that might fit like electronic dust covers and large hard cases with foam but they don't seem to offer static protection. Any suggestions?

2637
 
 
The original post: /r/datahoarder by /u/FaithlessnessNo5579 on 2025-03-13 18:04:57.

Hello everyone,

I have ordered (and still waiting to arrive) a brand new Seagate Exos X - X20 20TB ST20000NM007D.

I have heard about the fraudulent Chia HDDs and would like to ask if anyone knows if the 20TB HDDs are affected?

Also, when I receive it, how do I check the FARM? I have watched this video.

Thanks!

https://www.youtube.com/embed/n8WBnoPvkTw

2638
 
 
The original post: /r/datahoarder by /u/elia_firenze on 2025-03-13 18:03:52.

I have to download all the audios that I have saved in my Instagram profile (see attached photo) thanks to everyone

https://preview.redd.it/380w3tzcyhoe1.jpg?width=1947&format=pjpg&auto=webp&s=0405e311f3986cdeceeb07cf96ef13cee083eeb9

2639
 
 
The original post: /r/datahoarder by /u/treasoro on 2025-03-13 17:33:29.

We all heard about seagate drives hitting the market with modified SMART values.

I recently bought a used 12tb ironwolf pro drive which i suspect is fake. SMART indicates 1 power on hour, FARM power on 36 hours.

Is it legit drive or fake drive?

I tried to study the fakes and how they can be recognised and it turns out those fake ones will not redirect to seagate verify page when scanning QR code, instead of chineese warrant check page.

My drives fails at authenticity check.

https://preview.redd.it/bk24gmmvshoe1.png?width=960&format=png&auto=webp&s=ef1df52f43dfea12a4bd7c5a4bbfacfead0676f2

2640
 
 
The original post: /r/datahoarder by /u/BoJackHorseMan53 on 2025-03-13 11:36:35.

Hello fellow DataHoarders, I was a big data hoarder myself, hosting 3 peta bytes of data, until Google started to crackdown. Then I had to delete all the data from Google Drive.

I like the idea of lemmy. We can have our own reddit, free from ads and billionaire imposed rules and censorship.

I also believe in preservation of internet data.

The biggest problem with hosting a lemmy instance is data preservation. The lemmy instance with all of it's data should be available on the internet 10 years from now.

I think the best candidate to host a lemmy instance would be usenet providers as they already have large infrastructure and they're know to never delete user data. And this is what usenet was intended to be, but now, no one uses usenet.

If we can't get any usenet provider to host a lemmy instance, would any of you with a stable setup like to host a lemmy instance?

The biggest problem with hosting a lemmy instance is data preservation. The lemmy instance with all of it's data should be available on the internet 10 years from now.

The second biggest issue is getting promotion of the site and getting users to switch from reddit to lemmy.

If some of you are interested in the idea, we can work together to host our own lemmy instance with proper documentation to make it easy to onboard nes users.

2641
 
 
The original post: /r/datahoarder by /u/five0first on 2025-03-13 17:30:17.

Hey all,

I have a nvme that I carry around with me and I use on my various pcs. It has portable apps on it so that no matter where I go, everything is exactly as it was wherever I am. My question is does such a thing exist where an enclosure for an nvme drive has it's own docking station? I'm imagining like a little vertical box that has a usb c male end embedded down inside (think like a Nintendo Switch dock) where I can just slot the external enclosure into in order to connect it to my PC. It could be considered a nonissue to just let the external drive lay on top of my desk and have a cable running over to it, but I think it would be neat and tidy to have a dock like that instead.

2642
 
 
The original post: /r/datahoarder by /u/quit_smoking1 on 2025-03-13 17:25:05.

I basically have two questions if anyone's got the time...

  1. If I'm specifically looking at Server Parts Deals for used 3.5" drives, would it be better to go with "Manufacturer Recertified" vs. "Seller Refurbished" (ignoring price)? Manufactrurer Recertified has 2 year warranty, so that alone might be worth it, but I've heard horror stories about companies like WD and Seagate not even checking or doing anything when they get return drives and just immediately turning around and selling them as "recertified". Whereas maybe Server Parts Deals themselves put at least a little more effort into making sure their "refurbished" drives are good to go? Obviously, it might just all be a guessing game, but wondering if anyone has any firsthand experience or advice.
  2. If I have a basic 2-bay Sabrent HDD dock, is there any issue using both an 18TB and 20TB drive at the same time? Assuming the dock can accommodate up to 20TB (which it does) and that there's no mirroring/backup going on? I'm not trying to combine the drives, either; I just want to use two drives in the same dock but have them be separate on my PC.
2643
 
 
The original post: /r/datahoarder by /u/BreadfruitExciting39 on 2025-03-13 15:32:54.

I just found out I apparently have an issue with the allocation unit size on my NAS, and folders with many small files take up an unreasonably large amount with respect to the "size on disk". I am starting to run low on space on my NAS, and cannot afford to upgrade drives at the moment, so I am looking for ways to trim the fat. From what I understand, too large of allocation units can make small files waste a ton of space.

What I don't understand is: if I delete a folder that takes up a huge amount of 'size on disk', the free space on the drive only increases by the file size that was deleted. For example, I have a folder that is ~400mb but reports taking up ~46gb on the disk. I would expect deleting that folder to provide me with 46gb more free space, but it only increases free space by 400mb.

Can anyone help me figure if it's worth the time to find these directories and compress them in order to save the 'size on disk'? Or will it not make much difference anyway?

https://preview.redd.it/5k3jxv2n7hoe1.png?width=322&format=png&auto=webp&s=f2c244d6486b9ba7cfd13988c75dcd760fa11d74

2644
 
 
The original post: /r/datahoarder by /u/Neurrone on 2025-03-13 15:24:22.
2645
 
 
The original post: /r/datahoarder by /u/True-Entrepreneur851 on 2025-03-13 13:48:13.

If anyone could help me into this please. Here is the issue: rclone was moving files from remote to my Synology without any issue. But since last weekend it stopped. I tried to recreate the scheduled task, everything, …. Task seems to be running without any data. I logged to my NAS thru Putty, running the command was working like a charm. Then went to my scheduled task, no change but just run it and …. It works. What am I missing please ?

Command in the scheduled task is : rclone move remote:share /vol1/share -P -v Task set with root user of course.

2646
 
 
The original post: /r/datahoarder by /u/tmitifmtaytji on 2025-03-13 13:42:28.

I have 2x WD101EDBZ right now, and I am thinking about either getting two more of the 10GB Elements drives and shucking, or just getting two WD101EFBX which seem to be pretty similar, and using them all for the same volume.

What's my best option? Will the Elements drive likely have changed in the couple years since I first got them? I'd rather have 4 absolutely identical drives but if close enough is good enough I might rather go for the sure thing of the Red Plus rather than chances on what is in a shucked drive.

2647
 
 
The original post: /r/datahoarder by /u/Huihejfofew on 2025-03-13 07:32:41.

I have already copied a folder with my files to another drive using file explorer not teracopy. I've just got teracopy, i know i can test each folder to get a hash file for each folders. But with a hash file save for each folder how do i get teracopy to compare both hash files to confirm if the files are the same?

2648
 
 
The original post: /r/datahoarder by /u/Spinelli__ on 2025-03-13 05:30:17.

I'm thinking of replacing my WD Ultrastar HC520 (SATA) 12 GB HDDs with Seagate Exos 2x14 Mach.2 (SATA) 14 GB HDDs. I thought the Seagate would be around at least as fast, if not a touch faster - and it is in sequential r/W - but in random 4K QD1 T1 tests, according to a video review of the Seagate 2x18 (even slightly faster than the 2x14), my WD seems to completely & utterly obliterate the Seagate to the point that I'm skeptical of the results and rubbing my eyes in disbelief.

I've included a picture of the tests but here's a breakdown.

My WD is performing around 3.5x - 4.0x faster in 4K random reads and around 1.6x - 1.7x faster in 4K random writes.

For a HDD to be around 3.5 - 4.0x faster in something than another HDD, that's like 20 years or so of progress, isn't it? Normally drives are like 20% faster here, 5% slower there, etc., not 250-300 % faster than another competitor's drive.

Is the WD Ultrastar really 3.5x - 4.0x faster in 4k random reads and 1.6x - 1.7x faster in random writes? This seems unbelievable to me. Even "unbelievable" is an understatement. There's just no way.

System:

  • Motherboard: Asus Z790-A Strix D4
  • CPU: Intel i9-14900KS
  • GPU: Nvidia RTX 3070 Ti
  • RAM: G.Skill 2x 16 GB Samsung B-die dual-rank 4200 MHz, 16-16-16-32, fully tuned (secondary, tertiary, etc. timings)
  • OS: Windows 10

P.S. I have my WD drives connected via USB 3.2 via a very cheap USB 3.2 HDD enclosure.

https://preview.redd.it/yqocksll7eoe1.jpg?width=3160&format=pjpg&auto=webp&s=a574ead751eb139f9dfefa2438470e4935ae9950

2649
 
 
The original post: /r/datahoarder by /u/skiprecon777 on 2025-03-13 05:29:39.

Not sure if this is the right place but I couldn't find any dedicated subs for DC++

I started using DC++ and I had the client set up to establish it's own connectivity settings. In my router I can see the port forwarding rules it has created.

It allows me to connect to the hub in Active mode, and connect to users, but after ~10 minutes I lose the ability to connect to users directly. If I restart DC++ the problem is corrected but will again happen after several minutes.

Im trying to get some advice on how I can set up connectivity/port forwarding settings so the connection remains established/uninterrupted.

Or, if there's a better place I can go to ask about this id appreciate being pointed in that direction.

2650
 
 
The original post: /r/datahoarder by /u/trenchwork on 2025-03-13 05:20:18.

I plan to buy some storage to start better equipping my current home pc (which is currently mostly used for competitive gaming aka old/low graphics, intensive browsing/research, video capture etc.) to handle the large amount of irreplaceable media I already have collected, and to soon begin "archiving" (no need to correct, I know buying drives is not actual archiving) orders of magnitude more video and photos from many sources including my own, which I will be accessing/loading, editing, and moving around a lot. Eventually the goal will be to assemble it all into a project which can be insured on other machines, hosted etc. I don't know that the totality of the data will exceed, let's say 50tb, but no way to know. In the meantime I would like use my few 1tb SSDs to start collecting and working on the data, and a single large high quality interal HDD to both constantly mirror the SSDs and amalgamate what is finished and ready to stow away from the SSDs. From this internal HDD I will be taking consistent external backups. I don't have the money to go multiple large HDDs in RAID right now, so I am thinking of something in the 8-16tb range to get started, since for all I know total data I end up keeping could randomly end up less than that..

I have been gathering inexpensive/on sale SSDs but am now looking into a single large HDD and confused by the pricing on these items, for example, as it relates to the performance/reliability gap from desktop to enterprise hardware;

https://www.amazon.com/Seagate-BarraCuda-Computers-ST10000DM0004-Refurbished/dp/B07MWCVMXJ#customerReviews

vs

https://www.amazon.com/Seagate-Enterprise-Hyperscale-7200rpm-Improved/dp/B0CF5XVHMS

vs

https://www.amazon.com/Seagate-Enterprise-Cache-Internal-ST10000NM0016-Refurbished/dp/B07H8PHXYH

Because this is my personal machine, I would also be tagging non-competitive games and active data of other kinds on the HDD until I need more space, so could use some guidance on what I should be looking for performance wise. I would like to stay within a $180 maximum price for this single HDD.

view more: ‹ prev next ›