It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
2301
 
 
The original post: /r/datahoarder by /u/0nlythebest on 2025-04-04 17:18:08.

Hello,

I recently picked up a ton of hard drives from an acquaintance.

8TB, 12TB, and 18TB Hard drives. He said he wiped them all and reformatted. He was using an external hard drive enclosure via USB, and took some photos with CDI (Crystal Disk Info). I received them and wanted to check CDI on them myself. Everything works fine except the 12TB models, no reading at all, theyre not even recognized in bios or CMD.

So I asked him to send me the CDI pictures of those 12TB models and they say Interface: UASP (instead of serial ATA like the rest of them). I googled it, and read that it means USB Attached SCSI Protocol, also read a little bit about it. But everything i'm reading basically makes it sound like this interface only applies to external hard drives. So why would this internal SATA hard drive have UASP listed as the interface, and is it possible to convert it to standard interface to use as an internal hard drive with direct sata to my motherboard ?

the 12TB hard drives in question are these: they are from a datacenter.

https://www.amazon.com/HGST-Ultrastar-HUH721212ALE600-3-5-Inch-Internal/dp/B07PF1TVND

Any input appreciated!

thanks

2302
 
 
The original post: /r/datahoarder by /u/sunburnedaz on 2025-04-04 16:31:54.

Im currently manually using Treesize Pro for my deduplication needs but its lacking a feature I really want.

I would like to set a "source of truth" and then have the tool run over selected locations looking for files that are duplicates from that "Source of Truth".

Is there software out there that would have tha feature

2303
 
 
The original post: /r/datahoarder by /u/ignoble93 on 2025-04-04 16:06:19.

Been using Streamlink and never encountered video/audio sync issues until the streaming service decided to separate the video and audio streams. So I now use this command (see below) but until now there are occasional outputs that aren't in sync. Also, some files have incorrect timestamps and missing video frames towards the end. I am familiar with python but Streamlink is too complicated to modify. Can somebody help me what should be the correct command?

command = [
        'streamlink',
        '--url', url,
        '--default-stream', 'best',
        '--output', output_file,
        '--stream-segment-threads', '5',
        '--logfile', log_file.replace('.txt', '_hls.txt'),
        '--loglevel', 'trace',
        '--ffmpeg-ffmpeg', r'C:\ffmpeg\bin\ffmpeg.exe',
        '--ffmpeg-verbose-path', log_file.replace('.txt', '_mux.txt')
    ]

2304
 
 
The original post: /r/datahoarder by /u/hollywoodhandshook on 2025-04-04 13:36:46.
2305
 
 
The original post: /r/datahoarder by /u/PricePerGig on 2025-04-04 07:42:18.
2306
 
 
The original post: /r/datahoarder by /u/manzurfahim on 2025-04-04 07:22:52.

I recently (18th March) purchased a 20TB Seagate drive from serverpartdeals, it was $255.84 total (ST20000NM007D).

I was thinking of getting another one yesterday and saw that they increased the price to $259.99 (excluding tax).

Not sure what to do, I thought I'll decide tomorrow. I just checked again, and the price is now $304.84 total ($279.99 before tax)

Seagate Exos X20 ST20000NM007D 20TB SATA 3.5" Recertified HDD — ServerPartDeals.com

In less than three weeks, the price was hiked almost $50. 16TB drives were $179, now they are $229.

Is this happening because of the new tariff?

2307
 
 
The original post: /r/datahoarder by /u/umataro on 2025-04-04 06:58:52.

For 3 days I've been trying to make the decision. Every few hours, I prefer the other one. To clarify, if I went with individual drives, 1 would be in nas, 1 in backup nas, 1 at a friend's house. I take and replicate frequent snapshots so maximum data loss would be 15 minutes or 1 hour (I adjust the frequency manually based on what I'm currently working on). I would be grateful for some external input on this.

2308
 
 
The original post: /r/datahoarder by /u/TheRealHarrypm on 2025-04-04 06:00:40.

The long-awaited classic demonstration tape for using VideoPlus, decided to throw up the captures as a reference demo, this is transferred for Andy (Robust Reviews) for his video on the actual hardware remote which is a great watch for a little bit of home recording history.

This is a re-upload & re-post as there was a scripting error with the deinterlacing (BFF instead of TFF processing)

2309
 
 
The original post: /r/datahoarder by /u/Legitimate_Pea_143 on 2025-04-04 05:49:11.

I've tried using anydriib countless times now and it's never actually worked. I download the file (usually a zip or rar file) and it's always says the file is corrupt. i have NEVER had any luck using anydebrid or any other debrid site.

2310
 
 
The original post: /r/datahoarder by /u/HopeThisIsUnique on 2025-04-04 05:43:13.

So many years ago I picked up a Nimbie CD robot with the intent of doing my library. After some software frustrations I let it sit.

What options are there to make use of the hardware with better software? Bonus points for something that can run in Docker off my Unraid server.

If like to be able to set and forget doing proper rips of a large CD collection.

2311
 
 
The original post: /r/datahoarder by /u/TeacupTenor on 2025-04-04 01:42:34.

I've been using GoodSync to backup data for a number of years. I use a two-way sync so that the two drives I copy back and forth contain the same data.

I've noticed that periodically GoodSync's backup space estimate goes way up in my target drive. When I check what it wants it to sync, I see a list of basically the majority of my files. I've noticed this happen with portable hard drives, and today, for the first time in a portable Samsung Shield rugged SSD.

I used to believe that it was some kind of break down in the hard drives themselves, but now I'm not sure, since the SSDs have never given me trouble before.

Has anyone else experienced this? Is there a setting that maybe I'm not using correctly that is somehow making GoodSync "refresh" the data?

Thanks.

2312
 
 
The original post: /r/datahoarder by /u/UnassumingDrifter on 2025-04-04 01:10:36.

I'd like to back up my main file server to another machine I built. I have about 40TB of data: 80% is large-ish media files, 20% is documents, photos and smaller files. I'd like a solution that can take that into account when setting up the backup. Currently I'm using, and successfully, Duplicati. It's free and open source and I like there is a Web UI even if it's kinda plain. What I don't like is that it isn't super fast. It will spike to 3.5Gb/s network thruput for a few seconds, then jump down to 1Gb/s or less for a minute or so. I am using a Threadripper 5955WX for the backup machine with a bcache backed RAID6 array. Based on fio test I should be able to sustain 3.5GB/s random writes and my file server can sustain that based on tests. What I think is happening is it appears that only 1-thread is being used for compression / etc. SO, I want something faster.

What I want: Speed - should be able to utilize hardware better. I'd like to be able to backup to local drive, not interested in cloud backup. I'd like it to work with smb shares. Docker would be nice but I'll settle for a local installed app as long as it works with openSUSE Tumbleweed. I don't mind buying something if it's reasonable price, but I do expect if it's a pay program it has a better UI than the free stuff. I do see Duplicacy has a free CLI but I'm more interested in something with a GUI, and preferably a Web UI so I can manage it remotely, so that's the Home Version. I'm not opposed, but I really don't know yet if it'll be more performant than Duplicati. Anyway, this got me thinking - if I'm willing to pay, what is out there? I know about Veeam but I tried a demo and ran into difficulties. It's been a bit so I don't recall what the issue was but I moved on.

What other "pay" backup applications should I consider? If there's a free one you can think of besides Duplicati I'm down. I did try some Borg backup docker UI container but I had issues. Again, maybe I'm the issue, but just getting that out.

2313
 
 
The original post: /r/datahoarder by /u/CantStandIdoits on 2025-04-04 01:07:51.
2314
 
 
The original post: /r/datahoarder by /u/TristinMaysisHot on 2025-04-04 00:32:42.

I'm trying to get a list of all files on a hard drive. For example on E: I have 5 folders and inside those folders are thousands of movies. There is also some sub folders inside the folders. What is the best way to go about getting a list of everything?

I tried doing this command i found on Google, but it doesn't do anything.

dir e:*.* /s /on > c:\filelist.txt

2315
 
 
The original post: /r/datahoarder by /u/fletchnuts on 2025-04-03 23:10:26.

I recently purchased and shucked two of the Seagate Expansion 28TB external drives (labeled as Barracudas), and put them in a Terramaster D4-320. The Terramaster site says the enclosure only supports up to 22TB, but these 28TB drives are working just fine.

This is just an informational post because I couldn't find any information the D4-320's support for larger drives.

The read/write performance of these drives is pretty good. I'm seeing about 240-260MB/sec.

2316
 
 
The original post: /r/datahoarder by /u/johnny_ringo on 2025-04-03 22:45:07.

Original Title: Question for the serious DHer's with 70TB of data+ How do you organize everything in your personal collection. And I mean everything- from email, to photos, to videos, to receipts, to unique app project files...


Photos, Videos, Large 3d data files, personal projects, mail backups... basically my life and creative work all in one spot. Sorting videos and photos by year makes sense, though it is tedious to rename every date + a quick descriptor. Then it gets REAL tedious to go through those odd folders that are 1TB of small files called "x-to sort later" Do you organize by filetype? by year? by big events? Last question, how do you know what files are just a waste to keep- like those thousands of .col files that Capture One weirdly creates? Thanks.

2317
 
 
The original post: /r/datahoarder by /u/burnthew1tchh on 2025-04-03 20:41:28.

Hey guys, so I've backed up my linux server via rsync and I was thinking of creating a cron job to backup new files, and backup files that were changed but I don't want the deleted files in the main server to be deleted in the backup. So it's not 1:1, I guess?

If I have files A, B, and C in my server and it's backed up. And files A gets deleted, B gets changed, and C remaings the same. When I do a backup. I want to retain A, B changes and C is not touched. I would like to continue using rsync if possible.

Sorry, english is not my first language. Adding 'Backup' flair but I know this is not a Backup setup. It's a hoard all the files setup. hehe

2318
 
 
The original post: /r/datahoarder by /u/SupermarketNew5003 on 2025-04-03 07:22:15.

I’m building a private home server to archive my media library for personal access.

I’ve run into handshake issues when using streaming devices through capture hardware — some signals don’t pass through properly.

Can anyone recommend a 4K HDMI splitter (HDMI 2.0 / HDCP 2.2) that works well with streaming boxes and capture cards, and maintains full signal without blank screens or errors?

Not redistributing or uploading anything — just trying to maintain a personal offline library.

2319
 
 
The original post: /r/datahoarder by /u/Slammernanners on 2025-04-03 03:00:01.

I need to search the big 2009 GC archive dump semantically for sites about a specific kind of music. I've tried Google site search for geocities.ws, but that feels a little jank. Is there an alternative before I go reinventing the search engine for this HUGE dataset?

2320
 
 
The original post: /r/datahoarder by /u/DearPlankton on 2025-04-03 22:06:45.

Someone is selling a bunch of them used for about $9-10/TB and offers 30 day warranty. I've been following them daily and their listing has been up for quite a while.

Should I bite? Should I ask them for anything before purchasing?

edit: Got it, not biting

2321
 
 
The original post: /r/datahoarder by /u/duchuy613 on 2025-04-03 21:52:12.

I'm currently using gen3x4 board, but I wanna get a 1TB gen4 SSD for the future gen4 board. The current best options I have (in my opinion) are:

  • Kioxia Exceria Plus G3: $53.5
  • WD Blue NS580: $54
  • Kingston NV3: $58
  • WD Black NS770: $64
  • Samsung 990 EVO: $67.5
  • WD Black SN850X: $77

I'm on a budget, so I'm looking closer at the Kioxia and the NS580. Are the more expensive options just marginally better? Or are they better by a large margin that justify the price difference? Alternative recommendations are welcomed too.

Edit: I mostly use the PC for gaming, but I do some modding so files are being moved around, most of them small in size.

2322
 
 
The original post: /r/datahoarder by /u/Gullible_Eagle4280 on 2025-04-03 21:49:14.

I've been wanting to upgrade to an N5 case for a while but shipping charges have been prohibitive. The shipping was as much or more than the case itself! (I should also add I am in Mexico) But today I was looking around Ali and just searched N5 NAS case and there are a ton of sellers selling them under various other brand names, many of which have FREE SHIPPING! It says "Shipped from the United States" in the listing I purchased mine from. It's listed as a "Top N5 NAS Case. So if you're in the U.S. (or Mexico) and shipping costs have been too much, hopefully this will help you.

https://preview.redd.it/ojga353pvose1.png?width=1690&format=png&auto=webp&s=72544c7bd699a69f21d06df774095376379bd575

2323
 
 
The original post: /r/datahoarder by /u/chubbyassasin123 on 2025-04-03 21:21:19.
2324
 
 
The original post: /r/datahoarder by /u/Infinate_ on 2025-04-03 21:17:02.

Hey everyone Hope this is okay to ask! I’m currently looking for a new 2TB External hard-drive. My current one broke. Thankfully it works still but barely hanging on by a thread and I need a new one to store my thesis work and other things.

I’m not knowledgeable on this all I know is I would like at least 2 TB storage for the harddrive and was told to look at one with USB-C connection Other than that I’m unsure about what else I should look for I currently have a 2TB sea gate and that’s all I know.

Anyone have any recommendations?

2325
 
 
The original post: /r/datahoarder by /u/joshd523 on 2025-04-03 21:06:48.

I’m working on an art piece and need a text file with the entire speech, doesn’t matter if there are minor spelling mistakes throughout. I used Jdownloader for the live stream, how do I get the text though?

view more: ‹ prev next ›