It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
1926
 
 
The original post: /r/datahoarder by /u/KingCornWallis on 2025-04-19 02:10:33.

I am building a Media Server...Close to a thousand CDs and DVDs that need an accessible digital home. Don't imagine more than 2 people would ever be using the server simultaneously, and even that would be a stretch. More like 1 person a few times a day.

I believe the path forward is something along the lines of Dedicated Full Tower Desktop -> TrueNAS -> JellyFin with MakeMKV + Handbrake. I am competent with technology, hate subscriptions, and revere ownership.

My first question is whether it is advisable to dual-use a TrueNAS setup as a multi-disc ripper station. When I do rip discs...it would be many concurrently for an extended period of time. With all of the tentative drives in the system already I am worried about about the strain on disk R / W or I/O operations...if that's a thing. As for ports, any optical drives would likely go into an internal USB hub...leaving SATA for the drives.

My Second question is one I have already researched and can't seem to find anything on: I already have 10+ 9.5mm (Laptop) Slot Loading optical drives that I would like to use if possible. They all use SATA slim-line connections, and as stated they would probably have adapters to go into a USB hub. 9.5mm drives are uncommon enough on desktops, but these are also Slot Loading...that means no tray comes out, you just slide the disc in (like a PS5). Has anyone found a bracket for using one of these drives in a 5.25" bay? Maybe something 3D-Printed? Even if it was for a normal tray loading ODD that would be fine, but all I have found is this expensive Syba Adapter with other goodies (and it is for 12.7mm)

(I'm aware that without some sort of custom front bracket the slot loading drives will look quite ugly)

If you see any pitfalls or tips for my tentative setup feel free to share as well, thank you.

1927
 
 
The original post: /r/datahoarder by /u/ukralibre on 2025-04-19 02:06:57.

Which is most reliable hdd?

1928
 
 
The original post: /r/datahoarder by /u/e7615fbf on 2025-04-18 23:35:00.

As many of you know, the 3-2-1 backup strategy is the ideal for data protection, but it's not exactly affordable to pull off in practice for large amounts of data. As such, I scaled up my raw onsite storage before I really had a full 3-2-1 in place, so I've been going back and adding reinforcements to my homelab over time and I'm happy to report I'm finally in a reasonably secure place -- though some calculated compromises had to be made. I just wanted to share my setup for anyone trying to find a practical way to add this level of security to their lab.

This is my setup currently:

  • My primary server runs TrueNAS with everything in a mirror configuration. It's just kind of the way my lab grew -- I started with 2x4TB NVME drives, then 2x6TB Toshiba HDD's, and recently 2x24TB Iron Wolf Pros. Mirroring (and RAID) is not a backup strategy, but it does add redundancy.
  • My most valuable / irreplacable data has all been etched onto a stack of M-discs and put in a fire-resistant safe at another location about an hour away. The $ per GB on those is quite high, so I had to prioritize what went on them.
  • For cloud storage, I started using Storj, which integrates very nicely with TrueNAS. It's surprisingly cost-efficient, so I can back up quite a good amount. My entire homelab configuration, and anything that is not easily replaced, is on Storj. In the event of a catastrophic failure, I can recreate most everything from what's on there. This could also, in theory, scale easily with my income. If I'm in a place to afford more, I can just throw everything on Storj, for example. It would take like 10 seconds to set up in TrueNAS.
  • I run Nextcloud and have most of my data synced locally on some of the devices connected to it (e.g. on my laptop, but not my phone). This adds another small redundancy layer for data I use frequently. If my server goes down, I at least still have a copy of the data on my laptop.
  • Finally, I compromised on my Jellyfin media library - it's too big to backup on either Storj or M-discs for now (just from a cost perspective), so I've resigned myself to the fact that I could potentially lose it. This is what sits on the big boi 24TB drive. On one hand, most of it is replaceable, if ya know what I mean. I could pull the manifest from my Jellyfin config (which is backed up on Storj) and gradually re-aquire the majority of the media content. It would be a pain, but it's doable. Also, the nice thing about Iron Wolf Pros is that they come with a data recovery service for the duration of the warranty, so that's another small layer of security that could theoretically come in handy (though it is unlikely).

With this all in place, I've finally cut the cord on any remaining subscription services I had and I'm finally an independent data hoarding homelabber :)

1929
 
 
The original post: /r/datahoarder by /u/Queendevildog on 2025-04-18 23:15:35.

How would you go about saving these databases. Just a regular fed employee hoping someone here has some idea on how to download and store this data

1930
 
 
The original post: /r/datahoarder by /u/djliquidice on 2025-04-18 22:43:39.

I own 2x DS3617xs, a 1821+ and 1521+ and am fed up with Synology's continued push away from consumers.

Saw this today and am considering preordering one of them. Many will consider it too expensive, though I'd rather spend my time working on other creative tasks outside of piecing together yet another computer.

https://youtu.be/eRd0wAVzals

1931
 
 
The original post: /r/datahoarder by /u/Normal_Psychology_73 on 2025-04-18 22:03:16.

I am upgrading the two disks in my HP Microserver. I am considering WD Red Plus or Pro either 6TB or 8TB, Raid 1 config. Reading comments from ~6 years ago the overwhelming feeling is that they are junk. Recent reviews indicate they are fairly good. What is the real truth? What seagate drives are equivalent/better? Baracuda, Ironwolf?

This is for a home NAS without heavy demand, so 5400 RPM is OK, Reliability/longevity is most/more desirable. Thoughts please?

1932
 
 
The original post: /r/datahoarder by /u/Weekly-Bag64 on 2025-04-18 21:18:40.

Did you hear about the lawsuit for Internet archive? 700 million dollars It’s so bogus because that 78 music archive has been up for years and just now this year they’re asking for copyright infringement? Sus from what I know of copyright after a period of time . It becomes royalty free and I’m certain that Internet archive wouldn’t put non-royalty free music Just for this reason on their website so I think it’s just because these record labels wanna make trouble and ruin everything for everyone just because they can

1933
 
 
The original post: /r/datahoarder by /u/Robot11125 on 2025-04-18 20:30:07.
1934
 
 
The original post: /r/datahoarder by /u/ConfusionOk4129 on 2025-04-18 20:27:28.
1935
 
 
The original post: /r/datahoarder by /u/Shanus_Zeeshu on 2025-04-18 19:12:53.

It’s actually wild how the real struggle these days isn’t “finding information” - it’s trying not to drown in it.

You start with one simple question. You open one YouTube tutorial, then one article, then a few PDFs... and before you know it your brain is fried and the problem still isn’t solved.

It’s not even about being smart anymore - it’s about surviving the research rabbit hole long enough to actually do something.

Funny how we have more resources than ever, but finishing things somehow feels harder.

1936
 
 
The original post: /r/datahoarder by /u/Top_Change_2390 on 2025-04-18 19:08:33.

Hi all,

I am a app developer by profession so only have a limited to fair exposure towards hardware (Sorry!)

I may sound a noob in this sub, so apologies for that.

I have nearly a TB of data (maily photos and my code but my code have a git backup so not much worried about those). But the photos are invaluable as I have lost around a decade of photographs due to a service person formating my disk.

Now I am backing up the photos on a Sandisk Extreme SSD and two pen drives, which seems like a bad idea. So thinking of an automatic backup solution which somewhat starts towards the 3-2-1 strategy.

I have an old Thin-client with an i3, and bot worried about speed for most of the redundant backups (only the primary one which family usually access).

Can anyone help to device a strategy - I am not looking for solutions which as expensive like a NAS at this point as I will gradually move towards that but at this pointnoyt able to spend much.

Please don't laugh, but I thought of connecting some 512GB pendrives to the system and do an autobackup daily as a redundancy solution :-) because that also serves my purpose. All I need is a primary disk which we can access daily (we can do that from PC but access through router/wifi is a plus), and a redundant backup solution which ensures my photos aren't lost.

My photos are nearly 400GB only at present but might increase exponentially as we do a lot of travel now.

NB: I have done some search but most posts are above my technical knowledge :-(

Thanks

1937
 
 
The original post: /r/datahoarder by /u/M5DMD on 2025-04-18 17:15:14.

hello i have an old pc with i5-4460 and R9 280 GPU and i'm looking to turn that into a home server/NAS for data storage and media streaming inside the house and remotely

however i noticed that my mini ITX case (node 304) is missing 2 brackets for 3.5'' HDD, meaning that unless fractal design has those brackets avaiable, worst case scenario is that i won't be able to add any more HDD to it.

Would an external SATA enclosure that connects to the pc via USB be sufficient or should I look for a new build since all components are so old anyway?

thank you

1938
 
 
The original post: /r/datahoarder by /u/Vancapone on 2025-04-18 15:48:01.

I’m planning to build my first NAS and was considering the Synology 423+, since I’m mainly going to use it for media (films and music) and storing personal files.

Do you have any recommendations on how to make the most of my budget? Maybe there are better alternatives to Synology—I’d be grateful for any tips!

1939
 
 
The original post: /r/datahoarder by /u/SkidRowCFO on 2025-04-18 14:51:14.

The CFPB just laid off almost 90% of its workforce, and has stated they're reorienting their focus and efforts. Although it's federally mandated they can't delete/remove any data or information, I trust that less than a fox in a henhouse.

I work completely in the personal finance space, so obviously I'm concerned. What's the best way to preserve those resources if it's a lot of PDF and .doc?

1940
 
 
The original post: /r/datahoarder by /u/jimmysqn on 2025-04-18 14:34:36.

Hey everyone! I made a complete tutorial on how to install and use yt-dlp + ffmpeg to download YouTube videos in the highest possible quality.

I tested it myself (on Windows), and it works flawlessly. Hope it helps someone out there :)

━━━━━━━━━━━━━━━━━━━━

📘 Full tutorial in English:

━━━━━━━━━━━━━━━━━━━━

How to download YouTube videos in the best quality? (For real – free and high quality)

🔧 Installing yt-dlp:

  1. Go to https://github.com/yt-dlp/yt-dlp?tab=readme-ov-file or search for "yt-dlp" on Google, go to the GitHub page, find the "Installation" section and choose your system version. Mine was "Windows x64".
  2. Download FFMPEG from https://www.ffmpeg.org/download.html#build-windows and under "Get Packages", choose "Windows". Below, select the "Gyan.dev" build. It will redirect you to another page – choose the latest build named "ffmpeg-git-essentials.7z"
  3. Open the downloaded FFMPEG archive, go to the "bin" folder, and extract only the "ffmpeg.exe" file.
  4. Create a folder named "yt-dlp" and place both the "yt-dlp" file and the "ffmpeg.exe" file inside it. Move this folder to your Local Disk C:

📥 Downloading videos:

  1. Open CMD (Command Prompt)
  2. Type: cd /d C:\yt-dlp
  3. Type: yt-dlp -f bestvideo+bestaudio + your YouTube video linkExample: yt-dlp -f bestvideo+bestaudio [https://youtube.com/yourvideo](https://youtube.com/yourvideo%60)
  4. Your video will be downloaded in the best available quality to your C: drive

💡 If you want to see other formats and resolutions available, use:

yt-dlp -F + your video link (the -F **must be uppercase**!)

Then choose the ID of the video format you want and run:

yt-dlp -f 617+bestaudio + video link (replace "617" with your chosen format ID)

If this helped you, consider upvoting so more people can see it :)

━━━━━━━━━━━━━━━━━━━━

📗 Versão em português (original):

Como baixar vídeos do Youtube com a melhor qualidade? (de verdade e a melhor qualidade grátis)

Instalação do yt-dlp:

1 - https://github.com/yt-dlp/yt-dlp?tab=readme-ov-file ou pesquisar por "yt-dlp" no Google, achar ele no GitHub e ir até a área de "Installation" e escolher sua versão. A minha é "Windows x64" (o programa é código aberto)

2 - Baixe o FFMPEG https://www.ffmpeg.org/download.html#build-windows e em "Get Packages" escolhe o sistema do Windows, e embaixo escolha a Build do Gyan.dev. Após isso, vai abrir outra página do site do Gyan e escolha a última build "ffmpeg-git-essentials.7z"

3 - Abra o arquivo do FFMPEG compactado, abre a pasta "bin" e passe somente o arquivo "ffmpeg.exe" para fora.

4 - Faça uma pasta com o nome "yt-dlp" e coloque o arquivo "yt-dlp" que foi baixado primeiramente junto com o "ffmpeg.exe" dentro da pasta que criou e copie essa pasta com os 2 arquivos dentro para o Disco Local C:

Baixando os vídeos

1 - Abra o CMD (use apenas o CMD)

2 - Coloque o comando "cd /d C:\yt-dlp" (sem as aspas)

3 - Coloque o comando "yt-dlp -f bestvideo+bestaudio + o link do vídeo que você quer baixar" e dê um enter (*Exemplo: yt-dlp -f bestvideo+bestaudio linkdoyoutube)

4 - Seu vídeo será baixado com a melhor qualidade possível na pasta no seu Disco Local C:

Se precisar baixar em outros formatos e ter mais opções de download, é só tirar o "bestvideo+bestaudio" do comando e colocar apenas assim "yt-dlp -F + link do video" o "-F" ali PRECISA SER MAIÚSCULO!!! Após isso, vai aparecer uma lista grande de opções de formatos, resolução e tamanho dos vídeos. Você escolhe o ID do lado esquerdo do qual você quer, e coloca o comando por exemplo "yt-dlp -f 617+bestaudio + linkdoyoutube"

Se isso te ajudou, considere dar um upvote para que mais pessoas possam ver :)

Tutorial feito por u/jimmysqn

1941
 
 
The original post: /r/datahoarder by /u/Loose-Mushroom4153 on 2025-04-18 14:01:50.

I'm hoping someone here can help figure out an issue I'm having transferring files onto LTO tape via the Cannister app. This is on a Mac fyi.

Here is what's happening:

I load a portable SSD with all of the file folders I want to transfer to the LTO. I plug the drive via usb into the computer that is connected to the LTO.

I drag and drop the folders from the drive onto the Archive screen in the Cannister app. It'll say X amount of folders and the total size. Cool

I go ahead with the transfer but when it's finished and shows up in the LTO tracking, the file path is always LTO Tape -> SSD Drive Name Parent Folder -> The Xferred Folders.

How do i stop Cannister from making the transfer drive a Parent Folder for the things I'm transferring? I want the file path to just be LTO Tape - > List of all the Xferred Folders.

It doesn't seem to matter if I drag each folder individually, or all at once, or even just drop in the SSD image from the desktop. I always end up with the transfer drive name being a parent folder containing all of the folders I transferred.

Any thoughts, help, solutions? Please and thank you!

1942
 
 
The original post: /r/datahoarder by /u/WorldEnd2024 on 2025-04-18 11:35:36.
1943
 
 
The original post: /r/datahoarder by /u/OfficialTornadoAlley on 2025-04-18 10:19:01.
1944
 
 
The original post: /r/datahoarder by /u/AnonDresserKiller on 2025-04-18 07:03:42.

Hi! I have a project I’ve been chipping away slowly at that I need some advice on. I’m scanning and digitally recording old books from some historical dog clubs that haven’t been properly recorded yet. I have books, magazines, club meeting recordings, and thousands of varied documents and photos to record.

So far I have been slowly chipping away by scanning all the flat documents and photos at my spouse’s job using their nice office scanner. Recently due to flood they lost their scanners so I’m looking to buy my own. The next part of my project will mostly be small, dense yearbooks. They’re hundreds of pages, generally 250-400, short and dense. For the older ones and any volumes I don’t have copies of I am not willing to destroy them, so a flat scanner won’t work I don’t think.

I’ve been looking at the CZUR line of products. I like the idea that they are portable, but not of them seem like they produce exceptionally high quality images. These yearbooks have half or full page photos on nearly every page. I have nearly 90 to record.

Is building my own rig really my best option? My budget is under $300 so that doesn’t seem feasible right now. I’m autistic and tend to overthink things and never get started. I want to do the best job I can do, within reason.

What should I do? Just buy one of the CZUR scanners within my budget? Keep saving for a digital camera? Something else?

1945
 
 
The original post: /r/datahoarder by /u/Adept_Honeydew7208 on 2025-04-18 02:17:08.

Hey r/DataHoarder,

Sharing a tool I built that might be useful for archiving online media: SocialSaver.

It's a free, open-source desktop GUI (Win/Mac/Linux) sitting on top of yt-dlp and ffmpeg, designed to make downloading content for your archives a bit easier.

Relevant Features:

  • Uses yt-dlp for broad website compatibility.
  • Specifically supports downloading entire playlists and channels for bulk archival.
  • Allows selection of format (MP4, MKV, MP3, FLAC, etc.) and quality to manage storage space.
  • Aims for reliable downloads for offline storage.

Website / Download:

https://socialsaver.site/

Could this fit into your archiving workflow? I'm looking for feedback from users who need to download content reliably, especially in bulk. Let me know your thoughts or suggestions!

1946
 
 
The original post: /r/datahoarder by /u/shiftdelete76 on 2025-04-18 01:56:24.

How can i bulk download my favorited media on booru sites with tags included?

Would be possible to download them in a way where they are at original size and named automically like "char_name, artist" with the rest of the tags simply going inside tags metadata?

Over the years my favorites got overcrowded and i want to do a clean-up but i want to keep some of the stuff.

1947
 
 
The original post: /r/datahoarder by /u/Popular-Ad-9134 on 2025-04-18 13:45:43.

I currently have a DS224+ as mediaserver running Plex with a Seagate 12TB enterprise drive and a WD Ultrastar 520 14TB running RAID0. I am aware of the lack of redundancy that is a personal choice. Recently I attached a external SSD to move my docker containers to since the system was running sluggish during high IO. Now since I am also optimizing media for transcoding I would like to upgrade to a MiniPC.

I am wondering if it's a better choice to sell the NAS and get a DAS like the Terramaster D5-300C so it can connect over USB 3.1 with my MiniPC. The MiniPC will do loads like transcoding when I am away from home or optimizing my libraries by re-encoding audio to AC3. I might need more storage in the future.

1948
 
 
The original post: /r/datahoarder by /u/ARCCSCX on 2025-04-18 12:49:44.

Hi everyone!!!

I'm currently working on a deepfake detection research project, and I’m trying to access the original DFDC dataset from the DeepFake Detection Challenge. Unfortunately, the official Meta links seem to be down or broken.

If anyone has a mirror link, archive of the dataset they’d be willing to share , I’d really appreciate it.!!

Thanks in advance!!!!

Henry

GS in Cloud Computing at Franklin University

Focused on adversarial AI and deepfake forensics

1949
 
 
The original post: /r/datahoarder by /u/Lunam_Dominus on 2025-04-18 11:24:06.

I'm planning to buy two 16 TB Exos drives in the near future for my personal file backup (photos, movies, music, projects and so on).

I'm thinking of using one drive in my PC daily, copying data to it for storage, and syncing it to the second every 4 weeks, which would be in cold storage between those syncs.

Does a setup like this make sense? I'm don't care if I lose 4 weeks of data - I mainly want the old files to survive.

1950
 
 
The original post: /r/datahoarder by /u/JwustGiveMeAName on 2025-04-18 10:43:44.
view more: ‹ prev next ›