It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
3201
 
 
The original post: /r/datahoarder by /u/youcancallmejim on 2025-02-22 17:39:06.

Wife and I both like Mega. I have a free account but I want to get rid of my NAS and I think moving my files to mega makes sense as a replacement. I like photography as a hobby and it would be nice for that too. AFAIK there is no family plan. I was thinking keep my account free and she shares a dedicated folder with me where I keep all my stuff. Then we have 1 paid account and 1 free. What are the downsides to this plan? We could pay for 2 plans but it seems like throwing away money. Thanks!

3202
 
 
The original post: /r/datahoarder by /u/brian-the-porpoise on 2025-02-22 17:00:20.

HI all

So my trusted Synology DS214 play just celebrated 10 years this month! While it is still working properly, the 2 bay HDDs are filling up, and another annoyances stack up as well.

I have been reading posts on this and other subs for a while now, but the more I read the more I question what I would really need. While I am a software dev, and know just enough about hardware to build my own, I absolutely don't like spending my time on maintenance.

My use case is fairly simple:

  • backup for my phone files
  • serve media files to my PC (mostly docs and pics) and my jellyfin server (on an N100 beelink)
  • Probably don't need a fancy RAID setup. Important files are backed up daily, and losing my media library would suck, but it wouldnt be the end of the world.
  • Noise is not a big concern as the NAS sits in the pantry
  • Low power consumption
  • Would like WOL/auto-suspend

I would allow myself a budget of maybe 500 USD / EUR. But I'd really rather spend as little as possible and put more of this into other hobbies.

Given the above, what would you consider the best way to go?

  • Keep the DS214 and just upgrade the HDDs?
  • Use my current XPS 8940 and convert it to a NAS (would give me an excuse to finally build myself a new rig)
  • Building a custom NAS (I have seen the NAS-Killer guides)
  • Getting an OTS NAS like Synology or QNAP from ebay
  • Perhaps RPi + external HDD + OMV?
  • Something else? (Like using the Beelink as the NAS as well, etc.)

Appreciate the help!

3203
 
 
The original post: /r/datahoarder by /u/CLMCLM on 2025-02-22 16:56:26.

I’ve tried a few of the suggestions like yt.dl and some of the command line stuff but I’m a PC / python noob so couldn’t get it to work…

Are there any applications or ways to download a subreddits top posts? Or can anyone help with instructions for using python methods please :)

Thanks

3204
 
 
The original post: /r/datahoarder by /u/NCSGeek on 2025-02-22 16:53:25.

Hello. I'm looking for a temporary storage solution to hold my media until I can afford a NAS (Several years later. At least 4-5 years)

So far, I own this 3.5" HDD Enclosure, so I was thinking of buying a hard drive to put it in and use that as my temporary storage. For that, I was thinking about buying this drive as it's a great price for the storage size and I hear that Seagate Recertified is pretty good.

So my questions are:

  • Is my current idea a good one? (For the price / How reliable it'd be) - Can I trust this to keep my data safe for the holdover period (4-5 years est) until I get a NAS? I'd be keeping that drive in the enclosure the entire time.
  • If not, why and what recommendations would you have instead?

Any/all feedback or recommendations is very appreciated. Thank you!

3205
 
 
The original post: /r/datahoarder by /u/stlalphanerd on 2025-02-22 16:44:58.

I bought a 16-slot LTO-4 changer a while ago and had gone back and forth on whether or not I wanted to use a proprietary software to do my backups.

I decided that I didn't - so I had to figure out how to get things to work reliably for my archives. Specifically I wanted to use the LTO drive to create cold offline and offsite copy of critical media. Please assume for the remainder of this post that I have ascribed to whatever onsite/offsite dogma that you like and value, and that you and I agree 100% on all my calculations/valuations of cost/value/time in regards to this endeavor.

I have a 75tb Synology NAS and a Proxmox server thats running all my household stuff, as well as hosting the Ubuntu instance that my LTO drive is connected to. I dont have enough local disk space on my server to host the data I want to backup locally, I need to take it straight from the NAS.

Sooo...I started with trying to stream tar across the network. like this:

serverwithtapedrive# ssh user@nas.device.local "tar --tape-length=810000000 -cMf - '/volume1/FolderToBackup'" /dev/nst0

Straight tar streaming across the network.

If you have only big files - (mostly) no problem. Small files (images in my case) wreak havoc on the throughput. This quickly made it impossible for me to feed the LTO drive fast enough and was a failboat.

The interwebs told me that I could use mbuffer to create a buffer in memory that I could write to on the tape server from the remote. This sounded great so I tried it like this:

serverwithtapedrive# ssh user@nas.device.local "tar --tape-length=810000000 -cMf - --blocking-factor=512 '/volume1/FolderToBackup'" | mbuffer -m 4G -P 80 -s 512k -t -o /dev/nst1

And this seemed good-ish. This creates a buffer in memory on the server (4gig in this case) and when it hits 80% (-P 80) it starts dequeuing data (the tar) into the specified tape device. And this definitely smoothed out the network transport - but I still couldnt keep up with the write speed of the tape - even with a 50 or 100GB queue. Id fill it up to 80%, it would start to dequeue at 100-120MB/sec (LTO-4's throughput) and empty the queue, putting me back into underrun. Damnit.

So after quite a bit of machination - I ended up splitting the operation into server (where the tape drive is) and client (where the data is that I want to backup). And here's what it looks like:

First on the server - make sure you have a tape loaded into your device (mine is /dev/nst1) and execute the following:

server# mbuffer -I 8000 -m 200G -P 80 -s 512k | pv -L 40M > /dev/null 2>&1 | dd of=/dev/nst1 bs=512k iflag=fullblock status=none

This sets up mbuffer to listen on port 8000/tcp, creates a 200G memory buffer (I have alot of ram, just not alot of disk space - you dont need 200GB, experiment with sizes to see what works for your network), dequeues when the buffer hits 80%, uses a 512k blocksize THEN toss the data into pv - where we limit the dequeue to ~40Megabytes/sec (this is ~10megabytes/sec above the lower bound of safety for LTO-4 - this is how I ended up solving the buffer underrun) then throw the data to dd to block if up and chuck it into the tape drive.

Then on the client:

client# tar -cMf - --tape-length=1647820800 /volume1/FolderToBackup 2>/dev/null | pv -s 3622336905216 | nc tape.server.ip.address 8000

Tar up (specifying the length of my tape as 790G ) , pipe it to over to pv where I told it my total backup size (3.3TBish) so it will give me a rough eta and progress bar, and then pipe it to nc to shoot it over the network to the server.

And...? It works. I have rock solid queue depth and dequeue on the server, and I can backup effectively, my large sets to my tape changer.

I wrote this wall of text hoping that it might save someone else from spending as much time as I did trying to sort out how to make a cheap old perfectly good tape drive do work over the network.

3206
 
 
The original post: /r/datahoarder by /u/Warbly-Luxe on 2025-02-22 16:03:44.

I don't know if this is the correct sub for this, but it somewhat fits and I figure the people on this sub would have a few ideas.

Basically, I am trying to download a batch of class videos from my course website to be able to get access to the subtitles to read as a transcript either printed out or in a separate application where I can take notes. The embed layout is below, but it does not look like it's embedded from a popular site like YouTube or Vimeo.

https://preview.redd.it/pa60tbm5qpke1.jpg?width=2378&format=pjpg&auto=webp&s=0b6863145fe0d534310d5ef5ba545c5bfe04c87d

The website is built in Instructure Canvas, as far as I can tell. I've reached out to see if my professor has transcripts he can provide anyway, as I am trying to get the transcripts as a disability accommodation, but he's been little help in that regard and I've not heard back from him on this question I am asking this sub now. The subtitles are also auto-generated so a transcript might not even exist, only the slides.

Once I have the subtitles, I can open them in Subtitle Edit to format as needed before outputing it to a text document or the like. But yt-dlp, wget, and curl have not proved fruitful in downloading the videos and subtitles, and there is no option to 'save as'. Additionally, there is like 10 videos per week all 10 to 15 minutes, which is why the transcript button in the embed is less than ideal when it covers part of the video and there is no easy way to copy-past that text.

So, any ideas?

3207
 
 
The original post: /r/datahoarder by /u/caamt13 on 2025-02-22 15:46:18.

I've been going through all the guides that are allegedly up to date, but none of them seem to match up with the links that I'm facing/Vimeo must have recently changed how they serve private content.

Has anyone done this very recently and can provide a method that works?

3208
 
 
The original post: /r/datahoarder by /u/TheeEmperor on 2025-02-22 13:47:20.

Hello! I'd like to download the full 100GB .zim version of the January 2020 archive for use in kiwix but I cannot source it.

3209
 
 
The original post: /r/datahoarder by /u/d2racing911 on 2025-02-22 13:44:53.

Hi everyone, I would like to know how often do you backup your OS when you own and use a NAS daily ?

I don't save any documents stuff on my PC, I always use a shared folder from my NAS.

Do you backup your Windows Install once a week, daily or every months ?

I'm using Macrium Reflect V8 Free and right now I use this backup schedule :

  • 1 Diff every friday

  • 1 Full the first friday of the month

  • 12 weeks retention for the full

  • 4 weeks retention for the diff

  • I don't use incremental because it's a paid feature.

I run a Clonezilla every month just in case too.

Thanks for your comments

3210
3211
 
 
The original post: /r/datahoarder by /u/musicalstuffhitter on 2025-02-22 12:10:01.

I’m looking for a service to automatically backup my data from OneDrive and other cloud services to my external drive on a regular basis.

Does anyone have any recs? I came across Goodsync, but reviews seem mixed and I’m open to alternatives.

I’ve tried OneDrive’s folder sync on my computer and sync is extremely slow, so that’s not an option.

3212
 
 
The original post: /r/datahoarder by /u/FulanoVandergot on 2025-02-22 11:27:57.

Every 1 month? 6 months? or once a year? I want to avoid undetectable errors on a 9-year-old DDR4 RAM module. What do you think?

3213
 
 
The original post: /r/datahoarder by /u/EdibleSuds on 2025-02-22 11:18:40.

I recently bought a 2tb drive to digitally back up our dvds, cds, etc. on and make more accessible... turns out we exceded that amount, aaand could use more space for more media we wish to aqquire(my parents rewatch shows in their entirety.) Ive counted 15 requests. Our How its made collection has taken up little over 200tb with extras, but I hope this is the largest series we have.

Am I crazy for wanting to buy 10tb external drive to lug out into the living room with my laptop? The current drive will be used for music and podcasts since those have taken up so little space in compaeison.

Im think of Seagate's STKP16000400 16tb model because $250 is my limit I can push past a little.

3214
 
 
The original post: /r/datahoarder by /u/PsychologicalWear261 on 2025-02-22 10:09:45.

Netflix has recently gotten all the wrestlemanias, and the quality is perfect. I would like to get those files as every Wrestlemania torrent ive gotten has vastly inferior quality. Does anyone know of a high quality collection of all Wrestlemanias or a way to rip them straight from netflix. I have tried streamfab but its free trial doesnt let me download them and it costs a fortune to buy it

3215
 
 
The original post: /r/datahoarder by /u/theswedishguy94 on 2025-02-22 08:15:09.

Hey hoarders! I’m a documentary filmmaker who travels constantly, and my Seagate 2.5" USB 3.0 external drive is my lifeline. While I already:

  • Do regular backups (dual drives, 1 x backup on external Seagate, 1 x backup on internal laptop storage),
  • Carry it in a generic hard case,

…I’m paranoid about drops. Restoration costs are insane, so I’d love a cheap secondary layer of protection.

Looking for recommendations for:

  • Budget shockproof cases/sleeves (<$30?),
  • DIY hacks (foam setups? What about silicone or neoprene padding?),
  • Ruggedized enclosures worth migrating into,

I’ve seen silicone sleeves online—any firsthand experiences? Or creative solutions I’m missing?

Thanks in advance! (Bonus points if it’s lightweight/compact)

3216
 
 
The original post: /r/datahoarder by /u/RexicTheKing on 2025-02-22 07:16:39.

I just mean a simple exe to put the specific url in to download all the pictures in a post at full size. I don't know python or any of those dl things.

3217
 
 
The original post: /r/datahoarder by /u/invDave on 2025-02-22 07:04:25.

As in the title - say you are thinking of buying the Samsung T9 4TB. This seems like a reliable and generally speedy (up to 2000MB/s) external SSD.

But for not much more I can get a Samsung EVO 990 4TB (for example) + 40Gbps nvme enclosure that'll run much faster (up to 5000MB/s) with an active fan for cooler and more consistent fast copying of very large data.

For the number:

T9 4TB = 295$ 990 Evo Plus 4TB = 270 $ Ugreen CM642 (ali express) = 60$

For an extra 35$ you get something slightly larger, but overall much better.

Or... You can opt to a cheaper SSD such as the Corsair MP600 core XT 4TB for 240$ bringing both options to the same price point. Or I can use a 20GBps enclosure that'll also be faster than the external drive.

So what am I missing? Why would I want to buy an external SSD instead of internal inside a closure which also has the benefit of being ised in a different/extra miniPC in the future in case I want to, as opposed to an external SSD that only has a single function outside the pc? Size and aesthetics only?

I would also think the enclosure gives better ventilation, especially if it has an active fan as above.

3218
 
 
The original post: /r/datahoarder by /u/86IQ on 2025-02-22 05:41:21.

Hi, I started archiving TikTok back in September 2024, and using tokkit which is horrible to use, but out of the dozens of different methods I've tried has worked at scale, and managed to archive 672GB's of videos.

I'm just looking to try and build as large of an archive as I possibly can that can act as a cultural snapshot of TikTok regardless of your views on the platform I’d like to save as much as I can simply for archival purposes so I’m looking for recommendations of what to archive on TikTok

Happy to accept anyone's TikTok data to add to the archive too if you don't have the time or space to download everything. ultimately I want to share the archive so such a large chunk of online media is never lost to history.

3219
 
 
The original post: /r/datahoarder by /u/deadquantumspace on 2025-02-22 05:05:12.

here are the parts that i have and just wondering about the PCIe lane stuff, not entirely sure what to make of the motherboard spec charts since they dont talk about a 9000 series CPU, if i used the main PCIe slot for a gtx 1070 could i also plug the HBA card in and it has the full x8? just want to make sure that i get everything right and im not going to be bottle necking the HBA card, thanks in advanced!

CPU: https://www.newegg.com/amd-ryzen-7-9700x-ryzen-7-9000-series-granite-ridge-socket-am5-processor/p/N82E16819113843?Item=N82E16819113843

Motherboard: https://www.amazon.com/gp/product/B0CV9BTY7B

HBA Card: https://www.amazon.com/gp/product/B0CYGL4VF4

3220
 
 
The original post: /r/datahoarder by /u/yangkee on 2025-02-22 04:01:04.

Does anyone know if a dump of Fanfiction.net stories made pre-2010 exists? The earliest ones I could find here or uploaded to the Internet Archive come from 2012. I'm looking specifically for a couple of stories that were deleted in early 2010, so went un-included in those efforts.

3221
 
 
The original post: /r/datahoarder by /u/Far_School_2178 on 2025-02-22 02:52:36.

Hi!

I am wondering how I should rip about 2000 dvds. I have experience building pc's so I could possibly build a cheap windows pc with a ton of storage and use that, but what software should I use? Also, once I have ripped them all how should I archive them?

Thanks!

3222
 
 
The original post: /r/datahoarder by /u/Eskel5 on 2025-02-22 01:34:57.
3223
 
 
The original post: /r/datahoarder by /u/LaundryMan2008 on 2025-02-21 21:39:01.

Today I don’t actually have a data storage medium rather a very odd adapter which takes a full size Sony Memory Stick and converts it to floppy for people with a Sony Mavica floppy disk (not the ones with the analog video floppies that had 50 fields/frames of video) camera or for people that have a floppy drive but couldn’t afford a proper Sony Memory Stick reader back then, the usual name was FlashPath as they released adapters for 3 other memory cards and a card that was similar to a payment card, the memory cards were SmartMedia and MMC with the 3rd being the chip card, Sony simply rebranded the adapter for their own use and would lock out other FlashPath adapters using other memory cards besides Memory Stick to capitalize on their proprietary format.

It works similarly to an AUX to cassette adapter for your car but with some more electronics in it to be able to convert the signals on the Sony Memory Stick to something a floppy drive can understand and not reject, on the computer side, there is a driver that needs to be installed to be able to use the adapter as the magnetic coils are only in one place so the heads have to be kept in place to prevent seeking and confusion and to be able to understand the very strange signal coming off the floppy drive (don’t know the specifics but it might be a non standard signal rather than the signal produced by a standard magnetic floppy disk).

I haven’t been able to get the adapter working with the only drivers available on Archive.org, using drivers for other adapters is a no go as the driver will try to detect the disk to see if it’s the one it’s expecting to see, I already had issues with installing the drivers as the installer complained about a dual processor system even though I did not have one (presumably because it was dual core (AMD Athlon x64) and the installer treated it as a dual processor system) so I went into the installer files and set the installer to accept dual processor systems by changing the setting from “NO” to “YES” which worked and installed the software, formatting the memory stick worked but trying to use it resulted in an error, watching some videos showed a thing in the bottom right corner to lock the heads but it wasn’t present in my installation for some reason and after every boot would complain about some monitor application not working and closing itself.

Thank you for reading this Friday‘s post and I hope you have a great day, if you have any queries, thoughts about the format, additional information or to point out a mistake, please put them in the comments :)

Link to previous post, post 12 (29th week): My data storage mediums, post 14 (33rd week) : r/DataHoarder

Link to future post, (To be posted)

MSAC-FD2M Sony Memory Stick adapter on my wall with a small part of a regular floppy disk visible

Another angle with more of that normal floppy disk visible

Sony Memory Stick adapter with a Sony Memory Stick inside

The back of the Sony Memory Stick adapter, you can see the spaces for 2 batteries, a hollowed out space for the spindle so it doesn’t interfere with the drive’s operations and the space where the coil is at (the other side is just filled with plastic)

The Sony Memory Stick that I was using and that the adapter takes

3224
 
 
The original post: /r/datahoarder by /u/DashingPOP89 on 2025-02-21 21:23:32.

Im looking to buy my first NAS for my family home.

  • We have a budget of £150-200
  • would be 2 time machine backups
  • iphone backups
  • general photo + file storage
  • Preferably 4 bays, 2 minimum
  • happy to go second hand

Ive looked around for some and the more i look the more i realise i have no idea what im on about anymore.

3225
 
 
The original post: /r/datahoarder by /u/babyjaceismycopilot on 2025-02-21 21:19:55.

I just had a strange, dystopian idea.

If I archive all of my communications, (chat, emails, text messages) in the not so distant future you could create a fairly realistic chatbot with that data. I would think the larger the sample size the more accurate you could make it.

If I want to alstart, how would I go about doing that?

view more: ‹ prev next ›