It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
3726
 
 
The original post: /r/datahoarder by /u/BitterEye7213 on 2025-02-07 20:08:45.

Im looking to get out of external hard drive land and into just running internals externally as backup. I heard when you go past 2 gb things get a bit more complex with what supports what but is there any known enclosures that will run this one?

3727
 
 
The original post: /r/datahoarder by /u/DineLifestyle on 2025-02-07 19:44:57.

Meaning the output file format mrimg have u tried to compress more using winrar, 7zip etc? Or maybe even to try and split file into parts and store across different hdds? Right now have like 975Gb file need to get it down to 850gb to fit an HDD or split to parts and store across two hdds

3728
 
 
The original post: /r/datahoarder by /u/Kooky-Bandicoot3104 on 2025-02-07 17:21:48.

Humans be sure to archive the evga fourms, a gold mine regarding information for old gpus and components!

the fourms are in archival mode, soon to be shutdown to reduce business expenses!

3729
 
 
The original post: /r/datahoarder by /u/busymom0 on 2025-02-07 17:07:00.

I have thousands of links to various articles. Imagine you have thousands of bookmarked links. I need to get the published timestamp for all these links.

I have been able to use newspaper4k library to get the publish_date value:

python3.10 -m newspaper --url="https://phys.org/news/2025-02-antarctic-hoff-crab-males-bigger.html" --output-format=json | jq -r '.[].publish_date'

However, this needs to fetch every single article one by one, then parse it to get the publish_date only. I am trying to avoid having to fetch every article, especially because it has the potential of running into bot detection.

For example, the above phys.org website blocks IP addresses from Digital Ocean servers.

Plus I really don't care about the content of the links, I just need the date.

Another option I thought of doing a google search for each url, then grabbing the date from the top result. While not perfect, it could work.

Another option I thought was getting the RSS feeds from the websites, then searching for my link in it and then grabbing the date. But this won't work because RSS feeds are often only of recent content and older links won't be listed in it.

Is there any other creative way to do this?

EDIT: I am currently looking at if I can use bing or google search API to maybe grab the date.

3730
 
 
The original post: /r/datahoarder by /u/Neverfall94 on 2025-02-07 17:02:44.

I have an emby server and windows 11 VM via unraid. I am at max capacity on drives via slots so I want to get a case. I have 1 slot pcie left but it's behind my 3060 that I use for transcoding. I am planning on doing a traditional JBOD server. Will unraid support a egpu so I can free up the slot to increase my data pool drastically? If not suggestions.

Specs: AMD Ryzen 7 5700X3D 8-Core, MSI PRO B550M-VC WiFi ProSeries MSI Gaming GeForce RTX 3060 12GB Crucial P3 Plus 4TB PCle Gen4 3D NAND 1x Seagate (Recertified IronWolf Pro 18TB Enterprise NAS 7x Seagate (Recertified) 12TB IronWolf NAS SATA 9201-8i RAID Controller Card 6G HBA FW P20 9211-8i IT mode EVGA 210-GQ-1000-V1,1000 GQ, 80+ GOLD 1000W Thermaltake Core V71 Tempered Glass

3731
 
 
The original post: /r/datahoarder by /u/David_Mathers on 2025-02-07 16:40:29.

If you search for a video with youtube url on web.archive.org, many pages do not have saved videos. When you save a youtube page, the video is not always saved. I didn't find how to solve this. I know http://web.archive.org/web/2oe_/http://wayback-fakeurl.archive.org/yt/ but how to add new video in it? I searched but didn't find it. Tried to save mp4 file (got from youtube mp4 url extractor) from video but site is blacklisted. I also know about archive.org/details/youtube- and ask in /r/DHExchange.

3732
 
 
The original post: /r/datahoarder by /u/Jayiz00 on 2025-02-07 11:32:39.

Hi, I have a bunch og 3tb SAS drives that i have gotten for free. (WHAT A DEAL!)

Anyway, i only have a old msi motherboard with 16gb ddr3 ram and an old i7 4770 that only have SATA ports.

So my questions is:

  1. If i buy a HBA sas 8087 card, can i i buy breakout cables from 8087 to sas interface on the drives?

  2. Does a sas HDD use the same power adapter as an SATA HDD?

  3. Will the motherboard even be able to read the sas card?

3733
 
 
The original post: /r/datahoarder by /u/KP30499 on 2025-02-07 09:35:54.

Hi there, sorry for formatting and grammar errors, I am on mobile atm. I want to build a NAS and narrowed it down to the UGREEN DXP2800 with 2 Seagate IronWolf NAS 4TB drives and a Patriot P300 256GB SSD for cache. The System would be set up in RAID 1 for redundancy and would be primarily for storing photos I take with my cameras and editing on the go. I mainly shoot uncompressed RAW files on my Sony A7III (about 50MB each) and edit them to JPEG (about 15MB). It would probably also be used to back up a few files from my PC so I don't have to go through the hassle with Onedrive and Google Drive. Since I only do photography as a hobby, but have also a few hundred photos and videos from deceased relatives I want to keep save, I don't know if the DXP2800 is the right one for me. The current size of my photo and video folder on my PC is about 487GB and is backed up on an external SSD. This worked for me for a long time, but since I now changed my job and travel for work (with my camera because of downtimes) I don't want to carry around my SSDs all the time and have to sync it between my devices. I saw a few posts, that recommended Synology, but from what I've seen their two drive system doesn't have a M.2 Slot for expansion and overall worse components. In my research I also found that UGREEN has an all in one app for their NAS products and Synology has a lot of different ones. Do you have experience with this system and any advice on how to set it up properly or any recommendations for better solutions? Should I preferably go to a four bay system or is the two bay one sufficient enough if I want to upgrade the drives at some point? Thanks in advance.

3734
 
 
The original post: /r/datahoarder by /u/Automatic_Beyond2194 on 2025-02-07 03:53:10.

https://serverpartdeals.com/collections/28tb

I’ve bought all my parts for a jellyfin/storage server.

Now all I have left to buy is my first HDDs.

I haven’t yet decided long term if I’m going for Unraid or TrueNas or what.

Is there anything wrong with these drives or the price? If I do end up going NAS and I need to have all the same size is there anything wrong with going with 28TB? I would assume some sizes are less common and some are more common, and it would probably be good to do it with one of the more common sizes no?

3735
 
 
The original post: /r/datahoarder by /u/iguessthiswilldo1 on 2025-02-07 18:34:45.

Sorry to add on to the dog pile of US politics posts, but has anyone saved a copy of NASA and NOAA datasets? According to the Alt National Parks Service, portions of NOAA's site regarding climate data are already going dark.

https://bsky.app/profile/altnps.bsky.social/post/3lhjhp4opts25

3736
 
 
The original post: /r/datahoarder by /u/latenighttrip on 2025-02-07 18:15:49.
3737
 
 
The original post: /r/datahoarder by /u/zzz_zzz on 2025-02-07 18:02:40.

I'm not necessarily equipped to back these up on my own atm, but I imagine these are going to get purged at some point. Tried looking at some of the resources posted on the front page, but see any reference to CDC datasets in particular.

https://wonder.cdc.gov/

https://wisqars.cdc.gov/

3738
 
 
The original post: /r/datahoarder by /u/Duriel- on 2025-02-07 17:50:53.
3739
 
 
The original post: /r/datahoarder by /u/Wormvortex on 2025-02-07 17:29:50.

I currently have a Yottamaster 5 Bay Hard Drive Enclosure however it's throwing up I/O errors which disappear when I put the drives into another closure so looks like it's the issue.

I'm looking to replace it. Is there a good 5/6 bay enclosure I can replace it with. I don't need any NAS capabilities just a bog standard enclosure that's available in the UK

3740
 
 
The original post: /r/datahoarder by /u/Entire_Scholar_5302 on 2025-02-07 17:21:30.

Hi guys I need finde a way how to change Video quality or compress it so size is smaller

So I can dowenload it small size

Bc my wifi Is slow and it's Limited to 150gb

And if I want dowenload some voe series or and they take long to dowenload bc ea is 300mb or 1gb and

There many +1000 if I do all my fav series

Is there a way I can get it small size before dowenloading

Like a extension or a Programm

I cam do also on kali Linux I habe an other Laptop have this on

3741
 
 
The original post: /r/datahoarder by /u/Dragon3488 on 2025-02-07 16:51:57.

The National Archives has been uploading scanned copies of US Army morning reports from 1940-1946 to their catalog (NAID 85713825). These reports are some of the only unit records that describe who was in an army unit at any given time, and are important records in rebuilding WWII veteran's records that were destroyed in the 1973 NPRC fire. I was wondering if there were any current projects on backing this data up to the internet archive, or some similar site? There's quite a bit, however, to my knowledge this is the only (mostly) complete copy available online outside of the NPRC reading room. If there is a project ongoing, how could I help in backing up these important genealogical files?

3742
 
 
The original post: /r/datahoarder by /u/icysandstone on 2025-02-07 16:41:38.

Do you convert them to H.264 or H.265 or ProRes? I know there are corporate reasons, but it's crazy that macOS still does not officially support WebM. Would love to know what you hoarders do, so I can do it too.

3743
 
 
The original post: /r/datahoarder by /u/Pretend-Ad-6453 on 2025-02-07 15:58:09.

I know there’s 20/30 dollar services but I’m not sure how good they are. I’m willing to learn how to do it but I don’t want to run the risk of ruining these tapes because they’re incredibly important. any advice?

3744
 
 
The original post: /r/datahoarder by /u/AshleyAshes1984 on 2025-02-07 13:59:32.
3745
 
 
The original post: /r/datahoarder by /u/shimoheihei2 on 2025-02-07 13:05:07.

A message on their gitlab page indicates that their hosting provider is closing their doors. Freedesktop is the home of many free software projects. https://gitlab.freedesktop.org/explore/groups

3746
 
 
The original post: /r/datahoarder by /u/Random7321 on 2025-02-07 11:46:23.
3747
 
 
The original post: /r/datahoarder by /u/DepartureMurky198 on 2025-02-07 09:45:48.

i’m not tech savvy in the slightest but i’d really like some help. pms open!

i downloaded both of my accounts data but i don’t know how to access it. can i turn the random sequences of number and letter into thing i can actually go into and view? urls? please help 😭

3748
 
 
The original post: /r/datahoarder by /u/Old-Skool-2023 on 2025-02-07 08:47:40.

Hi. I have a collection of tutorials from Lynda and now Linkedin that are simply as is when ripped.

Whats the best way to compress these down to save some space.

Would you use a combo of winrar and 7zip or is simply adjusting settings on one of these sufficient or indeed are there other ways such as a batch file command to deal with the HD in just one go etc.

Thanks for looking. Tips appreciated.

3749
 
 
The original post: /r/datahoarder by /u/PricePerGig on 2025-02-07 08:46:05.

As requested in this sub, we now have language selection for the main home page on PricePerGig.com (the best price comparison / aggregator for digital storage medium on the internet).

Languages add: ES 🇪🇸, DE 🇩🇪 , FR 🇫🇷 - other languages planned for when their marketplaces come online

So far PricePerGig.com has these USPs, but what do you want to make it your 100% go to place?

  1. We have Speed rating in MB/s for all PCIe interface drives (e.g. NVMe drives)
  2. We have now the exact age of the record showing you when the price was picked up
  3. Fast filter - no more Ctrl+F to find what you want, just type in the filter and you're away
  4. Fast Share - click the share link to send your filters / search to yourself
  5. Filter by accurate (over 98% in my tests) Technology (SSD,HDD), Interface (PCIe, SAS, SATA), and Format (less accurate), Internal/External, 3.5", 2.5" that's consistent across all device types. (uses LLMs/AI to figure this out)

Please vote/suggest what you'd like moving forward

  1. just show me all the drives - i don't care i'm downloading 10MB or more of data - I want to select more than one marketplace
  2. use currency conversion and find the actual global cheapest prices / rank all by a single currency
  3. I need CMR/SMR filter!
  4. Get TAPE and other enterprise storage media added, they are not all there
  5. Go scrape other deals websites to make sure this is the only place I need to go (e.g. deals websites)
  6. I'm still waiting for ebay, newegg, an-other, hurry.
3750
 
 
The original post: /r/datahoarder by /u/YuumaTsuchimikado on 2025-02-07 07:23:25.

Just trying to see if anyone knows of sometype of offline list that I can add movies/tv shows and be able to blacklist duplicates? something like tachiyomi but for manual entries(and not manga obviously).

something like a list saying the movie title, what year or something and being able to automatically detect in case I added a duplicate entry or something.

view more: ‹ prev next ›