It's A Digital Disease!

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
3176
 
 
The original post: /r/datahoarder by /u/Perseus-Lynx on 2025-02-23 15:27:19.

I was considering a hypothetical scenario in which I would self-host a large-scale library of books. The goal is to see how many books I can store with "just" $1000. One side of the problem is text compression of the books; the other is storage capacity.

It would require external drives of some sort. I assume that HDDs are the cheapest? However, I'm not sure which brand or capacity would be the most economical.
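
One way to frame the estimate is a quick back-of-the-envelope calculation. This is only a sketch: the price per terabyte and the average compressed book size used below are hypothetical placeholders, not figures from the post, and real numbers will vary with the drives and the compression chosen.

```python
def books_for_budget(budget_usd: float, usd_per_tb: float, avg_book_mb: float) -> int:
    """Estimate how many books fit on the storage a given budget can buy."""
    if usd_per_tb <= 0 or avg_book_mb <= 0:
        raise ValueError("price per TB and book size must be positive")
    capacity_tb = budget_usd / usd_per_tb
    capacity_mb = capacity_tb * 1_000_000  # decimal units: 1 TB = 1,000,000 MB
    return int(capacity_mb // avg_book_mb)


# Hypothetical inputs: 20 USD per TB and 1 MB per compressed book.
print(books_for_budget(1000, 20, 1.0))
```

Swapping in a real price per terabyte and a measured average file size for a given compression format gives a comparable figure for each drive option.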

3177
 
 
The original post: /r/datahoarder by /u/m4d40 on 2025-02-23 14:00:34.

Hi, I have a lot of large txt, csv, and sql (dump) files and wondered what the best way is to organize them and make them more searchable.

First I thought about pushing everything into a NoSQL database, but it would be over 1TB, which I think would be overkill to ever try to load and run queries against.

My next thought was to search for common IDs or fields and create my own tree structure of files, with an index-like file for each one holding references to the big files where the detailed data about that ID/field is stored. If I want detailed information, another script could then go to the specific files and lines and grep/collect it.

(I also thought about Elasticsearch, Apache Solr, or something similar, but I have no knowledge in this area yet.)
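
The index idea described above could look roughly like the following. It is a minimal sketch under stated assumptions: the files are UTF-8 CSVs with a header row, no quoted field contains a newline, and the function names `build_index` and `fetch_rows` are made up for illustration. The index stores byte offsets instead of line numbers so a lookup can seek directly to the row.

```python
import csv
from collections import defaultdict


def build_index(paths, id_field):
    """Map each id value to a list of (file path, byte offset) pairs."""
    index = defaultdict(list)
    for path in paths:
        # Binary mode keeps tell()/seek() offsets exact.
        with open(path, "rb") as fh:
            header = next(csv.reader([fh.readline().decode("utf-8")]))
            col = header.index(id_field)
            while True:
                offset = fh.tell()
                line = fh.readline()
                if not line:
                    break
                row = next(csv.reader([line.decode("utf-8")]), None)
                if row and len(row) > col:
                    index[row[col]].append((str(path), offset))
    return dict(index)


def fetch_rows(index, key):
    """Seek to each recorded offset and return the raw lines for one id."""
    rows = []
    for path, offset in index.get(key, []):
        with open(path, "rb") as fh:
            fh.seek(offset)
            rows.append(fh.readline().decode("utf-8").rstrip("\r\n"))
    return rows
```

For a dataset over 1TB the in-memory dictionary would likely need to be persisted, for example in SQLite, but the offset-based lookup stays the same.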

3178
 
 
The original post: /r/datahoarder by /u/skaertus on 2025-02-23 13:38:17.

I just bought an Orico DS500-C3 for my home setup, to hold my backup files and my Steam games. I wonder which HDDs I should buy, and whether it makes any difference.

Seagate Barracuda, Exos, Skyhawk, Ironwolf?

WD Gold, Red, Purple, Blue?

Does it really matter considering the 5 Gb/s speed of the DAS system? Should I just get the cheaper ones? Or does it make a difference?

Thanks for the help.

3179
 
 
The original post: /r/datahoarder by /u/MasterIntegrator on 2025-02-23 13:14:36.

Long-time home labber here. What I am seeing in the erasure of freely available knowledge greatly disturbs me. I effectively grew up in the public library, going there daily (not a great childhood), so it angers me to see the erosion of access to ideas and thoughts… being cheered on while liberties are being crushed by laws.

What are some ways and means to help preserve this information so democracy of thought can be preserved?

For the first time ever, people are asking me worrying questions along the lines of “can you help me with x”, where x is a privacy, security, or personal item, etc.

Torrents? Downloadable wiki? Meshtastic net? What tool is used to copy down sites? To preserve them?

I already have a pretty large infra at home and can run anything needed, with Proxmox as the VE.

3180
 
 
The original post: /r/datahoarder by /u/BriefProject on 2025-02-23 11:15:19.

This website contained a lot of interesting materials (e.g. design guidelines for Symbian, MeeGo, Windows Phone). Thank you.

3181
 
 
The original post: /r/datahoarder by /u/zozurr on 2025-02-23 10:32:14.

Hello, I have a Debian server without a GUI and I want to download some movies from tezfiles. Unfortunately wget doesn't work, and neither does lynx. Any suggestions? Thanks

3182
 
 
The original post: /r/datahoarder by /u/WorriedBlock2505 on 2025-02-23 07:36:42.

ChatGPT says it's not safe because ZFS prefers writing whole new blocks to modifying existing ones. ChatGPT also says these flags will cause more fragmentation on a ZFS disk and increase storage usage if I have snapshots enabled (which I do).

3183
 
 
The original post: /r/datahoarder by /u/azimuth79b on 2025-02-23 07:35:04.

I want to write a news aggregator using AI to counteract flooding the zone. Any recommendations? The cheaper the better ;)

3184
 
 
The original post: /r/datahoarder by /u/KenReid on 2025-02-23 00:43:53.

Hi all,

I'm really disappointed. I got this HDD 4 months ago (the warranty has run out at Best Buy) and it has failed on me. EaseUS Partition Master is showing me 100% sector failure. I can't wipe it and can't access anything on it at all, with Linux or Windows tools. So I have a fancy paperweight, I guess.

My next steps - should I consider taking it to some kind of repair place? I imagine it would be worthwhile to fix it rather than get a new one. Or is something like this just unfixable?

Thanks all.

3185
 
 
The original post: /r/datahoarder by /u/nameless0711 on 2025-02-23 05:48:14.

There is a shocking number of strategy guides that clearly exist in abundance in print but are nowhere to be found on the internet, like the BradyGames guide for Dead Rising: easy to find in physical form, but not digitally. The question then is how much of this is Brady doing its best to keep a guide it no longer even produces unattainable digitally, and how much is that no one seems to value archiving material that is still physically attainable... for now.

An example of one in my possession that I couldn't find for the life of me (but now have) is the official Silent Hill 3 strategy guide (BradyGames), which I'd be glad to let anyone use/archive if needed (link). And if you have the Dead Rising strategy guide by BradyGames, please share!

3186
 
 
The original post: /r/datahoarder by /u/Goats_vs_Aliens on 2025-02-23 05:29:45.

We have two desktops where we store all our documents and pictures, with a couple of terabytes of data accumulated over many years, and we are starting to worry about losing it. We don't have a lot of money and would like a backup option of some sort. I looked tonight for a WD My Cloud, but it seems it's no longer sold, and I ran across a Buffalo LinkStation 210. Open to suggestions.

3187
 
 
The original post: /r/datahoarder by /u/bigredsun on 2025-02-23 03:46:51.

I was backing up my phone, a Motorola Edge 40 (which is terribly slow for USB transfers for some reason), to a spare 2TB WD Blue HDD. It seemed odd that the drive was stuck at 100% disk usage, but I thought maybe that was just because there were a lot of files (30GB). After a while I noticed in Task Manager that the transfer rate was 0 while usage was still at 100%, so I tried to cancel the process with no luck, rebooted, and then the drive started making the click of death.

Could the transfer itself have broken the drive?

3188
 
 
The original post: /r/datahoarder by /u/-wildcat on 2025-02-23 02:43:36.

I wrote this script for my own personal use but decided to put it up on my website and share it with the community. I have written a thorough article explaining how the script works and how to run it. Unlike some scripts that only do a single page, this script will loop through all the pages of your library and download every available book.

It has been tested on both Windows and macOS. It downloaded my library of almost 1,000 books without issue. It should work fine on Linux, but it hasn't been tested. I have only tested it on the Amazon.com US site as that is all I have access to. It may work on other Amazon sites, but I imagine there are probably changes that would break it.

I would love feedback on both the article instructions as well as the script.

Some of the script's features:

  • Automatically Downloads All Books: Loops through each page of your Kindle content library and downloads each book.
  • Fast: Processes around 25 books every 90 seconds.
  • Detailed Real-Time Output: The script provides clear, real-time output in the terminal and a log file, allowing you to follow along with each step, see progress updates, and identify any issues as they occur.
  • Detailed Logs: Tracks downloads, skipped books, and errors, saving all data to log files.
  • Custom Page Ranges: Use --start and --end arguments to define which pages to process.
  • Stop Any Time: Press Ctrl+C during execution to stop the script and receive a summary.
  • Device Selection: Pick your preferred Kindle device for downloads through an easy, one-time pop-up.
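
For a sense of scale, the stated rate of around 25 books every 90 seconds can be turned into a rough run-time estimate. This is only a sketch: it assumes the rate stays steady and that books are processed in batches of 25, which is an illustrative simplification and not a description of how the script works internally.

```python
import math


def estimated_minutes(total_books: int, books_per_batch: int = 25,
                      seconds_per_batch: float = 90.0) -> float:
    """Rough total run time in minutes at a steady batch rate."""
    batches = math.ceil(total_books / books_per_batch)
    return batches * seconds_per_batch / 60


# A library of 1,000 books: 40 batches * 90 s = 60.0 minutes.
print(estimated_minutes(1000))
```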

If you're interested in trying it out, please read through the page below and download the script. I will try to help here with questions and issues as I can. Please share your feedback and share the link with anyone you know who might be interested.

https://defragg.com/bulk-download-kindle-books/

https://preview.redd.it/sfjahv2gyske1.jpg?width=1200&format=pjpg&auto=webp&s=2db811a073fc0f3e91ab9ee0db68d299da392b74

ETA: I have confirmation that the script works on amazon.in just by changing the URL in two places in the script from amazon.com to amazon.in. Thanks /u/g3ppi

3189
 
 
The original post: /r/datahoarder by /u/BackToPlebbit69 on 2025-02-23 02:36:22.

Hey there,

I got an old desktop I use for Linux Mint with two drives. One for OS, the other for data (8TB).

I want some way to do an rsync backup, and I plan to buy a separate 8TB drive to mirror the data drive.

Question is, should I opt for something like an old Raspberry Pi 3 or 4, and use a SATA to USB converter for a regular 3.5 inch 8TB Seagate or Western Digital Drive along with some 3D printed enclosure?

Or should I get something like an HP thin client for the same thing since it's going to be a USB type JBOD setup?

EDIT: I forgot I have an old mini PC running Kubuntu on it too so maybe I should just opt for a USB enclosure JBOD setup?

I'm curious for recommendations on workflow, on which small PC or device to get, and on the types of drives, since I have even debated just getting an external self-powered 8TB drive for this.

Before you ask, I just want a pair of drives in this scenario, since I already have an old external with the most sensitive stuff on it for long-term storage, so there's no need for a 3-2-1 setup.

Thanks!
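
Whichever machine ends up hosting the second drive, the rsync mirror itself could be wrapped in a small helper like the one below. It is a sketch, not a recommendation: the mount points in the usage note are hypothetical, and `--delete` removes files on the destination that no longer exist on the source, so the helper defaults to rsync's `--dry-run` mode.

```python
import subprocess


def rsync_mirror_command(source: str, dest: str, dry_run: bool = True) -> list[str]:
    """Build an rsync command that makes dest a mirror of source."""
    cmd = ["rsync", "-a", "--delete"]
    if dry_run:
        cmd.append("--dry-run")  # report what would change without touching files
    # A trailing slash on the source copies its contents, not the directory itself.
    cmd += [source.rstrip("/") + "/", dest]
    return cmd


def run_mirror(source: str, dest: str, dry_run: bool = True) -> None:
    """Run the mirror; raises CalledProcessError if rsync exits non-zero."""
    subprocess.run(rsync_mirror_command(source, dest, dry_run), check=True)
```

For example, `run_mirror("/mnt/data", "/mnt/backup")` would preview a mirror between two hypothetical mount points, and passing `dry_run=False` would perform it.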

3190
 
 
The original post: /r/datahoarder by /u/Neccros on 2025-02-23 02:05:02.

Anyone use this program to archive YouTube videos? I use it almost daily. I had an issue with it failing to analyze anything on all my PCs, yet after a week they pushed an update that fixed it. Not a software version update, but something like a database update.

So what I am asking is: how does this program "analyze" the URL, and what makes it work or fail? A friend was swearing up and down it was a DNS issue, since I could grab videos through my VPN.

Any insight would be appreciated. Just trying to learn how it works in general.

3191
 
 
The original post: /r/datahoarder by /u/fundementalpumpkin on 2025-02-23 01:05:24.

Edit: And they're gone. Like I said below, there were only 6 left when I posted. If you got one, grats.

https://www.walmart.com/ip/WD-20TB-Elements-Desktop-External-Hard-Drive-WDBWLG0200HBK-NESN/1049105244?classType=VARIANT&from=%2Fsearch

This may be a fairly normal price or a repost, I don't know. I had an 8TB drive fail, and as I've been replacing drives I've swapped them out for shucked 20TB WD drives. I had an extra one saved for a hard drive failure, but now I need one for the shelf.

I haven't seen a sale since Black Friday, so I thought I was screwed or would end up buying used on eBay or something (I'm anti-Exos due to their failure rates at Backblaze, warranted or not). Then I was reading an old post on this subreddit about Walmart using a different part number to make it hard to compare prices. Lo and behold, I checked and it's on sale, so I grabbed a couple for stock.

I've seen them for $250, but this was close enough for me. Says there's only 6 left, so ymmv.

3192
 
 
The original post: /r/datahoarder by /u/skaertus on 2025-02-22 23:25:46.

I am searching for a DAS solution to replace my 14 TB external HDD. I am trying to decide between these two options (please note that, as I live in Brazil, options here are very limited):

  • Orico DS500-C3: It supports up to 5 HDDs (90 TB total). It is easy to insert the HDDs, with no need for screws (which is a big plus for me). However, it does not support RAID. It has a transfer speed of 5 Gb/s. It is cheaper.
  • Yottamaster Y-Focus FS5C3: It supports up to 5 HDDs (90 TB total). Installing the HDDs requires screws. It supports RAID and has a transfer speed of 10 Gb/s. Also, it looks more robust and seems to have better cooling. But it is more expensive.

Which should I go with? Any experience with them? Will I notice the speed difference?

3193
 
 
The original post: /r/datahoarder by /u/Sarnuxe0 on 2025-02-22 22:58:35.

3194
 
 
The original post: /r/datahoarder by /u/YesterdayEven5265 on 2025-02-22 22:24:53.

Original Title: Hey bros, I just got the Seagate Expansion Desktop 20TB but it’s already been powered on 12 times. Also, some of the values are already at 100, which I don’t think is a good thing. Could any brother tell me if the drive is faulty or used, etc.? Thanks in advance <3

3195
 
 
The original post: /r/datahoarder by /u/M_Essergany on 2025-02-22 21:56:45.

I recently got a new Seagate Enterprise Capacity 3.5 HDD v7 SATA 12TB ST12000NM0127 with a manufacturing date of 30 Mar 2018 for 9000 EGP (178 USD) in Egypt. Does that mean the hard drive is refurbished? Should I use it, or should I try to sell it? I'm a little worried about the old manufacture date.

I'm using it to store video files and game installation files for my home PC.

What do you guys think?

P.S. That price gives more TB/$ than the other hard disks available to buy here,

for example: Seagate Exos Enterprise 10 TB (model has NM01/d in it, I think, the image is not clear) date of manufacturing 28 Sep 2024 is 14000 EGP (277 USD)

and Western Digital Ultrastar 8TB HUS728T8TALE6L4 date of manufacturing 22 Sep 2024 is 12600 EGP (249 USD)

So, could you please advise me on what to do?
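
The price comparison above can be made explicit with a quick cost-per-terabyte calculation. The capacities and USD prices are the ones quoted in the post; the short labels and the helper name `usd_per_tb` are just for illustration.

```python
# (label, capacity in TB, price in USD) as quoted in the post
drives = [
    ("Seagate Enterprise Capacity 12TB (made 2018)", 12, 178),
    ("Seagate Exos Enterprise 10TB (made 2024)", 10, 277),
    ("WD Ultrastar 8TB (made 2024)", 8, 249),
]


def usd_per_tb(capacity_tb: float, price_usd: float) -> float:
    """Cost of one terabyte for a given drive."""
    return price_usd / capacity_tb


# Cheapest per terabyte first.
ranked = sorted(drives, key=lambda d: usd_per_tb(d[1], d[2]))
for name, tb, usd in ranked:
    print(f"{name}: {usd_per_tb(tb, usd):.2f} USD/TB")
```

On these figures the 2018 drive costs roughly half as much per terabyte as either newer option, which is the trade-off against its age.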

3196
 
 
The original post: /r/datahoarder by /u/BryanNJ7 on 2025-02-22 21:07:00.

Is there any program that will automatically sync/back up my external drive to another drive as soon as I plug it in? I want both to always have the same files, folder structure, etc. If something is moved or deleted on the external, I want the same to happen on the other.
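
The one-way mirroring behaviour described here can be sketched in a few lines. This is not an existing program, just an illustration of the logic: it assumes plain files and directories (no symlinks), the function name `mirror` is made up, and it does not cover the "run when the drive is plugged in" part, which depends on the operating system. Because it deletes files from the destination, it is best tried on scratch folders first.

```python
import filecmp
import shutil
from pathlib import Path


def mirror(source: Path, dest: Path) -> None:
    """Make dest match source: copy new or changed files, delete extras."""
    dest.mkdir(parents=True, exist_ok=True)
    source_names = {p.name for p in source.iterdir()}

    # Remove anything in dest that no longer exists in source.
    for item in dest.iterdir():
        if item.name not in source_names:
            if item.is_dir():
                shutil.rmtree(item)
            else:
                item.unlink()

    for item in source.iterdir():
        target = dest / item.name
        if item.is_dir():
            if target.exists() and not target.is_dir():
                target.unlink()  # a file is in the way of a directory
            mirror(item, target)
        else:
            if target.is_dir():
                shutil.rmtree(target)  # a directory is in the way of a file
            # Compare contents byte for byte and copy only when needed.
            if not target.exists() or not filecmp.cmp(item, target, shallow=False):
                shutil.copy2(item, target)
```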

3197
 
 
The original post: /r/datahoarder by /u/goscott on 2025-02-22 20:16:43.

As most people here have probably already heard, Kindle is removing the ability to download Kindle books to your computer on February 26th. This has prompted some to download their libraries ahead of the shut-off. This is allowed/supported on the Amazon website, but it's an annoying process for people with large libraries because each title must be downloaded manually via a series of button clicks.

For anybody interested in downloading their library more easily, I've written a browser script that simulates all those button clicks for you. If you already have TamperMonkey installed in your browser it can be installed with a single click, but full instructions on how to install and use it can be found here, alongside the actual code for anybody interested.

The script does not do anything sketchy or violate any Amazon policies. It's literally just clicking all the dropdowns/buttons/etc. that you'd have to click if you were downloading everything by hand.

If you have any questions or run into any issues, let me know! I've tested this in Chrome on both Mac and Windows, but there's always a chance of a bug somewhere.

Piracy Note: This is not piracy, nor is it encouraging piracy. This is merely a way to take advantage of an official Kindle feature before it's turned off.

tl;dr: Script install link is here, instructions are here.

3198
 
 
The original post: /r/datahoarder by /u/RatzzFatzz on 2025-02-22 19:38:42.

Hello fellow hoarders,

I've been fighting with a big collection of video files that do not have any uniform default track selection, and I was sick of always changing tracks at the beginning of a movie or episode. Updating them manually was never an option. So I developed a tool that changes the default audio and subtitle tracks of Matroska (.mkv) files. It uses mkvpropedit to change only the metadata of the files, which does not require rewriting the whole file.

I recently released version 4, making some improvements under the hood. It now ships with a Windows installer, a Debian package, and portable archives.

Github repo

release v4

I hope you guys can save some time with it :)
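
For readers curious what such a metadata-only edit looks like, here is a minimal sketch of building an mkvpropedit call that marks one track as default and clears the flag on the others. It is not the tool's actual implementation: the function name and its parameters are hypothetical, and the caller is assumed to already know how many tracks of each kind the file has.

```python
def default_track_command(mkv_path: str, kind: str, track_count: int,
                          default_index: int) -> list[str]:
    """Build an mkvpropedit command that makes one track of a kind the default.

    kind is "a" for audio or "s" for subtitles; indexes are 1-based per kind,
    matching mkvpropedit's track selectors such as track:a1 or track:s2.
    """
    if kind not in ("a", "s"):
        raise ValueError("kind must be 'a' (audio) or 's' (subtitles)")
    if not 1 <= default_index <= track_count:
        raise ValueError("default_index must be between 1 and track_count")
    cmd = ["mkvpropedit", mkv_path]
    for n in range(1, track_count + 1):
        flag = 1 if n == default_index else 0
        cmd += ["--edit", f"track:{kind}{n}", "--set", f"flag-default={flag}"]
    return cmd
```

The resulting list can be passed to `subprocess.run(..., check=True)` on a machine where mkvpropedit is installed.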

3199
 
 
The original post: /r/datahoarder by /u/andxet on 2025-02-22 18:47:04.

Hello!

I want to scan all of my ~8000 family photos. I've seen other threads suggesting scanners that cost about 500€, but I would like to check whether some used scanners I found on the internet are OK for the job.

First question: how are document scanners different from photo scanners? Is the difference the resolution, the color fidelity, or something else? Is it a good idea to use a document scanner instead of a scanner made specifically for photos?

I've found the following used scanners. Can you tell me whether any of them are good for scanning my family photos quickly?

  • Fujitsu ScanSnap S1300

  • Epson WorkForce DS-510

  • Brother ADS-1200

I would avoid flatbeds given the number of photos, but I am open to suggestions.

Thank you!

3200
 
 
The original post: /r/datahoarder by /u/Builder365 on 2025-02-22 18:37:30.