It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
3351
 
 
The original post: /r/datahoarder by /u/pmigdal on 2025-02-18 02:20:20.
3352
 
 
The original post: /r/datahoarder by /u/WorriedBlock2505 on 2025-02-18 01:59:21.

Weird idea I've been bouncing around recently is to install my OS onto a mirrored zpool made of 4 disks. Once a week with the PC powered off, I would pull a disc to put into storage as a backup, insert a replacement disk, turn the PC back on, and import the new disk. I figure I could have about 8 spare disks that I rotate through. I'd do incremental backups to a USB stick and a cloud provider for my daily backups.

The goal is a backup solution that's as close to plug and play as possible and to have my complete OS with programs and settings ready to go in case of PC failure.

Any adjustments I need to make to this idea, or is it dead in the water?

3353
 
 
The original post: /r/datahoarder by /u/c9898 on 2025-02-18 01:40:19.

This will probably be turned on once every week/month for like a simple backup and then stored away. Is something like this reliable enough?

https://www.amazon.com/dp/B0CRJQ54LW

3354
 
 
The original post: /r/datahoarder by /u/m_a_schuster on 2025-02-18 00:36:10.

I'm trying to make use of a solid Sabrent metal 3.5" SATA enclosure with fan, which features eSATA and USB2 connections. I believe Newegg (Rosewill) sold something similar also with a USB3 version (RX358 series).

The obvious solution would be a USB3-to-eSATA adaptor, of which I have 3 old very ones in the drawer, but none of these properly supports large Enterprise HDDs.

3TB+ disks partitioned via direct SATA, or in a modern USB3 enclosure, are not recognized via these adaptors (they appear raw/unitialized). I am assuming these all date from the XP era wherein advanced format drives underwent reverse sector translation, which modern enclosures/controllers do not do.

Looking on line it seems like vendors have been producing the same adaptors for 10 years or so, That includes the ones I've tried:

VANTEC CB-ESATAU3-6 (VL711 chipset)

BYTECC USB3-ESATA (unknown chipset)

Noname black (sold under various Chinese brands) JM539-based adaptor dongle.

Before I give up the fight I thought I'd ask: are there any modern USB3-eSATA adaptor dongles/cables which do not do reverse sector translation on drives >3TB? Specific links would be appreciated. Thanks!

3355
 
 
The original post: /r/datahoarder by /u/NWSpitfire on 2025-02-18 00:35:14.

Hello everyone,

Sorry if this isn’t the right place to post this.

I have been given quite a lot of old documents and negatives etc and I’m trying to work out how to digitise them.

I am honestly not completely sure at this point exactly what I have because there is quite a lot of it, but most of it should be related to WW2, and maybe some things from afterwards.

Anyway, I thought it might be nice to digitise them and then put them on a website/FTP server so people can look at them (if that is practically/economically viable?).

My question basically is; what is the best way to digitise this stuff?

For A4 documents I guess I could try and make up a stand or something to hold my iPhone with some bright LED’s?

But what about the negatives? How would I go about digitising those. A lot are from handheld cameras, but I also think there are some from photo recce Supermarine Spitfire’s (so Williamson F52 cameras?)

There might be other things too like large maps etc. How could I digitise them without loosing detail?

I’ve never done anything like this so have literally no idea what I am doing. Any advice would be greatly appreciated!

Thanks

3356
 
 
The original post: /r/datahoarder by /u/testaccount123x on 2025-02-18 00:23:12.

I have 10 years worth of files for work that have a specific naming convention of [some text]_[file creation date].pdfand the [some text] part is different for every file, so I can't just search for a specific string and move it, I need to take everything up to the underscore and move it to the end, so that the file name starts with the date it was created instead of the text string.

Is there anything that allows for this kind of logic?

3357
 
 
The original post: /r/datahoarder by /u/Sterbn on 2025-02-17 23:35:36.

Looking to get 8 HDDs and my ATX motherboard in a case. Was looking at the fractal design meshify 2, but they seam to be out of production. The dark rock classico seams ok, but I would prefer something better to build in. Suggestions?

3358
 
 
The original post: /r/datahoarder by /u/Different-Ad7658 on 2025-02-17 23:23:59.

I need some help and advice I moved some data pictures and stuff to my flash drive But it took a while for them to load up when they shouldn't have and my phone is old But it's still useful, but it keeps disconnecting my flash drive.

I have a USB-C flip flash drive I had to move everything, back to my phone because i was afraid of losing stuff when I've been trying to clear up memory.

My problem is when I try to view the I get mountains and! Icon blank pictures I still have the data I just can't view it so how can I view it again and how do I stop my flash drive from disconnecting

3359
 
 
The original post: /r/datahoarder by /u/carriedmeaway on 2025-02-17 22:58:55.

I just happened to end up on a site of the Endowment of the Humanities and wondered if ArchiveTeam or any others had plans to archive them and the Endowment of the Arts ? There’s a lot of good data on critical analysis of US history that would be awful to lose.

3360
 
 
The original post: /r/datahoarder by /u/Serpentarrius on 2025-02-17 21:59:12.

These are not all related to DEI. One includes how EMTs and first responders react to hazardous substance release.

3361
 
 
The original post: /r/datahoarder by /u/PricePerGig on 2025-02-17 21:28:30.
3362
 
 
The original post: /r/datahoarder by /u/Li-renn-pwel on 2025-02-17 20:46:15.

Original Title: Trans and other GRSM victims are being purged from NamUs and other government websites. If you are aware of a non-cis Jane/John Doe, murder victim or missing person, please attempt to save their profile before they disappear or comment their name for someone else to make a record.


A few days ago, someone posted on r/gratefuldoe about Julie Doe's profile disappearing. At the time the poster hoped it meant she had been identified. Unfortunatly it seems more likely that she was removed for being trans.

The next day I searched and found namus_mp79807 had also been purged from the site.

Not long after, NamUs posted a notice that they are attempting to comply with Trump's executive order. At the moment it is unclear if these profile will remain purged or if NamUs will 'only' remove mentions of their gender identity. At best, this will make identification and solving cases harder (if a trans girl transitioned after disappearing, people looking for their 'son' will overlook her profile if she is properly identified as a girl. Or if she was identified as a man instead, those who knew her as a woman would overlook her profile) at worst they would remain gone and will disappear. GRSM people are at greater risk of violence than cishet people and are often specifically targeted because of the belief no one will look for them. It is horrific that the government would do this as it will only make them more vulnerable.

Unfortunately, NamUs does not allow you to search specifically for GRSM terms. I am not sure how many profiles are still up, when I made my first post, there were a few people were linking that still opened. Trans Doe Task Force has made a statement of their attempts to deal with this crises and figure out which cases have been affected. However, the task is made entirely of Trans people and their loved ones which means that they are also having to manage how these executive orders are affecting their own lives.

These people are human beings and they deserve justice. They deserve their names. Their friends and family deserve answers.

If you are aware of a GRSM on NamUs or NCMEC or just know of a victim we can create a record of, please grab the info before it disappears. If you don't have time for that, please just leave a name/identity and I will do my best to make a record on my own.

ETA: thanks for the award! I really appreciate it beach’s I’m having to fight to keep these posts up. Unresolved mysteries is refusing to repost my original post, initially calling it too political and then saying it no longer mattered because I had posted here. I think the mods there do a great job and seem to have mostly been allies. I am hoping it is just one or two of them making this decision and the other mods will see that this is insanely important.

3363
 
 
The original post: /r/datahoarder by /u/chineke14 on 2025-02-17 20:37:26.
3364
 
 
The original post: /r/datahoarder by /u/PossumSymposium on 2025-02-17 19:19:29.

I tried an Epson ET-2800

https://epson.com/For-Work/Printers/Inkjet/EcoTank-ET-2800-Wireless-Color-All-in-One-Cartridge-Free-Supertank-Printer-with-Scan-and-Copy/p/C11CJ66202

and while the resolution was decent, it didn't really seem like enough, and then I tried my scanner at home, and it was even worse than that. I have 1000's of old photos, and I'm hoping to find something that's decent enough to where I can zoom in a good amount without it becoming immediately pixelated. the two consistent possibilities that come up are

The Epson Fast Foto

https://epson.com/For-Home/Scanners/Photo-Scanners/FastFoto-FF-680W-Wireless-High-speed-Photo-Scanning-System/p/B11B237201

epson v600

https://epson.com/For-Home/Scanners/Photo-Scanners/Epson-Perfection-V600-Photo-Scanner/p/B11B198011

these are the main two I'm considering, but my main concern is that it scan things in such a way that the quality of the scan is equal to the quality of the picture I'm holding in my hand.

Personally I'm thinking about getting the Epson v600 to ensure the best outcome possible. also it scans negatives:

https://epson.com/For-Home/Scanners/Photo-Scanners/Epson-Perfection-V600-Photo-Scanner/p/B11B198011

has anyone had a better experience using a different type of hardware?

3365
 
 
The original post: /r/datahoarder by /u/pmbrandvold on 2025-02-17 18:36:59.
3366
 
 
The original post: /r/datahoarder by /u/BackFlip2005 on 2025-02-17 18:09:55.

Hey everyone,

I’ve just come to the realization that I’ve been a Data Hoarder my whole life without even knowing it. Since I was 12, I’ve accumulated a paranormal amount of data, thinking it was completely normal… until I realized that owning thousands of CDs and terabytes of meticulously stored files wasn’t exactly common behavior. Ebooks, the most obscure conferences, art installations...movies, you name it.

But here’s the lame twist: I have nothing left. Everything I had hoarded over the years is gone: lost due to various reasons (degradation, storage failures, chronic depression…). I had to learn this lesson the hard way, and now I want to make sure it never happens again. More importantly, I want to help others avoid the same mistakes.

One of my biggest passions has always been YouTube. I’ve been deep into the platform for the past 20 years, witnessing its evolution, mass deletions, and its massive impact on digital culture. I no longer have my old archives, but I’ve gained extensive knowledge about formats, trends, vanished channels...

So now I’m wondering: What can I contribute to the archival community?

I’d love to put my experience and knowledge to good use, but I’m not sure where to start. Are there any active projects related to YouTube that I could join? Any initiatives I should be aware of?

Thanks in advance for any advice, and I’m looking forward to connecting with people who share this passion!

3367
 
 
The original post: /r/datahoarder by /u/UsernameTakenIThink on 2025-02-17 17:04:56.

Looking for some ideas. I am currently in the process of hoarding movies and tv shows. I am up to ~150, with plans for many many more. I am working off of a spreadsheet that was provided to me, and it has a wide collection of media, but not complete series. Example being one movie out of a trilogy, or one season of a show. the tv shows are easy because of the seasons, but I was wondering if anyone had any thoughts on how best to determine what movies I am missing from a certain collection. I have a running spreadsheet of what I have archived so far, and I am not against just going down the list manually searching for information on each one, but I am looking to see if there may be an easier way. Thanks for the thoughts!

3368
 
 
The original post: /r/datahoarder by /u/deathstrawnote on 2025-02-17 16:52:57.

Recently purchased Cenmate two bay DAS? Which HDD brand(WD, Seagate, Toshiba) works for Cenmate DAS?

3369
 
 
The original post: /r/datahoarder by /u/mattblackonly on 2025-02-17 16:46:44.

I created an app called SpotSpot that allows you to search for music via the Spotify API and download tracks from YouTube using SpotDL/yt-dlp.

However, the quality with yt-dlp is limited. Are there any better libraries or tools for higher-quality downloads?

Basically, I'm looking for something as a drop in replacement as SpotDL (or minimal changes) and preferably something that works with free accounts.

Any recommendations would be greatly appreciated!

https://github.com/MattBlackOnly/SpotSpot

https://preview.redd.it/bceeysy0gqje1.png?width=1350&format=png&auto=webp&s=e9e0ffd7995e62f1a3edb50b4748109608bae3f3

3370
 
 
The original post: /r/datahoarder by /u/Gyrobreaker on 2025-02-17 16:45:20.

This is my first time shopping for drives in the 500gb-1tb range, and I've had my eyes on a Crucial brand external drive. I'm just wary because drives (vs just flash memory) can be expensive, and I want something that'll last me a while.

I'll be using this for my games (stuff like visual novels); artwork/photography, and some backup, so I'll be using it frequently.

Any experience with Crucial brand external ssds? Specifically looking at the " Crucial X6 SE 1TB External USB-C/USB-A Portable SSD "

3371
 
 
The original post: /r/datahoarder by /u/milkbeard- on 2025-02-17 16:42:40.

I’m looking to populate a new NAS with new disks. Noise is not an issue, I’d like the most dependable option possible.

Option 1 - Seagate Exos 8TB from Newegg, for $140:

https://www.newegg.com/p/1Z4-002P-023P8?item=9SIAAEEJNH0692&%5C_gl=1%5C17ghf3m%5C%5C_gcl%5C_aw%5CR0NMLjE3Mzk3NTMxNjMuRUFJYUlRb2JDaE1JNXZuLXZyM0ppd01WcDFqX0FSMmNjQ0JtRUFRWUVDQUJFZ0luMVBEX0J3RQ..%5C%5C_gcl%5C_dc%5CR0NMLjE3Mzk3NTMxNjMuRUFJYUlRb2JDaE1JNXZuLXZyM0ppd01WcDFqX0FSMmNjQ0JtRUFRWUVDQUJFZ0luMVBEX0J3RQ..%5C%5C_gcl%5C_au%5CMTk0MjUxNzkyOS4xNzM5NjY4NjQ4%5C%5C_ga%5CNDI1MjIwMzEuMTczOTY2ODY0OA..%5C%5C_ga%5C_TR46GG8HLR%5C*MTczOTgxMDI0NC42LjEuMTczOTgxMDI0OS4wLjAuMTg5NDc1MTc5MA..

Option 2 - Seagate Ironwolf 8TB from Microcenter, for $160:

https://www.microcenter.com/product/690299/seagate-ironwolf-8tb-7200-rpm-sata-iii-6gb-s-35-internal-nas-cmr-hard-drive

I’d prefer the exos for durability reasons, but I’m having a hard time finding a reputable seller. The only sellers that have them in stock appear to be Newegg and Amazon. So that’s a risk I guess. What would you do?

3372
 
 
The original post: /r/datahoarder by /u/Anarethos on 2025-02-17 16:12:11.

Hi everyone! First post here. I hope I am un the good subreddit for my questions but seeing that there is a lot of post about Storage Space, SnapRAID, Drivepool, ZFS and so on, I suppose so!

First of all, I am a Windows guy. Although I have some knowledge in Linux, I don’t really use it and all the solution I search should be on Windows.

Secondly, English is not my natural language, I try to do my best but if I am unclear about something, please politely let me know and I will try to re-explain!

I have 2 home-server. Everything is “consumer grade”.

  • Server 1 :
  • I5 2nd Gen, 32gb ram.
  • 2x1tb drive in RAID-1 for the OS (small drive as the borad is not UEFI)
  • 2x6tb drive in RAID-1 for the Data (WD Gold)
  • 1x4tb drive for “internal backup”. (WD Gold)
  • Windows Server 2019
  • 7 running VMs
  • Server 2 :
  • I5 4th gen, 32 gb of Ram
  • 2x8tb RAID-1 for the OS + Data (2 separate partitions) (WD Black)
  • 2x8tb RAID-1 for the Data (Seagate IronWolf RED)
  • Windows Server 2022
  • 3 running VMs

For the RAID, I use Intel RST on both server.

On the server 1, I have an export script that export everyday/week critical VMs to the internal HDD. Most VM have 1 VHDX backup also (so each VM do their own backup on a VHdX on the RAID-1) and the VM (and it’s backup) are exported to the internal backup drive. Every month I do a full export of all VMs on an external USB drive.

Server 2 don't have export/backup script. I do manual backup on an external USB drive from time to time.

Both servers are "Bitlocked". OS, Data, Backup, etc. The VMs are not "Bitlocked" by themself, only the drive where the VHDx are.

I decided to use RAID-1 (Intel) for resiliency. I made some tests and I can read theses drives from an external enclosure on another computer. Can’t do that with RAID-5 disk though.

So I decided to use that if a server fails, I can simply connect the data drive on my personal computer, start hyper-v and import the VM rapidly.

Now … a family member that work in a datacenter gave me 10 4TB SATA drive (and I can have almost the number I want. Yeah!). I want to add storage to Server 2. More space and backup of critial files.

The casing is too small, and I can’t add anymore disk in it. Anyway, the PSU don’t have any more SATA power jack available.

So, I bought myself an external enclosure (Probox with USB 3 + eSATA), an internal eSATA card with portmultiplier (Startech) and decided to play with storage space. I want to use the external enclosure in a RAID-5 settings (4 drives). I will be using it to add VHDX virtual disk to already running VMs. Also, as for the internal storage, the RAID-5 will be Bitlocked.

Now … well … nothing works as expected. My first test shows me that USB 3 is faster than eSATA on the enclosure. Also, the speed is abysmal (and with Bitlocker, I may as well use floppies).

I tried to play with the number of columns/Interleaves/AUS and it helped a little bit but not that much.

I found that 4 columns settings, with an interleaves of 256kb and AUS of 512kb is the fastest (although I would I though that a 3 columns settings on 4 drive of 256kb interleaves and 512kb AUS would have been faster). But even though I manage to only get 1941 megabytes per minute (Robocopy result) on that setting. Half in eSATA! And that’s without Bitlocker. I found also that using ReFS is worst and unstable.

So now .. I asking you .. what should I do with that? I need a way to have more storage, for my Hyper-V VMS to host my VHDX, the drive must Bitlocker enabled, and I don’t have a lot of money. I can’t buy another computer, put Truenas on it and share the drive with iSCSI by example. Purchasing a new NAS seems a little overkill since I just need a way to have RAID-5 disk avlaible for my Windows box, not a full blown soluting with CIFS file sharing, etc.

Some people speak about SnapRAID but, if I understand, it is mostly for parity, on not frequent modified file and it is manual. Here, I will be using the drive for VHdx on a file server, so I think it can’t work for me.

Drivepool seems to do pooling but not parity and I am unsure about the way it works for big file (VHDx) bigger of a drive itself. Like, can I put a file bigger than 1 drive on it?

Is there any other solution that will works for me? What are my options?

Thanks in advance for your time!

3373
 
 
The original post: /r/datahoarder by /u/thingie2 on 2025-02-17 15:10:04.

I've currently got a home server setup, using a SATA expansion card, but I've got an LSI HBA ready to swap in, but due to the case/space I have, I intend to mount my drives in a seperate enclosure, so I've been doing some research into different SAS cards, expanders & external connectors.

My current plan is to connect as follows:

HBA (SFF-8087) to a converter mounted in the PCI slots (SFF-8087 internal to SFF-8088 external) (0.5m SFF-8087 cable used)

External 2m SFF-8088 to SFF-8644 cable to the HDD enclosure.

Adaptec AEC-82885T SAS expander in HDD enclosure

SFF-8643 to SATA connectors (0.5m)

Anyone with more knowledge of this able to comment if this is a stupid way to do it, or if it'll be absolutely fine? And if it's stupid, what's stupid about it & what should I do differently (I know ideally I'd have an HBA card that has an external connector, but the one I have doesn't have one)?

3374
 
 
The original post: /r/datahoarder by /u/Ms4sman on 2025-02-17 14:03:01.

I am trying to figure out if I'm crazy or what. I have had 5 out of 6 drives from GoHardDrive accrue dozens if not hundreds of SMART errors within days of receiving them and I don't understand if I just have awful luck or if I'm somehow doing something wrong.

TL;DR: I ordered two drives from GHD. Both had dozens of bad sectors and other SMART errors within days. GHD paid for a return label and sent me two more. Both of these had the same issue. GHD told me they were out of that model and offered a refund which I accepted. I then ordered two of a different model from them to try, and one of those has now also had the same issue, and I'm hesitant to trust the last one either at this point.

For background, I'm working with an unRAID server which I have had for several years now and which already has 4 WD Red drives in it, a 4 TB and 3 3 TB ones. The 4 TB is the parity drive and the 3 TBs are the array drives. None of these has ever had any SMART errors at all and the oldest are over 7 years old.

I wanted to upgrade my array since some drives were getting old and all were pretty small, so I ordered 2x Seagate Enterprise Capacity ST12000NM0127 12TB drives from GoHardDrive. This was the first time I'd ordered from them. They arrived well packaged and all that and I put them into my server and started running extended SMART tests and pre-clearing them both. By the time they had finished, both of them had a handful of SMART errors including reallocated sectors, pending sectors, and offline uncorrectable sectors. I know these are manufacturer recertified drives, but my understanding is that if they are recertified they are SUPPOSED to be error free...

I contacted GHD and they were very helpful and immediately sent me a pre-paid return label for RMA and as soon as they were returned sent me two replacements. I figured I must have just had terrible luck and started running SMART tests and pre-clearing the new drives. They both passed extended SMART tests, but then during pre-clearing one of them quickly racked up 1448 reallocated sectors. The other initially seemed OK. I contacted CS again and again they sent me a label to return the drive. However, this time they could not replace it as they said they were out of stock on that particular model. I opted to instead return it for a refund and try a different model and ordered a Seagate IronWolf ST12000VN0007 12TB while I waited for the bad one to be delivered back.

About the time I received this new drive and started checking it out, the OTHER of the original 2 replacements suddenly started having issues. Until that point, it HAD been fine and I'd been successfully using it as a parity drive. It quickly racked up over a thousand reallocated sectors as well. Meanwhile the new drive seemed to be doing OK.

I contacted CS AGAIN, now feeling that they must hate me, and they again sent me a return label and I chose to get another refund. Since the Ironwolf drive seemed to be doing well, I ordered a second one of those. That second replacement arrived a few days ago, and in the process of pre-clearing it, yet again, it racked up dozens of reallocated sectors, pending sectors, and offline uncorrectable sectors. As of today, the first Ironwolf still seems ok, but I'm hesitant to trust it either at this point.

Am I doing something wrong here? I really hope I'm not somehow taking advantage of their CS department. But I can't see how I'd be causing this. All I can think of that might cause this from my end would be maybe a bad power supply in my server or a bad SATA controller on my motherboard. But I don't think it could be either of those because all this time, my original four WD Red drives has been perfectly fine. If there was a power issue or SATA controller issue, you'd think my other drives would be having issues right? Am I missing something totally obvious? What should I do at this point?

3375
 
 
The original post: /r/datahoarder by /u/Marco_1982 on 2025-02-17 13:49:06.

Hello,

anyone can suggest a tool to download my private youtube playlist?

view more: ‹ prev next ›