It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
6376
 
 
The original post: /r/datahoarder by /u/LavaCreeperBOSSB on 2024-09-20 20:45:01.

Hey all,

have a long chat history with a snap user with a lot of saved in chat media that i'd like to download with timestamps. I tried snapchat "takeout" but there is no timestamp metadata I could find at all, only dates. Any thoughts on how I could do this?

6377
 
 
The original post: /r/datahoarder by /u/gaybrat666 on 2024-09-20 20:44:17.

I’m not really a data “hoarder” I’d need like 2 TB max for the foreseeable future but I’m just now learning having everything on HDD or SSD isn’t great because they’ll both fail over time, are there any better solutions to cheap data storage other than have multiple HDDs for backups and swap them out as they die?

6378
 
 
The original post: /r/datahoarder by /u/-notreddit on 2024-09-20 19:19:21.

Anyone tried it yet?

Free version is limited to 2 threads max, btw

On version 3, I seem to have been getting greater performance on copying full folder backups of photos from one SSD to another with buffer size equal to 4MB. At higher number the transfer speeds dropped on average. I have 64GB RAM

I'll try to play with various settings on this one; but does anyone have any ideas or suggestions what settings worked for you? Threads, buffer sizes, blocks?

threads

buffer size and blocks

6379
 
 
The original post: /r/datahoarder by /u/Kevthehustla23 on 2024-09-20 18:48:05.

What is the difference between them? I’m not really sure what the purpose of the WD easystore is. It’s for backups? Can I use it like a regular ssd to transfer files?

6380
 
 
The original post: /r/datahoarder by /u/luuukasch on 2024-09-20 18:09:05.

I saw a post from u/wspnut on how to identify fake flash disks via the F3 write/read method, in this sub.

I got the following result after reading, however I’m not sure what this means, and am unable to find any info on it. Does anyone know and can ELI5?

Data OK: 115.46 GB (242140114 sectors) Data LOST: 988.50 KB (1977 sectors) Corrupted: 987.50 KB (1975 sectors) Slightly changed: 0.00 Byte (0 sectors) Overwritten: 1.00 KB (2 sectors) Average reading speed: 100.91 MB/s

6381
 
 
The original post: /r/datahoarder by /u/kalel8989 on 2024-09-20 18:01:40.

just wondering if anyone backed up this site, it was an interactive map with boondocking sites in the US, disappeared without any notice unfortunately. thank you.

6382
 
 
The original post: /r/datahoarder by /u/SnooKiwis6047 on 2024-09-20 17:11:17.

Bought this

https://a.co/d/1lmACpq

Setup Raid5. Going to be great for my little Plex Server.

Anyway nothing special but thought I would share. The enclosure is working really well for my uses (though it seems to have a max r/W of about 220MB but I can not complain at all for the price)

6383
 
 
The original post: /r/datahoarder by /u/DJviolin on 2024-09-20 16:48:35.

Thanks to this group, I successfully implemented DrivePool as a "RAID 1" software solution to mirror drives in case of one fails. I have 2x2TB in a pool mirrored in my PC. This way, kind of secured my live works from one possible failure.

I'm a photographer and I keep my shoots in external HDDs. Mostly WD My Passports, I'm not so proud of this, I learnt in the past days that they doesn't have standard SATA connectors...). I always tried to copy shoots at least to two drives, but you know how it goes, manually doing something not represents 100% coverage. I thought about DAS or even simpler external 2-bay HDD enclosures (I don't need RAID0, RAID1 and JBOD switches) and mirroring them with DrivePool. BUT the problem remains: at the certain point of time a photoshoot only will be represented powered on my PC: once on a 2X2TB pool and another mirrored pool connected to my PC via USB or eSATA. Single point of falure still presents, doesn't matter everything is on four drives and in two pools.

Is there any software, which does what's in the title, able to compare a drive (or folders) presented in my PC with a previous offline drive, that is already cached? So I want to still copy my data to external drives one at a time, I just don't want to hook up to my PC everything at once.

6384
 
 
The original post: /r/datahoarder by /u/Infamous-Contract-62 on 2024-09-20 16:08:51.

specifically need an asus fa617.

6385
 
 
The original post: /r/datahoarder by /u/sioux612 on 2024-09-20 14:28:47.

I like reading stuff on reddit, like HFY or specifically /r/KoyoteeLaughter, but would very much prefer reading it on my kindle/remarkable tablet over my phone

Can anyone recommend a solution that does not require me to hand select text in 300+ chapters, or screenshot and convert them?

6386
 
 
The original post: /r/datahoarder by /u/--dany-- on 2024-09-20 14:14:50.

So I had a directory on with 4 million images, each about 100k - 1m bytes... And it comes out reading anything off the directory becomes extremely slow. We're talking about rsync or tar at a few GB/hr. It never occurred to me it's so slow. I used to serve the images through a web server. No idea what happened, but now I'm paying my tech. debt to archive everything into a big tarball, then back it up. I know I should have split the directory by file prefix... on contrary the other directory with 600GB images/videos organized in subdirectories by date only took an hour to archive.

But can anybody illuminate me: why is it so slow? And hopefully nobody on this sub would repeat my stupidity again. :D

some additional info: CPU usage is about 4%, memory 2%, disk io at a few 100kbytes/s. The file system is ext4 on RAID0. These images are not tiny text files, therefore the disks are hopefully not totally doing random i/o. And the disks are only about 70% full, so unlikely caused by severe fragmentation.

6387
 
 
The original post: /r/datahoarder by /u/davotoula on 2024-09-20 13:15:54.

I've had two drives in a matter of few days, bought from amazon, show up as wrong model in seagate product registration when registered using the serial number.

I was worried about not genuine products being sold on amazon so reached out to seagate support.

Initially they told me "it's wrong model being displayed on our website, don't worry about it".

When I pressed them a bit on it they told me to return it to amazon and get a new one.

I did that and once again wrong product (better quality) is being displayed.

Is this a known issue?

6388
 
 
The original post: /r/datahoarder by /u/fallenguru on 2024-09-20 12:17:28.

Would like a cold (offsite) backup. Don't have the money for LTO, or the amount of data to make it worthwhile (<20 TB, and not all of that is important). I've been using M-Disc for the really important stuff (documents, family photos, ...), but the capacity and $/GB are prohibitive as far as backing up the hoard goes.

The only solution that comes to mind is using HDDs and praying. Are there any makes / models / features that do comparatively well (badly) at being left on a shelf? Anything to look out for?

Are external drives preferable (included enclosure and controller) or should I go for a hot-swap caddy and use internal drives?

6389
 
 
The original post: /r/datahoarder by /u/VineSauceShamrock on 2024-09-20 10:06:24.

So, I'm trying to download all the zip files from this website:

https://www.digitalmzx.com/

But I just can't figure it out. I tried wget and a whole bunch of other programs, but I can't get anything to work.

Can anybody here help me?

For example, I found a thread on another forum that suggested I do this with wget:

"wget -r -np -l 0 -A zip https://www.digitalmzx.com"

But that and other suggestions just lead to wget connecting to the website and then not doing anything.

Another post on this forum suggested httrack, which I tried, but all it did was download html links from the front page, and no settings I tried got any better results.

6390
 
 
The original post: /r/datahoarder by /u/edparadox on 2024-09-20 08:31:21.

I am on the look out for new drives, but I am on a budget while needing lots of storage capacity. I did not buy drives in years, but I know prices have not decreased since last time. But I would need a price check.

I found the following drive and was thinking that it was too good to be true: https://www.amazon.de/dp/B0CV3LH1YD?tag=synack0f0e-21&linkCode=osi&th=1&psc=1

So, what do you think?

Edit: Obviously, I know these are refurbished, and sold by a 3rd-party seller. However, it is very rare to see something that close to what you could get on serverpartdeal (minus all the import fees) in Europe.

6391
 
 
The original post: /r/datahoarder by /u/SuSaSi on 2024-09-20 08:29:29.

External Hard Drive spins, gets recognized by windows but its not opening on File Explorer. Just a green bar progressing across the address bar. Is there any way I can get my Data!?

6392
 
 
The original post: /r/datahoarder by /u/Far_Marsupial6303 on 2024-09-20 07:52:49.

Mahalo nui loa to the OP at r/Hawaii! I turned on my VPN and all gooder!

https://www.reddit.com/r/Hawaii/comments/1fl5h14/spectrum_customers_facebook_and_instagram_images/

6393
 
 
The original post: /r/datahoarder by /u/Nirbhik on 2024-09-20 07:29:03.

I recently got a paid monthly 20TB plan from Idrive for long term cold backup. After having using my internet bandwith to upload around 5TB the account stopped working and went ‘under maintainance’. Repeated emails to tech support elicited vague repelies like ‘we are working on it’. Finally I called them up to enquire whats going on. The support guy at the other end said the same thing that they are working on solving the problem. When asked for a timeline they said they cannot give any timeline as of now.

Is this a scam!?? Which cloud drive randomly suspends access to your account and doesn’t give a timeline as to when it will be back online? While I blame myself for going for the cheapest alternative I have to say that I also trusted to glittery reviews from PCMag, Cloudwars etc.

I cancelled my subscription and got my credit card company to dispute and refund the payment. In the end I lost some of my internet bandwith and time uploading data.

6394
 
 
The original post: /r/datahoarder by /u/MeltedByte on 2024-09-20 06:59:13.

It is about Icy Bix Raid 2x2.5. The question is can I use USB to connect for data transfer and ANOTHER USB (instead of powercord) to connect for power? Does it will work? Thanks y'all!

6395
 
 
The original post: /r/datahoarder by /u/Interman90 on 2024-09-19 20:36:46.

These days i mainly use optical discs (cd-r, dvd-r, bd-r) for backups.

I have been using cd-r and dvd-r discs for decades and never had one fail due to age.

Yesterday i burned a bd-r and was curious what lasts longer cd-r, dvd-r or bd-r.

The answer shocked me a little bit:

https://www.reddit.com/r/DataHoarder/comments/102jwds/comment/kdy9ura/

I was under the impression (especially since i never had a disc fail) that cd's and dvd's basically last forever if you do not mechanically damage them.

Now i find out depending on the exact disc they might only last 20 years or less.

But i also found that some CD-R and DVD-R discs (that have a gold layer) are designed to last up to 100 years.

I did a little research and found these:

https://www.verbatim.com/prod/professional-optical/archival-grade-gold-dvd-r/ultralife/

https://www.verbatim.com/prod/optical-media/professional-optical/cd-r-archival-grade-gold/ultra-life/

I know the Verbatim brand for years and they claim a lifespan of up to 100 years.

Are these a good buy if longevity and compatibility are the main concern?

Is there an even better option?

Is there also an option like this for Blu Ray discs?

I know M-Disc but never used them because of compatibility.

6396
 
 
The original post: /r/datahoarder by /u/Doomed on 2024-09-20 05:56:50.

This only became a problem for me as I've gone through about 5 PCs and 10 hard drives and 1.5 NAS.

I have lots of partial backups stored across many drives. I want to centralize them into one drive and folder structure, then back up the drive using standard methods.

Backup part is easy. The dedupe part is the wild west.

I'm not talking about "similar" or "perceptual" duplicates. That's a rabbit hole of its own with justified complexity and no objective truth. I mean byte exact copies.

I used jdupes back in 2018. Turns out it had a bug and instead of deduping I was de-filing every last copy I had. Noted: dedupe software should be boring, small, and filled to the brim with tests.

I look around. czkawka seems popular. And to be fair, it looks good. To be fair, it doesn't seem to have deleted anything but duplicates since I started running it. But it's GUI based and that introduces all kinds of error sources. It does more than just dedupe. That's great, I want to use some of those extra features. But I don't want that thrown into one program. There should be one tiny program to do this, with plugins or whatever to do all the extra stuff. czkawka has a CLI but it's not well documented. Testimonials for all these programs are uncommon - same with tutorials.

I don't get why this is so hard. It feels like it should be a one line command for a program designed for exactly this. The fclones docs talk about all the things you can do with the software. And one of them is deduplication. But I want the one, time tested, failsafe, dummy proof, dedupe script. This is not something the user should have to write themselves.

fclones is CLI and tops the benchmarks.

The code has been thoroughly tested on Ubuntu Linux 21.10. Other systems like Windows or Mac OS X and other architectures may work.

(Emphasis added). Danger! Danger! Good news though, I can't even find a Windows binary. So you'd have to go out of your way to do something this stupid.

I want a duplicate finder with 10x as many lines of tests as it has lines of code. It should be fail safe. See: https://rmlint.readthedocs.io/en/latest/cautions.html

JDupes cited this, giving me false security: https://github.com/h2oai/jdupes?tab=readme-ov-file#does-jdupes-meet-the-good-practice-when-deleting-duplicates-by-rmlint

I'm even skeptical of command line options. Depending on the setup of the program, you're giving users a loaded gun and telling them to be careful. Something like this design might be safest:

# find the dupes
dupefinder path:\ >found_dupes.txt
# send the dupes we found to the trash
dupetrasher found_dupes.txt

Fclones does look really good. And it uses this design. What triggered the last part of my rant was the "hash" section of the readme. You, dear user, can choose from 1 of 7 hash functions for deduping. When would you ever need this? It adds a surprising amount of complexity to the code for little gain. Deduping in general, and hash selection specifically, is one of those problems where I want Great Minds to tell me the right answer. What's better for hashing in a dedupe context, metro or xxhash3? I don't know, probably xxhash because it's faster but I have no idea. When the hell would a user need a cryptographic hash on their own files for deduping? Why do you think your users can do this calculation on their own?

Globs introduce error. Great! Why not just read from a config file?

Using --match-links together with --symbolic-links is very dangerous. It is easy to end up deleting the only regular file you have, and to be left with a bunch of orphan symbolic links.

Thanks for the heads up, but this shouldn't be possible if it's that dangerous.

After reading through the docs of fclones and elsewhere I'm not even convinced it should operate across folders or drives. There's so much trickery afoot and the risk of failure is so high.

6397
 
 
The original post: /r/datahoarder by /u/TimeNarc on 2024-09-20 03:20:58.

So I wouldn't consider myself qualifying of the title "data hoarder" but currently I have two external 20 TB drives being used with a popular media server program attached to a micro pc.

One is being used as a main drive and one is a mirror that's currently being populated on a schedule with SyncBackPro.

This had lasted me a few months but now I'm running low on space (2 TB remaining) and wondering what kind of setup may be best for my needs.

Should I just buy two more 20 TB external drives and use some software to mimic a singular 40 TB drive (if that's even possible)?

Should I try to buy some off the shelf 4 or 6 bay enclosure, shuck the drives and then fill the rest up as needed with drives?

Just bite the bullet and bites nails prune my data down? None of the above?

I never thought I'd even fill up an 8 TB drive a few years ago but now having a full 20 TB drive(and a 5 TB seed box that's almost full as well) is something I never expected.

Any advice is greatly appreciated.

6398
 
 
The original post: /r/datahoarder by /u/pixie7777 on 2024-09-20 02:35:40.

Hello, I need help in storing my pictures. My dad passed away so I have the family collection plus about 12k that have to be scanned then a 5-6 hard drives. I currently uploaded 1 hard drive and have the pics from my phone on my mac and it is full. Between just those two it’s about 400 gb.

Is there a solution where I can back it up on an external hard drive or sdd and then see the pics on my phone through the phone app?

I would also like something that supports the live photoi feature as it captures so much of our daughter.

And would love something that capture date and time and also Face ID and etc like the photos app.

I am completely new to this so please bear with me and thank you in advance!

6399
 
 
The original post: /r/datahoarder by /u/BLKMGK on 2024-09-20 01:51:43.

I'm looking at the LSi 9305-16i card, it's a little older now and has gotten cheap enough to pursue. It's listed as a 16 port card but it's got just 4 miniSAS connectors. I see an SFF-8643 breakout to 4x SATA out there but that doesn't seem right, is this cable not for the mini-SAS port maybe? If this really has support for 16 SAS what can I attach to it to support a far larger number of SATA? 4x per SAS port right? What have I missed? Does a better solution exist for a single slot that's not a mint?

I'm currently using two SAS cards and want to free a slot while supporting 24+ SATA drives. I have a SAS card with 6 physical ports my mobo refuses to recognize so this "16" port card caught my eye at sub $100. This is destined for IT mode obviously :) Searches didn't provide any good answers.

P.S. Yes, I know about expanders. No I'm not looking to use one at this time and the backplane I'm currently using in this 4U machine doesn't have one. :(

6400
 
 
The original post: /r/datahoarder by /u/RonBurgandy2010 on 2024-09-20 01:42:08.

I'm looking to migrate my UnRaid server from my absolutely massive NZXT Switch 810 Full Tower case into something more manageable, and the Jonsbo N3 caught my eye. I spent some time mocking up the part list below, and am worried about drive incompatibilities with the drive backplane.

This is my existing server, and obviously the mobo needs to change to Mini ITX for the case. Figured I'd take the opportunity to update the CPU while I'm at it. I chose the Intel Xeon E2288G for a handful of reasons: it has onboard graphics for troubleshooting, it supports ECC memory, and it has a slightly higher core count than my current setup. As far as I could tell poking around on PCPP, the newer AMD chips (7000 and 9000 series) support ECC, but if I go older and Intel, I can save money on what's mostly a glorified NAS that runs Plex and Home Assistant as well as some supporting programs. From there, I picked a MOBO that was Mini ITX and seemed to have the IO I needed. Throw a Mini SAS to SATA cable into the port for 4 SATA connections, 4 more on the board itself, use the M.2 either for more SATA connections or a cache drive, onboard USB A port for UnRaid USB, and I should be good.

My main concern is the compatibility of my HDDs. My drives are a mix of random Enterprise, shucked, and high-capacity consumer drives, the model numbers are in my part list. In my current server, the SATA power connectors from the PSU only really work for maybe one or two of the HDDs (obviously the consumer 1.5 SSD cache drive works fine). The drives are all currently powered by Molex to SATA power adapters, as whatever prevents the drives from using the PSU SATA cables doesn't seem to be an issue on the adapters. I believe at least one of them is shucked (they're pretty much all second hand from ServerPartDeals and refurbished part liquidators on eBay), so maybe the Kapton tape trick could work? I'm just concerned some of my drives won't get power from the backplane, even though they're powered by Molex power in.

I've also never migrated an UnRaid deployment from one machine to another but I can probably find a guide for that on YouTube.

So, should I be concerned with power incompatibility, or is the way the backplane is set up kinda just force power through?

PCPartPicker Part List

| Type | Item | Price | |


|


|


| | CPU | Intel Xeon E-2288G 3.7 GHz 8-Core OEM/Tray Processor | - | | CPU Cooler | Noctua NH-U9S 46.44 CFM CPU Cooler | $59.95 @ Amazon | | Motherboard | Gigabyte C246N-WU2 Mini ITX LGA1151 Motherboard | - | | Memory | Crucial CT9029050 32 GB (2 x 16 GB) DDR4-2400 CL17 Memory | - | | Storage | Western Digital DC HC530 14 TB 3.5" 7200 RPM Internal Hard Drive | Purchased For $0.00 | | Case | Jonsbo N3 Mini ITX Desktop Case | $158.00 @ Newegg Sellers | | Power Supply | Corsair SF600 (2018) 600 W 80+ Platinum Certified Fully Modular SFX Power Supply | $239.00 @ Amazon | | Case Fan | Noctua R8 redux-1800 PWM 31.37 CFM 80 mm Fan | $12.95 @ Amazon | | Case Fan | Noctua R8 redux-1800 PWM 31.37 CFM 80 mm Fan | $12.95 @ Amazon | | Custom | HGST Ultrastar He10 | HUH721010ALE600 | 0F27452 | Non Power Disable | 512e | 10TB SATA 6.0Gb/s 7200 RPM 256MB Cache 3.5" | Enterprise Hard Drive (Certified Refurbished) - w/3 Year Warranty | - | | Custom | HGST Ultrastar He10 | HUH721010ALE600 | 0F27452 | Non Power Disable | 512e | 10TB SATA 6.0Gb/s 7200 RPM 256MB Cache 3.5" | Enterprise Hard Drive (Certified Refurbished) - w/3 Year Warranty | - | | Custom | OIKWAN Internal Mini SAS to SATA Cable, SFF-8643 to SATA Forward Breakout Compatible with Raid Controller Hard Drive (3.3ft) | $13.99 @ Amazon | | Custom | HGST HUH721010ALN604 0F27516 10TB 7.2K RPM SATA 256Mb 6 Gb/s 3.5" HDD Ultrastar He10 4Kn | - | | Custom | HGST HUH721010ALN604 0F27516 10TB 7.2K RPM SATA 256Mb 6 Gb/s 3.5" HDD Ultrastar He10 4Kn | - | | Custom | WDC_WD140EDFZ-11A0VA0 | | | | Prices include shipping, taxes, rebates, and discounts | | | | Total | $496.84 | | | Generated by PCPartPicker 2024-09-19 21:16 EDT-0400 | |

view more: ‹ prev next ›