It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
9176
 
 
The original post: /r/datahoarder by /u/TheXypris on 2024-06-13 02:47:04.

So I asked r/Plex a few days ago and got some really great advice

One of which was to send me here

So along side what I was told there I had more questions

How is shucking more affordable than just buying the regular drive? And how do I actually find those deals

Are Seagate drives really that bad?

What is the general consensus on buying drives second hand off eBay or through a liquidation auction? Bad idea or risky at best?

I'm assuming Facebook marketplace/Craigslist should be avoided, but is there any merit in looking there? Like if I find a second hand external I could shuck?

How important are RPMs sata type and all that other non capacity info? What's good and what's bad? Tradeoffs?

How long will a healthy high capacity drive realistically last? People keep saying raid is mandatory and act like drives fail because you look at them funny. I can't afford a raid array right now and anything I'm putting on it would be easily required if time consuming.

Are there price monitoring sites I can use to keep an eye out for deals or sales?

And where is the best place to get sata and power cables for the drives?

9177
 
 
The original post: /r/datahoarder by /u/hmhsbritannic12 on 2024-06-13 02:13:58.
9178
 
 
The original post: /r/datahoarder by /u/Ok-Cartographer1745 on 2024-06-13 01:23:38.

I bought a Wolverine 80gb like 15 years ago. You plug in a memory card (not specifically Sony Memory Stick. I mean like Compact Flash, SD Card, and so on), press a button, and it copies your card to a folder on the hard drive.

But that think is old and low capacity and I probably can't upgrade the hard drive due to the form factor and firmware likely only allowing 80gb drives.

Anyway, anyone know of a similar device that is more modern? I want it to:

  1. be portable and battery powered (nothing huge and nothing that needs to be plugged in to work)

  2. have a large SSD (500 GB or more)

  3. be self sustained (I don't want to have to plug it into a PC nor Bluetooth it to my phone)

  4. accept at least SD cards (more slots would be nice, but not necessary)

I mainly want to do it to easily hoard pictures off my camera, especially if I need to quickly empty out my more expensive fast cards

Also, a link to show what I already have. I'm not the one selling it and I don't recommend buying it because it's so outdated and slow (and the battery probably doesn't work considering how old it is). So don't consider this advertising.

https://www.ebay.com/itm/125748798949?_trkparms=amclksrc%3DITM%26aid%3D1110006%26algo%3DHOMESPLICE.SIM%26ao%3D1%26asc%3D266917%2C266785%26meid%3D0caf8ebbb6584b81a8ed70c03fd786fd%26pid%3D101875%26rk%3D1%26rkt%3D5%26sd%3D285760155205%26itm%3D125748798949%26pmt%3D1%26noa%3D1%26pg%3D2332490%26algv%3DSimplAMLv11WebTrimmedV3MskuWithLambda85KnnRecallV1V2V4ItemNrtInQueryAndCassiniVisualRankerAndBertRecallWithVMEV3CPCAutoWithCassiniEmbRecall%26brand%3DWolverine&_trksid=p2332490.c101875.m1851

9179
 
 
The original post: /r/datahoarder by /u/jjsto on 2024-06-12 23:49:24.
9180
 
 
The original post: /r/datahoarder by /u/1000Zebras on 2024-06-12 22:37:27.

Hi,

I've got a couple of drives running off of a couple of instances of Debian, one of which is at my house, and the other is at my brother's house. They're 14tb drives, currently containing ~4.7TB of data.

I'd like to, ideally I think, keep the two drives/filesystems in sync over the internet, probably through Tailscale so no public exposure necessary. At the very least I'd like to have a solid, relatively up to date backup of all of the data that lives on the drive at my brother's house, backing up that of the one at house.

What are my best options for doing so, and, if it were you, how would you go about setting things up?

I'm thinking maybe btrfs snapshots over ssh using btrbkup (both drives are formatted using btrfs) is probably me best bet, but I've never used snapshots and not sure how easy it would be configure in this case. This would, of course, depend on the drives both being btrfs formatted, which I suppose okay, although I was also thinking maybe it's smarter to have just regular backups that are filesystem agnostic.

My favorite straight backup tool these days is Kopia, so if I were to go the second route I'd probably be looking at using that, although I'm not opposed to going restic. The only problem with that is that I think Kopia can only backup to either an S3-compatible bucket (so maybe run minio on secondary sysem?), or through webDAV which I'd have to figure out how to configure on the machine at my brother's house, or to the local filesystem, in which case I could maybe mount the remote disk on the local machine at my place using sshfs, but that may introduce weirdness, or just be a bit too unstable.

What would you do in a situation like mine? Do you have any experience in setting something like this scenario up and what potential pitfalls would you anticipate?

Thank you for reading the somewhat lengthy post,

I look forward to any insights.

Kind Regards,

LS

9181
 
 
The original post: /r/datahoarder by /u/tillybowman on 2024-06-12 20:55:49.

i‘m looking for a compact document scanner that can upload images immediately to some local file sevice like ftp, sftp, smb or something.

i don’t want dropbox or other cloud services involved.

any recommendation here?

9182
 
 
The original post: /r/datahoarder by /u/Mysterious_Crazy9606 on 2024-06-12 20:34:43.

Hey everyone,

I’ve been thinking about a new way to revolutionize data storage by combining piezoelectric materials with 3D NAND technology. Here’s the gist of my idea:

The Concept

• High-Speed Piezoelectric Module: Use piezoelectric crystals that can oscillate at frequencies in the gigahertz range as an ultra-fast data buffer. This could potentially give us read and write speeds way beyond what we have with current tech.
• Main 3D NAND Storage: Use 3D NAND for the main long-term storage. We all know it’s reliable and has a high capacity.

How It Would Work

1.  Writing Data: Incoming data would first go to the piezoelectric module at super high speeds.
2.  Transferring Data: The data would then be transferred to the main 3D NAND storage for long-term keeping.
3.  Reading Data: For reads, the system would first check the piezoelectric buffer for quick access. If the data isn’t there, it would pull from the 3D NAND storage.

Benefits

• Speed: This setup could drastically reduce latency and boost read/write speeds.
• Energy Efficiency: Piezoelectric materials might be more energy-efficient for rapid operations.
• New Storage Architecture: Combining the speed of piezoelectrics with the capacity of 3D NAND could create a super-efficient storage solution.

Challenges

• Tech Integration: Making sure the piezoelectric and 3D NAND components work seamlessly together.
• Cost: High-quality piezoelectric materials and the complexity of this setup might be pricey.
• Durability and Reliability: The materials need to handle high-frequency oscillations over long periods without wearing out.

Potential Impact

If we can make this work, it could be a game-changer for data centers, mobile devices, and industrial applications that need ultra-fast response times.

What do you all think? Could this actually work?

9183
 
 
The original post: /r/datahoarder by /u/Vile-The-Terrible on 2024-06-12 19:03:15.

I bought a NAS less than two years ago from B&H that had Seagate Ironwolf Pro 16tbs in it. One of the drives started to fail so I began the RMA process with Seagate. They charged to have expedited delivery but it took them two weeks to process my order before shipping it. They then send me a nonfunctional drive. I now have to go through the process of RMAing the RMA drive, and the kicker? I have to pay the return shipping on the drive that failed and the broken drive they sent me.

So as the title asks, are there any other companies I should be spending my dollars with going forward?

Edit: In case it wasn’t clear. I understand that it is standard practice to pay for return shipping on the drive you’re using the warranty for. The problem is paying shipping again to return the faulty drive they sent me.

9184
 
 
The original post: /r/datahoarder by /u/DetImplicitteSubjekt on 2024-06-12 12:48:20.

I'm a frequent internet user looking for an easy and reliable way to store my data. I write many personal notes daily, and I have a large number of photos (20k) on my Iphone. Additionally, I have over 700GB of various files, documents, and old system images.

I currently use icloud for my images. My files are either stored locally on my old Windows or in icloud drive on my Mac.

I own a Portable SSD, but i rarely use it. (still use 1,5 out of 2TB tho)

What storage solution would you recommend for my needs?

I've thought about something like this:

  1. Create a new Google account and purchase Google Drive storage.
  2. Upload all photos and icloud stuff to Google Drive, in a folder named "iCloud" for iCloud files and photos.
  3. Simultaneously, upload all files to the Portable SSD.
  4. Additionally, invest in a secondary SSD.
  5. Monthly backup the main SSD to the secondary SSD.
  6. Maintain a monthly routine of cleaning up iPhone photos, then upload them to both the Portable SSD and Google Drive.

What do you think? Will it be enough? Is it too much? Let me hear your thoughts. And please go easy on me😉

9185
 
 
The original post: /r/datahoarder by /u/Crafty_Future4829 on 2024-06-12 11:56:40.

I apologize in advanced as I know there are a lot related posts to buying refurbished/used hard drives for non critical data. On eBay, it seems you can get 16tb exos drives for around 160.00 or 10 per tb. They say refurbished with zero hours and the reseller (goharddrives) offers a 5 year warranty.

Where do these hard drives from? Do they really have zero hours as opposed to having smart data wiped? Is this a good deal?

I read another post being able to get used drives for around 5 dollars per tb.

What is your sweet spot and reliable ebay sellers you would buy from?

Thanks

9186
 
 
The original post: /r/datahoarder by /u/ImaginaryCheetah on 2024-06-12 20:19:19.

good afternoon,

i'm not particularly familiar with wget so i'm asking the experts for assistance...

my current string of wget -A "*(USA)*zip" -R -m -p -E -k -K -np -nd -w 2 https://target works fine to pull all files with "(USA)" included in the name, but i'd like to understand if i can get more complicated.

does the -R arg work in conjunction with -A or would it override ?

for example, wget -A "*(USA)*zip" -R "*(Demo)*" [etc] would this return all files with "(USA)" in the title unless it also had "(Demo)" in the file name ?

is there any way of passing a boolean criteria through wget ? "get files with (Europe) in the title if same file with (USA) in the title doesn't exist" kind of thing ?

i expect that might require grabbing a list of files with lftp and processing it instead of having wget do that kind of logic.

9187
 
 
The original post: /r/datahoarder by /u/Tiredcardinal on 2024-06-12 19:43:51.

I have a 4 year old WD elements 1.5tb hdd It was working completely fine until the starting of this year but all of a sudden it started giving buggy video outputs and I couldn't run games on it anymore So recently I decided to format it , only for it to be faulty a day later now I can copy any data to hdd normally but I cannot transfer anything for it Any suggestions or troubleshoots are welcome and appreciated

9188
 
 
The original post: /r/datahoarder by /u/cs_legend_93 on 2024-06-12 19:38:56.

Hello all

I'm trying to settle a debate with a other redditor.

Assume both of these scenarios are basic setups without drive pools or raid.

The redditor suggests and recommends that using a single SSD with a partition for the OS and Data drive is more safe than using dual drives.

I believe using a partitioned SSD will both double your chances of drive failure due to writes and reads, and it will make it a pain in the ass to restore the backup.

I suggested using two separate SSDs, and the redditor said that this indeed doubles the chances of drive failure due to two drives. I disagreed and said that it halves it due to the decreased reads and writes. I also suggested that dual drives will make it easier to restore a backup drive if one fails.

Which scenario is better?

In both scenerios there are backups, like a mirrored drive using Acronis disk imager or something like that. But it's still not a drive pool or RAID

Here is the debate: https://www.reddit.com/r/buildapc/s/ArnZMYuQSD

View Poll

9189
 
 
The original post: /r/datahoarder by /u/ResponsibilityIll888 on 2024-06-12 17:38:07.
9190
 
 
The original post: /r/datahoarder by /u/Electronic-Papaya on 2024-06-12 17:15:11.

I'm trying to free up some PCIe slots in my system so I'm switching from 2 x LSI 2008 to a single 16 port LSI 9300-16i. I'm running Linux and using mdadm to run 3 arrays.

Before I attached any of my drives with data and arrays configured, I installed the card with no drives attached and update the firmware to 16.00.12.00, and made sure it was in IT mode. As a test I then connected one of my arrays to the controller and booted the system. After booting up, the drives were detected fine (4 x 2TB) but the array was gone. It appears that the metadata was erased, mdadm didn't recognize any drive as being part of an array.

I was able to recover the data and the array by following the steps here: https://raid.wiki.kernel.org/index.php/Recovering_a_damaged_RAID

However, if I reboot again the same thing happens, the metadata is lost and mdadm does not recognize that the drives are part of an array. I do not have this issue with the older LSI controllers.

Any idea what's going on here? When I created the array I used the entre drive, so I did not create a Linux RAID partition on each drive. The array is configured using /dev/sda to /dev/sdd, and not /dev/sda1 to /dev/sdd1. Not sure that has anything to do with it.

Edit: Seems to be an issue with the controller and GPT partition tables. As a test, I created an array with a couple 120gb SSD's I had laying around. I created the array with the whole drive, rebooted, and after a reboot the array was still present. I realized my other drives are configured as GPT. So I wiped the SSDs, switched them to GPT and again created the array. This time after a reboot the array was gone, mdadm does not recognize the drives as being part of an array. Not sure how to fix this.

9191
 
 
The original post: /r/datahoarder by /u/Seagate_Surfer on 2024-06-12 15:51:39.

Hi r/DataHoarder crowd! We love your sub and would like to rev up another giveaway with the permission of the mod team.

The prize is: one 16TB IronWolf Pro Hard Disk Drive

How to enter:

Just reply to this post once with a top-level comment response on the following topic:

What kind of loyalty program matters to you for a company? Is drive capacity, price point, excellent customer service, etc. the highest priority for you? Please include the phrases RunWithIronWolf and Seagate in your comment.

Selection process/rules

One entry per person. Using alt accounts will result in a ban. New accounts created after this post went live are not eligible. Entries are open until June 27, 2024 at 23:59 UTC. We will use a random raffler utility to filter out top level comments (that is, top-level replies to this post, and not to another comment, and not on any cross-posts). The tool will remove duplicate usernames, sort the list, and grab the randomly chosen username, at which point the winner will be contacted within a day or so of the giveaway ending. Winners will have 48 hrs to get us their physical address and contact details for shipping (no PO boxes). Any person who does not reply in time loses their spot and everyone moves up a tier. For example: the 1st place person does not respond, so the 2nd place person gets contacted. Seagate will use the information strictly for shipping purposes only and will ship the drive directly. We reserve the right to edit this post including this process and these rules without notice. This is reddit, after all.

Geographic restrictions:

Our policy is for our forums and Reddit giveaways to be global where local shipping and/or giveaway restrictions/current world events don’t prevent us, however we are basing the below list of eligible counties from previous giveaways, as some counties have unique restrictions (e.g. the obvious shipping restrictions to Russia and Belarus currently)

US

Canada (will require a basic skills-based question if winner is chosen by law)

Brazil

South America

United Kingdom

Germany

France

Iberia

Australia

New Zealand

Korea

India

Malaysia

Singapore

China

9192
 
 
The original post: /r/datahoarder by /u/Vanillinn on 2024-06-12 15:39:15.

Hello! I only have my toe dipped in data hoarding but I've been meaning to buy the orico 6228US3 dual bay non cloning docking station.

Here is my use case:

  • Use my extra 3.5" 500GB HDD for my modded wii u (to be formatted for the wii u) it will be where my games are installed while running the console
  • back up photos, videos, school files, and game ROMs I have on future storage expansions (1-4TB at a time)
  • have two drives connected to my pc at the same time
  • on a very tight budget as a student

In the future, I would like to build my own NAS if I get into bigger data hoarding and archiving. It's something I want to do but do not have the budget for right now. I have a lot of files currently but 1 or 2 more extra drives would be enough. If I could, I'd like to own two copies of my files.

I have read some posts here on the subreddit raising issues on cooling, partitions, and corrupted drives. On cooling, would it still be an issue for a dual bay? I don't exactly understand the formatting problem of the partitions so I'd like to ask about that too. On corrupted drives, I'll always wait for the disk to stop spinning before ejection and also not move it as much as possible.

Is a dual bay docking station a good low cost entry into data hoarding or is there an alternative more appropriate for my use case?

9193
 
 
The original post: /r/datahoarder by /u/BulgyBoy123 on 2024-06-12 13:23:05.

Hi guys,

I'm looking for a software that is able to batch reverse search some images.

I downloaded all of my pinterest boards, but some of the files are really tiny. I wouldn't mind being able to download bigger versions of said files without having to spend weeks doing that manually.

9194
 
 
The original post: /r/datahoarder by /u/ThePixelHunter on 2024-06-12 13:22:06.
9195
 
 
The original post: /r/datahoarder by /u/marinluv on 2024-06-12 11:36:03.

Came across this post yesterday. That user and u/RaiderBDev are archiving Reddit data. The data is around 3-4Tb roughly from what I have seen.

The GitHub Repo to archive and access the data: Here

To download and search Subreddit and user data manually: Here

Post on r/pushshift for the 2.5TB dump: Here

9196
 
 
The original post: /r/datahoarder by /u/AtwoodEnterprise on 2024-06-12 11:18:53.

I have an SEO SaaS and I store a lot of data for keywords and backlinks.

My database is about 50gb and I’m currently paying about $80/mo for my webhosting VPS and up to 75gb of database storage.

Currently, I use a lot of API’s to pull SEO data down because it’s such a hassle to store that much data, but over the next year I want to try and up my game a bit.

So I’m gonna try and start off by increasing my keyword, and my backlink data to around 1 TB total if possible. The majority of the data is gonna be due to the backlink data of course.

Would anyone have any estimates for how much something like this might cost? Or what factors I should consider/ask when obtaining pricing?

9197
 
 
The original post: /r/datahoarder by /u/mamba_regime19 on 2024-06-12 11:07:25.

Is there an easy way to download larger songs/mp3's from Hulkshare?

Found some old shows no longer available anywhere else that I wouldn't mind having but the files stop playing after like 10 mins so my downloaders stop as well.

Is there anything like YoutubetoMp3 or SoundcloudtoMp3 that works?

Saw some older posts from 11years ago....not sure if anybody has figured it out since then.

Thanks!

9198
 
 
The original post: /r/datahoarder by /u/No-Balance-8038 on 2024-06-12 07:49:41.

I've got currently 5 backup 3.5" HDD disks which are planned to be stored offsite. And backup is earliest weekly. But the real issue is that according to the SATA specification, after 50 times replugging the disk, both the Server slot for 3.5" or the HDD itself can fail!

What do I do? Put the disks in a USB enclosure and never remove it again?

But what about keeping the HDD running in proper temperature? My current usb cases do not have fans, and USB is not as reliable. I know turning off UASP helps for that, but I am still kind of disappointed that I am not meant to regularly take backups from SATA disks.

I know theres also eSATA, but I am not sure which combination of that would be reliable! I could basically instead have a PCIe eSATA card and eSATA cases.

I even found two suitable cases https://www.turtlecase.eu/5-hdd/40-35-hard-drive-hdd-5-capacity-long-slots-waterproof-hd-5-turtle-case.html or https://www.feldherr.com/feldherr-esd-schaumstoff-set-euro-box-mit-16-faecher-fuer-3-5-zoll-festplatten/a-61567

What do you guys do recommend?

9199
 
 
The original post: /r/datahoarder by /u/green314159 on 2024-06-12 06:08:24.

A Seagate Barracuda 8TB drive with warranty expiration date sometime in 2025 has failed the other day. What is the best way to get in contact with technical support and get the warranty and replacement going? Sorry if this is the wrong place on Reddit to ask

9200
 
 
The original post: /r/datahoarder by /u/SolidShowerr on 2024-06-12 03:40:30.

I want to clone a landing page, I do everything right and download the files, but when I try to go to the index, it just opens the "Loading screen" from that page but not the page, it stays stuck on "Loading". Do you have a solution? Thank you

https://preview.redd.it/9ywe6e7ja26d1.jpg?width=1898&format=pjpg&auto=webp&s=abcc0cf04b1daff441293aa03c74b5b0dc36e770

view more: ‹ prev next ›